Diving Deep on S3 Consistency

126 points by themarkers, about 4 years ago

11 comments

swyx, about 4 years ago
I find this very light on the actual "diving deep" part promised in the title. There's a lot of self-congratulatory chest thumping, not a lot of technical detail. Werner of course doesn't owe us any explanation whatsoever. I just don't find this particularly deep.
rossmohax, about 4 years ago
Recent S3 consistency improvements are welcome, but S3 still falls behind Google GCS until they support conditional PUTs.

GCS allows an object to be replaced conditionally with the `x-goog-if-generation-match` header, which can sometimes be quite useful.
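
A minimal sketch of what a generation-matched conditional PUT buys you, written as an in-memory toy rather than a real GCS or S3 client. `ToyBucket`, `PreconditionFailed` and `demo` are hypothetical names, and the integer counter is a simplification standing in for the service-assigned generation. The sketch assumes that a generation of 0 means "the object does not exist yet", so matching on 0 gives create-only semantics.

```python
from typing import Dict, Optional, Tuple


class PreconditionFailed(Exception):
    """The stored generation did not match the caller's expectation."""


class ToyBucket:
    """Hypothetical in-memory bucket; not a real GCS or S3 client."""

    def __init__(self) -> None:
        # key -> (generation, data). Generation 0 means "no such object".
        self._objects: Dict[str, Tuple[int, bytes]] = {}

    def generation(self, key: str) -> int:
        return self._objects.get(key, (0, b""))[0]

    def get(self, key: str) -> bytes:
        return self._objects[key][1]

    def put(self, key: str, data: bytes,
            if_generation_match: Optional[int] = None) -> int:
        """Store data; with a precondition, fail unless the generation matches."""
        current = self.generation(key)
        if if_generation_match is not None and if_generation_match != current:
            raise PreconditionFailed(
                f"expected generation {if_generation_match}, found {current}")
        # Simplification: a counter stands in for the service-assigned generation.
        self._objects[key] = (current + 1, data)
        return current + 1


def demo() -> Tuple[bytes, bool]:
    bucket = ToyBucket()
    gen = bucket.put("config.json", b"v1", if_generation_match=0)  # create only
    bucket.put("config.json", b"v2", if_generation_match=gen)      # up-to-date writer
    try:
        bucket.put("config.json", b"v3", if_generation_match=gen)  # stale writer
        rejected = False
    except PreconditionFailed:
        rejected = True
    return bucket.get("config.json"), rejected
```

The second writer in `demo` still holds the generation it read before the first update, so its write is rejected instead of silently overwriting `v2`. That compare-and-swap behaviour is what makes the header useful for things like lock files or read-modify-write updates.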
valenterry, about 4 years ago
Here's what I take away from this post:

> We built automation that can respond rapidly to load concentration and individual server failure. Because the consistency witness tracks minimal state and only in-memory, we are able to replace them quickly without waiting for lengthy state transfers.

So this means that the "system" that contains the witness(es) is a single point of truth and failure (otherwise we would lose consistency again), but because it does not have to store a lot of information, it can be kept in-memory and can be exchanged quickly in case of failure.

Or in other words: minimize the amount of information that is strictly necessary to keep a system consistent, and then make that part its own in-memory, quickly failover-able system, which is then the bar for the HA component.

Is that what they did?
iblaine, about 4 years ago
Anyone else still seeing consistency problems w/ S3 & EMR? The latest AWS re:Invent made it sound like this would be fixed, but as of yesterday I was still using emrfs to correct S3 consistency problems.
wolf550e, about 4 years ago
AWS fixed S3 consistency in December 2020:

https://aws.amazon.com/about-aws/whats-new/2020/12/amazon-s3-now-delivers-strong-read-after-write-consistency-automatically-for-all-applications/
pawelmi, about 4 years ago
So it is both available and consistent (but perhaps only in a read-your-own-writes way?). What then about resilience to network partitions, referring to the CAP theorem? Did they build a super reliable global network, so this is never a real issue?
juancampa, about 4 years ago
Can someone elaborate on this Witness system OP talks about?

I'm picturing a replicated, in-memory KV store where the value is some sort of version or timestamp representing the last time the object was modified. Cached reads can verify they are fresh by checking against this version/timestamp, which is acceptable because it's a network+RAM read. Is this somewhat accurate?
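
The mental model above can be sketched as follows. This is a single-process toy under stated assumptions, not a description of how S3 actually implements its witness: `Witness`, `CachedStore` and `storage_reads` are hypothetical names, versions are plain counters, and replication and failover are left out entirely.

```python
from typing import Dict, Tuple


class Witness:
    """Hypothetical in-memory map from key to the latest written version."""

    def __init__(self) -> None:
        self._latest: Dict[str, int] = {}

    def latest(self, key: str) -> int:
        return self._latest.get(key, 0)

    def record(self, key: str, version: int) -> None:
        self._latest[key] = version


class CachedStore:
    """Durable storage fronted by a cache that is validated against the witness."""

    def __init__(self, witness: Witness) -> None:
        self.witness = witness
        self.storage: Dict[str, Tuple[int, bytes]] = {}
        self.cache: Dict[str, Tuple[int, bytes]] = {}
        self.storage_reads = 0  # counts reads that had to bypass the cache

    def write(self, key: str, data: bytes) -> None:
        version = self.witness.latest(key) + 1
        self.storage[key] = (version, data)
        self.witness.record(key, version)
        # The cache is deliberately left untouched to model a stale cache entry.

    def read(self, key: str) -> bytes:
        cached = self.cache.get(key)
        # Cheap freshness check: compare the cached version with the witness.
        if cached is not None and cached[0] == self.witness.latest(key):
            return cached[1]
        self.storage_reads += 1
        entry = self.storage[key]
        self.cache[key] = entry
        return entry[1]


store = CachedStore(Witness())
store.write("k", b"a")
first = store.read("k")    # cache miss, goes to storage
second = store.read("k")   # cache hit, validated by the witness
store.write("k", b"b")     # cache now holds a stale version
third = store.read("k")    # witness mismatch forces a storage read
```

In this toy, every read pays for one witness lookup, but only the reads whose cached version is out of date pay for a storage read.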
crashocaster, about 4 years ago
I would have been interested to hear more about the verification techniques and tools they used for this project.
MeteorMarc, about 4 years ago
And for those who use minio server, the self-hosted S3 storage: that has strong consistency, too.
nhoughto, about 4 years ago
Would love a dive (hopefully deep) into IAM, the innards of that must be some impressive wizardry. Surprising there isn't more around about the amazing technical workings of these foundational AWS products.
whydoineedthis, about 4 years ago
I'm confused... did you fix the caching issue in S3 or not?

The article seems to explain why there is a caching issue, and that's understandable, but it also reads as if you wanted to fix it. I would think it would be the headline, in bold font, if it was actually fixed.

For those curious, the problem is that S3 is "eventually consistent", which is normally not a problem. But consider a scenario where you store a config file on S3, update that config file, and redeploy your app. The way things are today you can (and yes, sometimes do) get a cached version. So now there would be uncertainty about what was actually released. Even worse, some of your redeployed apps could get the new config and others the old config.

Personally, I would be happy if there was simply an extra fee for cache-busting the S3 objects on demand. That would prevent folks from abusing it but also give the option when needed.
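
The stale-config scenario described above can be reproduced with a toy model of an eventually consistent store. This is only a sketch: `EventuallyConsistentStore`, its two replicas and the explicit `replicate` step are all assumptions made for illustration, and say nothing about how S3 stores or propagates data.

```python
from typing import Dict, List, Optional, Tuple


class EventuallyConsistentStore:
    """Hypothetical store where writes reach replica 0 first and others later."""

    def __init__(self, replica_count: int = 2) -> None:
        self.replicas: List[Dict[str, str]] = [{} for _ in range(replica_count)]
        self.pending: List[Tuple[int, str, str]] = []  # (replica, key, value)

    def put(self, key: str, value: str) -> None:
        self.replicas[0][key] = value
        for index in range(1, len(self.replicas)):
            self.pending.append((index, key, value))

    def get(self, key: str, replica: int) -> Optional[str]:
        return self.replicas[replica].get(key)

    def replicate(self) -> None:
        """Apply all queued writes in order, modelling replication catching up."""
        for index, key, value in self.pending:
            self.replicas[index][key] = value
        self.pending.clear()


store = EventuallyConsistentStore()
store.put("app.conf", "v1")
store.replicate()
store.put("app.conf", "v2")               # update the config, then redeploy

app_a = store.get("app.conf", replica=0)  # this instance sees the new config
app_b = store.get("app.conf", replica=1)  # this instance still sees the old one
store.replicate()
app_b_later = store.get("app.conf", replica=1)
```

Two instances started at the same moment end up with different configs until replication catches up, which is the uncertainty the comment is describing.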