TechEcho

8 comments

sciurus over 5 years ago

Before they'd be affected by Route 53 outages, CloudFront outages, and S3 outages. Now they can add Lambda outages to that list too.

It's also unclear how this actually solves the problem. Now if S3 in _either_ region is unavailable they'll start to fail 50% of uncached requests. I'm guessing they're using Route 53 health checks with some CloudWatch alarm to cut over to one region when they think the other is unhealthy. Presumably this is covered in the unavailable part 2.

I'm mildly skeptical that this is worth the increased risks plus the increased cost from running Lambda@Edge on cache misses.
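To put rough numbers on the "either region" point, here is a minimal sketch. It assumes uncached requests are split evenly across two regions with no failover, and that regional outages are independent. The function names and the 0.1% per-region unavailability figure are hypothetical, chosen only for illustration.

```python
from typing import Sequence


def fraction_failing(region_up: Sequence[bool]) -> float:
    """Fraction of uncached requests that fail when traffic is split
    evenly across regions and there is no failover (an assumption)."""
    down = sum(1 for up in region_up if not up)
    return down / len(region_up)


def prob_any_region_down(p_down: float, regions: int = 2) -> float:
    """Probability that at least one region is down, assuming independent
    outages with the same per-region probability (an assumption)."""
    return 1 - (1 - p_down) ** regions


# One of two regions down: half of the uncached requests fail.
print(fraction_failing([True, False]))

# Hypothetical 0.1% per-region unavailability: the chance that *some*
# region is down is roughly double the single-region figure.
print(prob_any_region_down(0.001))
```

Without a health-check-driven cutover, adding a second region this way roughly doubles the time during which some requests fail, even though each incident only affects half of the traffic.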

ReidZB over 5 years ago

If the "Cross-region replication" line in the picture is talking about the native S3 cross-region replication (as I assume it is), beware the replication latency in this setup. AWS recently released "replication with an SLA" for S3 [0], but at "99.99% of the objects will be replicated within 15 minutes", it's not a good enough SLA to rely on in setups like this.

Presumably Part 2 of this post will address this limitation, or maybe their product isn't affected by it. (I've never looked into Contentful, though maybe I will now -- blog post purpose achieved?)

I'm also not sure if "active-active" is the best name for this setup, since objects can't be written to the 2nd bucket (replication only goes one direction). Generally I associate "active-active" with "writes can happen anywhere", though maybe I'm wrong?

[0] https://aws.amazon.com/blogs/aws/s3-replication-update-replication-sla-metrics-and-events/
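As a back-of-the-envelope illustration of what that SLA leaves uncovered, here is a small sketch. It assumes the quoted 99.99%-within-15-minutes figure applies uniformly to every newly written object. The function name and the daily write volume are made up for the example.

```python
def expected_late_objects(objects_written: int, sla_fraction: float = 0.9999) -> float:
    """Expected number of objects NOT replicated within the SLA window,
    assuming the SLA fraction applies uniformly to all writes."""
    return objects_written * (1 - sla_fraction)


# Hypothetical volume: one million new objects per day.
late = expected_late_objects(1_000_000)
print(round(late))  # about 100 objects per day may take longer than 15 minutes
```

Objects that do meet the SLA can still be missing from the second bucket for up to 15 minutes, so reads served from the replica region may miss recently written objects either way.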

rynop over 5 years ago

Confused - why not use CloudFront Origin Groups? https://docs.aws.amazon.com/AmazonCloudFront/latest/DeveloperGuide/high_availability_origin_failover.html

Full disclosure, I've never used it, but I'm pretty sure this feature was created for the scenario you are trying to solve.
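For reference, an origin group is declared inside the distribution config. The sketch below only builds the dictionary and makes no AWS calls. The shape follows the `OriginGroups` field of a CloudFront `DistributionConfig` as I understand it, so check it against the API reference before relying on it. The helper name, the group ID, and the origin IDs are hypothetical.

```python
def origin_group(group_id, primary_id, secondary_id,
                 failover_codes=(500, 502, 503, 504)):
    """Build one origin group entry: requests go to the primary origin and
    are retried against the secondary when the primary returns one of the
    listed status codes."""
    codes = list(failover_codes)
    return {
        "Id": group_id,
        "FailoverCriteria": {
            "StatusCodes": {"Quantity": len(codes), "Items": codes},
        },
        "Members": {
            "Quantity": 2,
            # Order matters: the first member is the primary origin.
            "Items": [{"OriginId": primary_id}, {"OriginId": secondary_id}],
        },
    }


# Hypothetical origin IDs for two S3 buckets in different regions.
group = origin_group("s3-failover", "s3-us-east-1", "s3-us-west-2")
origin_groups = {"Quantity": 1, "Items": [group]}
print(origin_groups)
```

Note that this is primary/secondary failover per request, not an even split of traffic across both buckets.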

Zaheer over 5 years ago

Although it may make sense for this company, in the _majority_ of companies this would be over-engineering. S3 availability is some of the best in the business. If S3 is down, a good chunk of the internet is down with it.

advisedwang over 5 years ago

Google Cloud Storage has multi-region storage classes. Does S3 not have an equivalent of this?

knodi over 5 years ago

Sorry, I can't condone the use of AWS Lambda@Edge. There's no central log aggregation or alerting in the event of an issue.

zackbloom over 5 years ago

It's worth pointing out you can just point Cloudflare Load Balancing at two S3 buckets and call it a day.

jugg1es over 5 years ago

The architecture described here is pretty simple. The article states the fix was 20 lines of code. If this is the hardest problem you have to solve at work, I envy you.

Making S3 More Resilient Using Lambda Edge
