TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Netflix: Lessons We’ve Learned Using AWS

256 点作者 jeffmiller超过 14 年前

13 条评论

c2超过 14 年前
Sounds like the AWS architecture caused Netflix to write better code ( read: more durable, more fault tolerant ). Less assumptions baked in the code, and it will be easier to port it to a new data center/cloud architecture if AWS doesn't meet their needs.<p>As Netflix continues to scale, these changes will make managing that growth much easier.<p>A lot of you seem to take this post as being negative against AWS architecture. I take it more as a good collection of common things that you need to watch out for in distributed environments, specifically the dangers of assumptions within your current infrastructure which may change dramatically as you scale.
jemfinch超过 14 年前
Their "Chaos Monkey" approach reminds me of an excellent paper on "Crash Only Software": <a href="http://goo.gl/dqDII" rel="nofollow">http://goo.gl/dqDII</a><p>The best way to test the uncommon case is to make it more common.
评论 #2013719 未加载
评论 #2013707 未加载
评论 #2013699 未加载
briandoll超过 14 年前
This reads like the 'fallacies of distributed computing' paper (<a href="http://en.wikipedia.org/wiki/Fallacies_of_Distributed_Computing" rel="nofollow">http://en.wikipedia.org/wiki/Fallacies_of_Distributed_Comput...</a>).<p>While the likelihood of failure (or added latency, impacting upstream changes, etc.) is greater in large-scale distributed environments for which you do not control vs. your home-grown datacenter, those scenarios are just facts of life in distributed environments.<p>An awesome side effect of hosting an app in a cloud environment is that you must face up to those fallacies immediately or they'll eat you alive.
评论 #2013961 未加载
aristus超过 14 年前
I'm pretty sure "session-based memory management" should be "memory-based session management", ie they kept user session state in memory.
评论 #2013317 未加载
wccrawford超过 14 年前
I want a Chaos Monkey, too!<p>Actually, that was my first reaction, but after thinking for a moment, that isn't really a reliable way to test. If you make changes to something, you don't know for sure if the chaos monkey hit while you were testing a certain thing or not. Proper unit tests would seem to be a lot more useful.
评论 #2013304 未加载
评论 #2013325 未加载
评论 #2013072 未加载
评论 #2013068 未加载
Jd超过 14 年前
Basically the gist is: You need to be prepared for anything to stop working at any time.<p>The tone of this post indicates to me that the criticism and problems experienced by Netflix with AWS are understated, which I can understand given their position as a flagship AWS customer, etc.
评论 #2013343 未加载
评论 #2015946 未加载
wglb超过 14 年前
Interesting: <i>The Chaos Monkey’s job is to randomly kill instances</i><p>Another way to say "If it ain't tested, it's broken".
kondro超过 14 年前
Hardware is always going to fail eventually. Moving to AWS caused NetFlix to write better code to deal with these failures.<p>Failures were always going to happen, even in their own datacentre. What they have now is a more fault-tolerant system which should have less downtime overall.
byteclub超过 14 年前
If you do decide to adopt your very own pet Chaos Monkey in your next project, make sure you ARE able to gracefully degrade your service in case of failures. Otherwise your customers will see the monkey in action, manifested by "we'll be back shortly" messages. It's easier said than done, since a lot of the time all of us forget to write (or feel lazy, or have no idea how to properly handle) the "else" statements in case of errors/unavailable services/unreachable databases.<p>Otherwise, good idea. It forces you to think about the perils of distributed environment from the very beginning, as opposed to leaving it to be an afterthought.
dochtman超过 14 年前
Lesson they have not yet learned: including a HTML title tag in their Blogger templates.
jasonkester超过 14 年前
I love the idea of setting up a fully working system on AWS, then repeating all traffic from your live site over to it to see how it stands up under load.<p>No need to simulate traffic for testing purposes. Here's our <i>actual</i> traffic. All of it.<p>Nice.
ergo98超过 14 年前
Reading both this entry and the one that explained why they went with AWS, I'm left confused about why they ever went to AWS in the first place.
评论 #2013333 未加载
评论 #2013111 未加载
评论 #2013192 未加载
评论 #2013592 未加载
评论 #2013664 未加载
sabat超过 14 年前
I'll bet other companies (e.g. Heroku, Dropbox) that use AWS/EC2 would have similar things to say.<p>I did have this one question, being a guy with an IT background: they expected stability? Really? I always expect host/app/system failure, and am pleasantly surprised when it doesn't happen.
评论 #2013483 未加载
评论 #2020185 未加载