TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

A Beginner's Guide to Scaling to 11M Users on Amazon's AWS (2016)

253 点作者 febin超过 6 年前

13 条评论

chmod775超过 6 年前
Here&#x27;s some anecdata from someone serving 1.5 million monthly users with a realtime (chat etc.) web app. Disregarding the file serving functionality, this is the entire infrastructure:<p>- The frontend is running on a 30 bucks&#x2F;mo VPS with cycles and RAM to spare. - There&#x27;s 2 DB servers for 5 bucks &#x2F; mo for redundancy.<p>This thing would scale pretty linearly in theory, so for 10 million users just add like 4 servers.<p>So about 200 bucks &#x2F; mo to run your average startup&#x27;s web app at 10 mil users.<p>Alternatively write bad software and throw 50k&#x2F;mo at AWS I guess.
评论 #18763109 未加载
评论 #18763038 未加载
评论 #18762980 未加载
评论 #18763648 未加载
评论 #18763021 未加载
altmind超过 6 年前
&gt; Amazon S3 is an object base store. &gt; Highly durable, 11 9’s of reliability.<p>My eye twitched when I read this, considering that I remember more than one aws outage that affected s3 availability&#x2F;reliability.<p>AWS own pages mention 10 nines for &#x2F;durability&#x2F;, however, I doubt this durability number, this level is almost unachievable by any end customer product. Could it be some clever durability accounting?<p>I&#x27;m quite disappointed on AWS service status reporting when the service may be disrupted so so badly, to the point of not being serve a single request, however status pages not mentioning problems at all, or just reporting &quot;increased error rates&quot;.
评论 #18762877 未加载
评论 #18767897 未加载
bufferoverflow超过 6 年前
One important thing it doesn&#x27;t discuss is the insane AWS costs compared to the other clouds and even regular self-managed solutions running on cheap VPSes and dedicated servers.
评论 #18763882 未加载
评论 #18761619 未加载
评论 #18762643 未加载
评论 #18762736 未加载
NightlyDev超过 6 年前
And just remember, public clouds are expensive. If you need to handle millions of requests per second then there is a very high probability that you can afford a dedicated infrastructure team and save a ton of money while working just as well.<p>Just look at the insane(!!!) bandwidth costs...
评论 #18761996 未加载
评论 #18762277 未加载
评论 #18762185 未加载
评论 #18761952 未加载
sethammons超过 6 年前
They say &quot;you need to add monitoring, metrics and logging&quot; at the 500k user mark. That should be happening with user 0 with few exceptions that I can think of.
评论 #18762812 未加载
packetslave超过 6 年前
This is a written summary (like all High Scalability articles) of this video: <a href="https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?v=vg5onp8TU6Q" rel="nofollow">https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?v=vg5onp8TU6Q</a>
评论 #18761649 未加载
ac29超过 6 年前
Some of this advice seems like wildly premature optimization. Splitting into multiple hosts and&#x2F;or using a database service at 10 users?<p>At 10 users, you should be focusing on growth, not fussing with your tech stack.
munns超过 6 年前
I wrote the first version of this deck back in 2013: <a href="https:&#x2F;&#x2F;www.slideshare.net&#x2F;AmazonWebServices&#x2F;scaling-on-aws-for-the-first-10-million-users" rel="nofollow">https:&#x2F;&#x2F;www.slideshare.net&#x2F;AmazonWebServices&#x2F;scaling-on-aws-...</a> and it since then went on to become one of the most delivered talks by AWS across the world. I thought I&#x27;d pop into this thread and answer some questions&#x2F;help provide guidance as to why it exists. Note that first deck was mostly tossed together while on an Amtrak from NYC to Philly in an early morning, so excuse the typos and graphics glitches.<p>Generally speaking we&#x27;ve&#x2F;I&#x27;ve seen a number of customers struggle with pre-optimization of their infrastructure and&#x2F;or just not knowing what they should think about and when, when it comes to scale and architecture patterns. Think of this deck as an 80&#x2F;20 rule for the most general and basic of people. Also, 2013 was a very very different world in the cloud and in infrastructure than today.<p>This deck was all pre-Docker&#x2F;k8s, pre-Lambda&#x2F;FaaS, pre-alot-of-cool-db-tech. However as a general starting point a lot of it still holds just fine. You <i>should</i> probably start with a relational DB if you have a more traditional background using them. You <i>probably</i> don&#x27;t need autoscaling Day 1 (but it&#x27;s nice via things like ElasticBeanstalk).<p>Someone commented that metrics&#x2F;logging&#x2F;monitoring is a Day 0 thing, and it absolutely is, but it is the kind of thing most companies skimp out on until very late in the game. Not pushing for excellence in this area will block you from success down the line. The same is hiring someone with dedicated Ops&#x2F;SRE responsibilities, and&#x2F;or making sure someone on the team has the right experience in that space.<p>DB growth&#x2F;scaling is now a lot more transparent than it used to be, but I still see people doing less than ideal things with them. This is an area the industry needs to be sharing more best practices on (while still respecting some of the best practices from decades of SQL).<p>On costs; today some of the biggest sites in the world run on a public cloud. They do that for many reasons, but cost effectiveness at scale is pretty well proven when measured the right way. Most math ignores people and opportunity cost and that&#x27;s what get&#x27;s you more than metal and power. There is also now today the wealth of managed services that equate to replacement of people with a returned value greater than the people cost (essentially the ROI on managed services outpaces the value of you doing it yourself greatly). The SaaS ecosystem is also vastly larger than it was in 2013. I hope to never run a logging cluster, monitoring systems, backup tools, etc, myself again.<p>Anyway, glad to see this deck still kicks around and that people are discussing it. Happy to try and answer more questions on this. - munns@
karmakaze超过 6 年前
A glaring omission in AWS offerings is a NewSQL database that let&#x27;s you start with SQL, have redundancy and scale horizontally having consistent reads the whole time. Scaling to 10M+ users and then switching datastores makes for good story writeups because it&#x27;s hard. It&#x27;s also avoidable.
评论 #18765640 未加载
squigs25超过 6 年前
I don&#x27;t think it&#x27;s necessarily a good idea to use SQL for everything. If you are constantly inserting records and don&#x27;t need relational structures, DynamoDB might be a better choice. DynamoDB scales quickly and cheaply, so it&#x27;s potentially the preferable option if it works in your architecture.
评论 #18762890 未加载
debarshri超过 6 年前
I think the guide does not work when you have less users and more data, for instance if you are a b2b saas provider. I think the way you architect an infrastructure should be driven by the value of the user.
bradknowles超过 6 年前
(2016)
评论 #18761306 未加载
koehler超过 6 年前
Is this still valid ?
评论 #18761874 未加载