So I'd modify these a bit. We run a very large AWS infrastructure as an engineering team (no dedicated ops). Rough sketches of several of these points, with made-up names and IDs throughout, follow the list.

1. Use CloudFormation only for infrastructure that largely doesn't change: VPCs, subnets, internet gateways, etc. Don't use it for your instances, databases, and so on; I can't stress that enough, or you'll end up in a place where updating them is risky. We have a regional migration (like a database migration) that runs in each region we deploy to and sets up ASGs, RDS, etc. It gives us control over how things change, e.g. when we need to change a launch configuration.

2. Use auto-scaling groups for your stateless front ends that don't have really bursty loads; auto-scaling isn't responsive enough for really sharp spikes (though not much is). Otherwise do your own cluster management if you can (though you should probably default to auto-scaling unless you can make a strong case against it). (ASG sketch below.)

3. Use different accounts for dev / qa / prod, not just different regions. Force yourself to put in the correct automation to bootstrap yourself into a new account or region (we run in 5 regions in prod and 3 in qa, and having that automation is a lifesaver). (Assume-role sketch below.)

4. Don't use IP addresses for things if you can help it; just create a private hosted zone in Route53 and map names that way. (Sketch below.)

5. Use instance roles, and in dev force devs to put their credentials in a place where they get picked up by the provider chain. Don't get into a situation where you are copying creds everywhere; assume they'll be picked up from the environment. (Sketch below.)

6. Don't use DynamoDB (or any non-relational store) until you have to (even though it is great). RDS is a great service and you should stick with it as long as you can (you can make it scale a long way with the right architecture, and bumping instance sizes is easy). IMO a relational store is more flexible than the alternatives since (at least with Postgres) you get transactional guarantees on DDL operations, which makes it easier to build correct migration logic. (Migration sketch below.)

7. If you are using CloudFormation, use troposphere: https://github.com/cloudtools/troposphere (example below).

8. Understand which instances need internet access and which don't, so you can either give them public IPs or put in a NAT. Security teams get grumpy (for good reason) when you expose machines to the internet that don't need to be, even if it's only outbound.

9. Set up ELB logging, and pay attention to CloudTrail. (Sketch below.)

10. We use CloudWatch Logs. It has its warts (and it's a bit expensive), but it's better than a lot of the infrastructure you see out there (we don't generally index our logs; we just need to be able to view them in a browser and export them for grep). It's also easy to get started with; just make sure your date formats are correct.

11. By default, stripe yourself across AZs if possible (and it's almost always possible). Don't leave it for later; take the pain up front and you'll be happy about it later.

12. Don't try to be multi-region at first if you can avoid it; just replicate your infrastructure into different regions (other than users / accounts etc.). People get hung up on being able to flip back and forth between regions, and it's usually not necessary.

edit: Track everything in CloudWatch, everything. (Metric sketch below.)
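
To make points 1 and 7 concrete, here's roughly what a troposphere template for the slow-changing networking layer looks like. The names, CIDRs, and AZ are invented for the example, not our real setup:

    # Minimal troposphere template for the networking layer that rarely
    # changes (point 1): a VPC, one subnet, and an internet gateway.
    from troposphere import Ref, Tags, Template
    from troposphere.ec2 import (InternetGateway, Subnet, VPC,
                                 VPCGatewayAttachment)

    t = Template()
    t.set_description("Base networking stack; changes rarely")

    vpc = t.add_resource(VPC(
        "BaseVPC",
        CidrBlock="10.0.0.0/16",
        EnableDnsSupport=True,
        EnableDnsHostnames=True,
        Tags=Tags(Name="base-vpc"),
    ))

    t.add_resource(Subnet(
        "PublicSubnetA",
        VpcId=Ref(vpc),
        CidrBlock="10.0.0.0/24",
        AvailabilityZone="us-east-1a",
        Tags=Tags(Name="public-a"),
    ))

    igw = t.add_resource(InternetGateway("BaseIGW"))
    t.add_resource(VPCGatewayAttachment(
        "IGWAttachment",
        VpcId=Ref(vpc),
        InternetGatewayId=Ref(igw),
    ))

    print(t.to_json())  # feed the output to CloudFormation

The win over raw JSON is that it's real Python: you get loops, functions, and a syntax error instead of a failed stack update.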
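
For points 2 and 11, a minimal boto3 sketch of an auto-scaling group striped across two AZs. Every AMI, subnet, and name here is a placeholder:

    import boto3

    autoscaling = boto3.client("autoscaling", region_name="us-east-1")

    # Launch configuration the group will use; AMI and instance type
    # are placeholders.
    autoscaling.create_launch_configuration(
        LaunchConfigurationName="web-lc-v1",
        ImageId="ami-12345678",
        InstanceType="m4.large",
        IamInstanceProfile="web-instance-role",  # instance role, per point 5
    )

    # The group spans subnets in different AZs (point 11), so losing
    # one AZ degrades capacity instead of taking you down.
    autoscaling.create_auto_scaling_group(
        AutoScalingGroupName="web-asg",
        LaunchConfigurationName="web-lc-v1",
        MinSize=2,
        MaxSize=10,
        DesiredCapacity=4,
        VPCZoneIdentifier="subnet-aaaa1111,subnet-bbbb2222",  # one per AZ
        HealthCheckType="ELB",
        HealthCheckGracePeriod=300,
        LoadBalancerNames=["web-elb"],
    )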
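
For point 3, the bootstrap automation boils down to assuming a role in the target account and doing everything through that session. The account ID and role name below are hypothetical:

    import boto3

    # Assume a role in the target account; from then on, all clients
    # built from this session act in that account/region.
    sts = boto3.client("sts")
    creds = sts.assume_role(
        RoleArn="arn:aws:iam::123456789012:role/BootstrapRole",
        RoleSessionName="qa-bootstrap",
    )["Credentials"]

    qa_session = boto3.Session(
        aws_access_key_id=creds["AccessKeyId"],
        aws_secret_access_key=creds["SecretAccessKey"],
        aws_session_token=creds["SessionToken"],
        region_name="eu-west-1",
    )

    ec2 = qa_session.client("ec2")
    print(ec2.describe_vpcs()["Vpcs"])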
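
For point 4, a sketch of creating a private hosted zone and pointing a stable name at an RDS endpoint instead of hardcoding addresses. Zone name, VPC ID, and endpoint are made up:

    import time
    import boto3

    route53 = boto3.client("route53")

    # Private zone attached to the VPC; only resolvable from inside it.
    zone = route53.create_hosted_zone(
        Name="internal.example.com",
        CallerReference=str(time.time()),  # must be unique per request
        VPC={"VPCRegion": "us-east-1", "VPCId": "vpc-12345678"},
        HostedZoneConfig={"PrivateZone": True, "Comment": "internal names"},
    )

    # A stable name for the database; when the endpoint moves, you
    # update one record instead of chasing IPs through configs.
    route53.change_resource_record_sets(
        HostedZoneId=zone["HostedZone"]["Id"],
        ChangeBatch={"Changes": [{
            "Action": "UPSERT",
            "ResourceRecordSet": {
                "Name": "db.internal.example.com",
                "Type": "CNAME",
                "TTL": 300,
                "ResourceRecords": [
                    {"Value": "mydb.abc123.us-east-1.rds.amazonaws.com"},
                ],
            },
        }]},
    )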
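
Point 5 in practice means never passing credentials explicitly. Assuming boto3 (the other official SDKs behave the same way), the default provider chain does the work:

    import boto3

    # No credentials in code. boto3 walks the default provider chain:
    # environment variables, then ~/.aws/credentials (dev laptops),
    # then the EC2 instance role metadata (prod). Same code everywhere.
    s3 = boto3.client("s3")
    for bucket in s3.list_buckets()["Buckets"]:
        print(bucket["Name"])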
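
For point 6, this is what I mean by transactional DDL making migrations easier: a hypothetical psycopg2 migration step where the schema change and the backfill either both commit or both roll back. Connection details are placeholders:

    import psycopg2

    conn = psycopg2.connect(host="db.internal.example.com",
                            dbname="app", user="app", password="secret")
    try:
        # "with conn" commits on success and rolls back on any
        # exception; Postgres runs the DDL inside that transaction too.
        with conn:
            with conn.cursor() as cur:
                cur.execute("ALTER TABLE orders ADD COLUMN region text")
                cur.execute("UPDATE orders SET region = 'us-east-1' "
                            "WHERE region IS NULL")
    finally:
        conn.close()

On MySQL the ALTER would commit implicitly and a failed backfill would leave you half-migrated; that's the flexibility I'm talking about.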
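
For point 9, turning on access logs for a classic ELB is one attribute call. Bucket and ELB names are placeholders, and the bucket needs a policy letting the regional ELB account write to it:

    import boto3

    elb = boto3.client("elb", region_name="us-east-1")

    # Ship access logs to S3 every 60 minutes.
    elb.modify_load_balancer_attributes(
        LoadBalancerName="web-elb",
        LoadBalancerAttributes={
            "AccessLog": {
                "Enabled": True,
                "S3BucketName": "my-elb-logs",
                "S3BucketPrefix": "web-elb",
                "EmitInterval": 60,
            },
        },
    )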
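
And for the edit: application-level metrics go to CloudWatch with put_metric_data. The namespace, metric name, and dimension below are invented for the example:

    import boto3

    cloudwatch = boto3.client("cloudwatch", region_name="us-east-1")

    # Emit a custom metric; CloudWatch creates it on first write, so
    # instrumenting a new code path is one call.
    cloudwatch.put_metric_data(
        Namespace="MyApp",
        MetricData=[{
            "MetricName": "OrdersProcessed",
            "Dimensions": [{"Name": "Region", "Value": "us-east-1"}],
            "Value": 1.0,
            "Unit": "Count",
        }],
    )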