They generally send you an advance email. I just had to migrate our Jenkins server a week or two ago because of this; I received something like 15 days' notice on that one.<p>But obviously, if there's a hard failure, they won't always be able to give you as much warning as you'd want. Generally speaking, you should have accounted for this situation ahead of time in your engineering plans. Amazon EC2 doesn't have anything like vMotion; it's just a bunch of Xen virts.<p>If you're using the GUI, the first time you try a shutdown it will issue a normal request, but if you go back and try again while the first request is still pending, you should see the option for a forced ("hard") stop. Try that and give it some time; sometimes it takes an hour or two to go through. Otherwise, Amazon's tech support can help you.
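The same escalation can be scripted against the API. A minimal sketch (the retry convention here is illustrative; `StopInstances` with `Force=True` is the API equivalent of the console's hard-stop option):

```python
def build_stop_request(instance_id, attempt):
    """Build kwargs for EC2's StopInstances call.

    First attempt: a normal, graceful stop. Subsequent attempts set
    Force=True, the API equivalent of the console's hard stop (it
    skips the OS shutdown, so unflushed data may be lost).
    """
    params = {"InstanceIds": [instance_id]}
    if attempt > 1:
        params["Force"] = True
    return params

# With boto3 this would be passed along as, e.g.:
#   boto3.client("ec2").stop_instances(**build_stop_request("i-0abc", attempt))
```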
Remember kids, an EC2 instance is not a server. It's a process on someone else's server, and all of your data is stored in /tmp. Do plan accordingly.
OK, the key to working with AWS EC2 instances is to remember that they are ephemeral and can disappear at any point in time. If you're treating one like a traditional server that you have in a rack, you're doing it wrong. Just turn it off and start a new one. You are using a configuration manager (Puppet, Chef, etc.), aren't you?
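One way to make "turn it off and start a new one" painless is to bake the config-management bootstrap into the instance's user data. A hedged sketch, assuming a Puppet setup (the server hostname and role-based certname scheme are made up for illustration):

```python
def bootstrap_user_data(role, puppet_server="puppet.internal.example"):
    """Build a cloud-init user-data script that installs Puppet and
    enrolls a fresh instance on first boot. The hostname and the
    role-based certname scheme here are illustrative, not a standard."""
    return "\n".join([
        "#!/bin/bash",
        "apt-get update && apt-get install -y puppet",
        "puppet agent --server {0} --certname {1}-$(hostname)"
        " --onetime --no-daemonize".format(puppet_server, role),
    ])
```

Pass the result as the `UserData` parameter when launching the replacement, and the new node configures itself without any hand-holding.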
Not only do they send you an e-mail about this, they even have an API call for it: <a href="http://docs.aws.amazon.com/AWSEC2/latest/UserGuide/monitoring-instances-status-check_sched.html" rel="nofollow">http://docs.aws.amazon.com/AWSEC2/latest/UserGuide/monitorin...</a><p>Anyone who's surprised that this happens has not used EC2 very much. It is this way by design.
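Those scheduled events show up in `DescribeInstanceStatus` responses, so you can poll for them instead of waiting on the email. A sketch of a helper that pulls them out of a response dict (the sample payload mirrors the documented response shape; the instance ID is made up):

```python
def scheduled_events(response):
    """Extract (instance_id, event_code, description) tuples from a
    DescribeInstanceStatus response dict."""
    events = []
    for status in response.get("InstanceStatuses", []):
        for event in status.get("Events", []):
            events.append((status["InstanceId"],
                           event["Code"],
                           event.get("Description", "")))
    return events

# Sample payload in the shape the API documents:
sample = {"InstanceStatuses": [{
    "InstanceId": "i-0abc",
    "Events": [{"Code": "system-reboot",
                "Description": "scheduled reboot",
                "NotBefore": "2014-01-01T00:00:00Z"}],
}]}
print(scheduled_events(sample))  # [('i-0abc', 'system-reboot', 'scheduled reboot')]
```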
I think I'm missing something. Why isn't Amazon sorting this out behind the scenes so that any failing hardware is seamlessly replaced and the user is none the wiser? Am I expecting too much?
I'm working with another team of people who haven't yet tried working with cloud servers, and one of the things they're struggling with the most is that cloud servers need to be thought of as disposable. They can't easily digest the idea that servers can and will go down randomly for no known reason.<p>I think Amazon needs to put a lot more effort into educating people about the best practices involved here: creating immutable and disposable servers, making it easier (e.g. via console access) to create availability groups, etc.
I've gotten one of those emails and thought: OK, it's going to reboot, not a problem for that instance, it has no persistent data I care about.<p>Then it kept running, but there was no way to reboot it from the EC2 console or over ssh, so that was a bit of a problem; I had to get support to do it.<p>Moral: reboot it yourself at a convenient time.
To work in AWS's system you must have redundant nodes, such that any single node can be rebooted without affecting the system as a whole.<p>Notification that your instance is on old hardware that has been deprecated is part of the price of doing business in this cloud system.<p>As others have noted, yes, it is a little tense (is this my production database or my continuous integration machine?). The email you get just gives you an instance ID, so you have to look it up.<p>But AWS has enough components to help you build resilient systems that, if you've done your job correctly, you shouldn't care about these messages beyond the labor of spinning up a replacement.
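Mapping that instance ID back to a human-readable name is a small lookup against `DescribeInstances`. A sketch that reads the `Name` tag out of a response dict (the sample payload follows the documented response shape; the ID and tag value are invented):

```python
def name_for_instance(response, instance_id):
    """Find the Name tag for instance_id in a DescribeInstances
    response dict; returns None if the instance or tag is absent."""
    for reservation in response.get("Reservations", []):
        for inst in reservation.get("Instances", []):
            if inst.get("InstanceId") != instance_id:
                continue
            for tag in inst.get("Tags", []):
                if tag.get("Key") == "Name":
                    return tag.get("Value")
    return None

# Sample payload in the shape the API documents:
sample = {"Reservations": [{"Instances": [{
    "InstanceId": "i-0abc",
    "Tags": [{"Key": "Name", "Value": "ci-worker-3"}],
}]}]}
```

So a maintenance email naming `i-0abc` resolves to `ci-worker-3`, and you know immediately whether it's the CI box or something you actually need to worry about.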
Reminds me of <a href="http://www.goodreads.com/quotes/379100-there-s-no-point-in-acting-surprised-about-it-all-the" rel="nofollow">http://www.goodreads.com/quotes/379100-there-s-no-point-in-a...</a>
This is somewhat unrelated, but what's the general consensus on the security of EC2 for <i>very</i> sensitive computation?<p>For example, I have a client who has some algorithms and data that are potentially quite valuable. EC2 and other AWS services would be a huge help with their project, but is there <i>any way</i> measures could be taken to ensure that no one - even Amazon employees - can get to their code and data?<p>Edit: devicenull makes some good points - I guess I had the CIA's $600 million AWS contract in my head when asking my question.
War story: I was once called in to scale an application that had been running on AWS for six or seven months and was failing under excessive traffic. Normally a good problem to have, but this turned into a difficult one because the application stored critical data on an EBS volume, and those, of course, can't be attached to more than one instance. The only solution was to move to increasingly larger instances until the application could be rewritten.
Moral: If you are on the "cloud", make sure your application design fits your infrastructure.
Once upon a time there was EC2, without EBS. It was actually a pretty good place to be. There was no ambiguity because everyone who used EC2 was given a lot of warnings about how they'd have to architect their systems to avoid critical failure. I wonder if the introduction of EBS has actually increased data loss because people aren't as paranoid about it.
What's the point of this entry?
Are we surprised that hardware fails?
I am the complete opposite of an EC2 fanboy, but every time they decided to shut down a machine they had the good taste to send us an email.