TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Summary of the AWS Service Event in the Sydney Region

104 pointsby mcbainalmost 9 years ago

11 comments

jaketayalmost 9 years ago
Our instances with ap-southeast-2 were out for around 12 hours. We used multiple availability zones and it didn&#x27;t prevent downtime at all. It&#x27;s very interesting the difference between AWS and Google outage responses. AWS is down for 12+ hours for some customers, force each customer to chase service level credits and sign off the postmortem with a nameless &amp; faceless &quot;-The AWS Team&quot;. Not one person at AWS was willing to take responsibility for this failure.<p>Whereas Google was recently down for less than 18 minutes. A VP at Google sent an email advising all affected customers, posted continuous updates to their status page, sent a further apology email at the conclusion, posted a service credit exceeding the SLA to all customers in the zone (without forcing customers to chase this themselves with billing) and lastly wrote one of the most well written post mortems I&#x27;ve ever seen. AWS has much to learn from Google about how to handle outages properly.
评论 #11870369 未加载
评论 #11869187 未加载
评论 #11870164 未加载
评论 #11870005 未加载
评论 #11869394 未加载
评论 #11871639 未加载
评论 #11870247 未加载
chrismorganalmost 9 years ago
Why, oh why do they report times in PDT rather than AEST (the zone of the affected area) or UTC (the standard everything else is based on)?<p>(Mutter, mutter, … something about Americans and their timezones … and northern hemispherians and their seasons …)
评论 #11868312 未加载
评论 #11869998 未加载
thomasfoster96almost 9 years ago
For those wondering what the &quot;severe weather&quot; was:<p>* <a href="http:&#x2F;&#x2F;www.smh.com.au&#x2F;national&#x2F;australias-wild-weather-sydneys-massive-storm-in-pictures-20160606-gpcyu7.html" rel="nofollow">http:&#x2F;&#x2F;www.smh.com.au&#x2F;national&#x2F;australias-wild-weather-sydne...</a><p>* <a href="http:&#x2F;&#x2F;www.abc.net.au&#x2F;news&#x2F;2016-06-07&#x2F;sydney-weather-storm-damaged-beachfront-homes-likely-dismantled&#x2F;7487056" rel="nofollow">http:&#x2F;&#x2F;www.abc.net.au&#x2F;news&#x2F;2016-06-07&#x2F;sydney-weather-storm-d...</a><p>* <a href="http:&#x2F;&#x2F;www.sbs.com.au&#x2F;news&#x2F;gallery&#x2F;pictures-wild-weather-savages-nsw-and-tasmania" rel="nofollow">http:&#x2F;&#x2F;www.sbs.com.au&#x2F;news&#x2F;gallery&#x2F;pictures-wild-weather-sav...</a>
bigiainalmost 9 years ago
Heh - I love the image in my head of the flywheel providing a few extra seconds of power to the coffee urn in the Blackwoods warehouse out the back and to all the fan heaters and big screen TVs in Toongabbie - just as Foxtel, Dominos, and Channel 9&#x27;s Nagios dashboards all start turning red and their ops staff phones start beeping.
daniel-levinalmost 9 years ago
&gt;&gt; The specific signature of this weekend’s utility power failure resulted in an unusually long voltage sag (rather than a complete outage)<p>It is false to assume that the state of the electrical supply is either on or off. This may come as a surprise, but not to me. In 2008, Eskom (South Africa&#x27;s electricity suppliers) experienced similar faults. The mains supply voltage is 220v here. At one point, some devices started to fail in my house, and others, such as lights, continued to work, but significantly dimmer. We measured 180v at the plugs. There were similar outages in my area last year, where an outright cut-off was preceded by voltage drops. This outage is interesting because it is an example of a bug owing to false assumptions!<p>There have also been incidences where certain cables have been stolen [1] and that has caused the opposite: voltage spikes.<p>[1] I couldn&#x27;t tell you which, or what kind, but I remember it has something to do with &quot;the neutral&quot;
评论 #11870285 未加载
PhantomGremlinalmost 9 years ago
I love reading about problems like these, it&#x27;s great that Amazon is forthcoming about them. There&#x27;s always some new wrinkle.<p>E.g. in this case, in normal operation, power from the utility power grid spins a flywheel. When the grid fails, the flywheel provides a holdover until Amazon&#x27;s diesel generators can start.<p>But in this failure the voltage from the grid sagged, rather than going away completely. The breaker isolating the flywheel from the grid didn&#x27;t open quickly enough. So power from the flywheel was sent out to the grid. It didn&#x27;t succeed in powering the grid for very long. Oops.
评论 #11867980 未加载
评论 #11869758 未加载
shermozlealmost 9 years ago
I&#x27;m a bit dubious about their &quot;if you used multi-AZ you&#x27;ll be fine&quot; when I had multiple outages in a multi-AZ Elastic Beanstalk application of over an hour. Methinks the load balancers aren&#x27;t as magical as they&#x27;d like to make out.
评论 #11871381 未加载
vacrialmost 9 years ago
I knew that this was a big event when it happened last Sunday, because the AWS service status page had a yellow triangle rather than a green tick. Usually when they have an outage, they just put a tiny blue &#x27;i&#x27; on the green tick...
评论 #11867848 未加载
clentaminatoralmost 9 years ago
Or, in summary, &quot;Uninterruptable power supply is actually interrupted.&quot;
评论 #11869738 未加载
mryanalmost 9 years ago
There is something Orwellian about referring to this as a &#x27;service event&#x27;.<p>I am reminded of &#x27;The Event&#x27; from That Mitchell and Webb Look [0]. We don&#x27;t talk about The Event.<p><a href="https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?v=wnd1jKcfBRE" rel="nofollow">https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?v=wnd1jKcfBRE</a>
voltagex_almost 9 years ago
I&#x27;d love to see a write up from the power company&#x27;s point of view.