TechEcho

OVH Incident in Strasbourg

311 points by fvv over 7 years ago

40 comments

lode over 7 years ago
More info on Twitter from OVH's CEO: https://twitter.com/olesovhcom
and on https://twitter.com/ovh_support_en
"SBG: ERDF is trying to find out the default. 2 separated 20kV lines are down. We are trying to restart 2 generators A+B for SBG1/SG4. 2 others generators A+B work in SBG2. 1 routing room is in SBG1, the second in SBG2. Both are down."
"An incident is ongoing impacting our network. We are all on the problem. Sorry for the inconvenience."
"SBG: 1 gen restarted."
"RBX: all optical links 100G from RBX to TH2, GSW, LDN, BRU, FRA, AMS are down."
qwerty69 over 7 years ago
It started with all our SBG servers going down simultaneously. Approximately 1h later all our RBX servers went down as well including the OVH status page and all other OVH web applications. Either their SBG and RBX data centers are somehow connected or those are indeed two independent incidents.
arekkas over 7 years ago
I moved away from OVH after I paid 3 months in advance (~$300) for a server which burned down after 1 1/2 months. They did not issue any refunds (data, blood, sweat and tears were lost that day). I have been an OVH customer for 12 years.
Today, I'm glad to have moved away all my production environments as well.
wiz21c over 7 years ago
Damn, every emergency power supply I have encountered (the big ones with fuel and hundreds of batteries) has failed to start when it had to... Why is that?
fvv over 7 years ago
UPDATE: not all datacenters are down. It only seems that way in Europe because OVH routing hasn't been updated, so from our point of view everything is down, but really it is not :)
fxaguessy over 7 years ago
Network and RBX are UP again: https://twitter.com/olesovhcom/status/928556358353539072 (but SBG's datacenters are still being restarted)
therealmarv over 7 years ago
Wow, yesterday I was playing with their public cloud because I was considering choosing them. I had some connection problems with my private networking there (deleted it more than once) and opened a ticket. If it was me... sorry, haha. Not a good advertisement, but it can happen to anyone.
NiklasMort over 7 years ago
Can't wait to read the detailed followup on this in a few days; it is always interesting to see how such major outages happen.
jedisct1 over 7 years ago
Not "all datacenters". Only 2 of them. They have 22, not counting all the POPs.
drchaos over 7 years ago
This affects DNS as well, since domaindiscount24 (a rather large registrar in Germany) happens to host all three of their nameservers with OVH.
Just in case you wonder why your sites don't work, even if you host them somewhere else.
pmontra over 7 years ago
The status page is up again: http://status.ovh.net/
Here is the report so far:
-------------
FS#15162 - SBG
Attached to Project - Network
Task Type: Incident
Category: Strasbourg
Status: In progress
Percent Complete: 0%
Details
We are experiencing an electrical outage on Strasbourg site.
We are investigating.
Comments (2)
Comment by OVH - Thursday, 09 November 2017, 10:55AM
SBG: ERDF repared 1 line 20KV. the second is still down. All Gens are UP. 2 routing rooms coming UP. SBG2 will be UP in 15-20min (boot time). SBG1/SBG4: 1h-2h
Comment by OVH - Thursday, 09 November 2017, 12:04PM
Traffic is getting back up. About 30% of the IP are now UP and running.
-------------
VPSes are still marked as red in the dashboard. I can't access mine.
oelmekki over 7 years ago
Btw, note for those who use OVH as their ISP like me (this is a thing in France): your connection works, only the DNS servers do not.
Fix (debian-like):
    sudo apt-get install bind9
Then put in /etc/resolv.conf, if it's not already there:
    nameserver 127.0.1.1
This runs a local nameserver that you use directly for resolving.
Oh, obviously, you need resolving to install the resolver :) Hope you have a 4g connection available.
Alternatively, you can just use google dns:
    nameserver 8.8.8.8
    nameserver 8.8.4.4
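
A rough sketch, not part of the original comment, of how one might check which resolvers a machine is configured to use and whether a second one is available as a fallback. The function names `list_nameservers` and `has_fallback_resolver` are hypothetical helpers invented for this illustration; the only assumption about the system is the usual `nameserver <address>` line format of a resolv.conf-style file.

```shell
#!/bin/sh
# Print the resolver addresses configured in a resolv.conf-style file.
# Usage: list_nameservers [file]   (defaults to /etc/resolv.conf)
# Commented-out entries (lines starting with # or ;) are ignored because
# their first field is not exactly "nameserver".
list_nameservers() {
    awk '$1 == "nameserver" { print $2 }' "${1:-/etc/resolv.conf}"
}

# Succeed (exit status 0) if the file lists at least two distinct resolvers,
# so that one unreachable resolver does not take all name resolution with it.
has_fallback_resolver() {
    list_nameservers "$1" | sort -u | awk 'END { exit (NR >= 2 ? 0 : 1) }'
}
```

For example, `has_fallback_resolver || echo "only one resolver configured"` would warn when /etc/resolv.conf names a single resolver, which is the situation the comment above describes.
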
tyingq over 7 years ago
My OVH dedicated servers seem fine. Webservers, ssh, all working. All of them are in Canada.
qeternity over 7 years ago
All of our dozen or so bare metal boxes are up in GRA as well as all of our cloud instances. However object storage is down.
dx034 over 7 years ago
They now posted their explanation [1] but I don't buy it. I find it hard to believe that the RBX incident happened shortly after the SBG incident without any connection between these two. They should have redundant networking (at least that's what they say) so one corrupted DB in RBX shouldn't have brought down the whole DC (or 7 DCs according to their system). Maybe they pulled corrupt data from SBG because it was down but I don't believe that at the same time as a power failure, two redundant network nodes got corrupted without any notice. Otherwise wouldn't that mean that one hardware issue can also bring down a whole region?
[1] http://status.ovh.net/?do=details&id=15162&PHPSESSID=7220be21848b5db440d2cb66c5ee7e14
dx034 over 7 years ago
Some servers in GRA still appear to work if that's of any help. All data centres offline at once sounds more like an attack than a power failure in one location. According to them, there was a power failure in SBG but I don't see how that should affect routing in data centres several hundred miles away.
https://twitter.com/olesovhcom/status/928541667283623936
EDIT: Maybe related to the Cisco issue?
https://blogs.cisco.com/security/cisco-psirt-mitigating-and-detecting-potential-abuse-of-cisco-smart-install-feature
jedisct1 over 7 years ago
Details here: http://travaux.ovh.net/?do=details&id=28244
Apparently, the root cause of that issue is a critical software bug in Cisco NCS 2000 transponders.
dorfsmay over 7 years ago
Not "all"!
Maybe their main DCs, or their largest, but not all of them. I have virtual servers in their Quebec DC (BHS) and it hasn't gone down since the last time I rebooted it.
ashitlerferad over 7 years ago
I have 30+ servers on OVH. All are online.
xmichael99 over 7 years ago
This happens to Internap almost weekly... I always wondered why they never make it in the news.
dredmorbius over 7 years ago
How Complex Systems Fail
http://web.mit.edu/2.75/resources/random/How%20Complex%20Systems%20Fail.pdf
ever1 over 7 years ago
Detailed report: https://twitter.com/olesovhcom/status/928904373949919232
perlgeek over 7 years ago
A website that I host on ovh is up: https://sudokugarden.de/
ovh.com looks down for me too.
You can check it's hosted by OVH:
    $ whois $(dig sudokugarden.de +short)
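
The one-liner above assumes `dig +short` prints exactly one IP address. When a name has several A records or resolves through a CNAME, the output also contains extra lines. A hedged sketch of a more tolerant variant follows; `ipv4_only` and `whois_for_host` are hypothetical helper names made up for this illustration, and the whois field names matched (`netname`, `descr`, `org-name`/`OrgName`) are an assumption that varies between registries.

```shell
#!/bin/sh
# Keep only lines that look like IPv4 addresses, dropping CNAME targets
# and anything else that `dig +short` may print.
ipv4_only() {
    grep -E '^([0-9]{1,3}\.){3}[0-9]{1,3}$'
}

# Run whois on every A record of a hostname and show the lines that
# usually name the network owner. Requires network access plus the
# dig and whois tools; the matched field names are an assumption.
whois_for_host() {
    dig +short "$1" A | ipv4_only | while read -r ip; do
        echo "== $ip"
        whois "$ip" | grep -iE '^(org-?name|netname|descr):'
    done
}
```

Calling `whois_for_host sudokugarden.de` would then print one section per address instead of handing several arguments to a single whois call.
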
fapjacks over 7 years ago
Huh. I have services active on two dedicated machines from OVH in Canada, and I was logged into both via SSH all night, and didn't have any interruption at all.
r1ch over 7 years ago
Looks like only their routing / network was down. My servers just came back up and haven't experienced any power outage.
nstricevic over 7 years ago
I just moved 2 apps to OVH. So this was totally unexpected. My apps have been unavailable for more than 7 hours.
Does this happen often with OVH?
askmike over 7 years ago
My server hosted on OVH had some problems (DNS lookups) but has stayed up and works fine right now.
EDIT: Hosted in EU.
gizzlon over 7 years ago
Now this status page is down as well. Sucks to be them right now =/ (I'm in Europe)
pavlakoos over 7 years ago
I'm trying to find an ETA for solving the issue, but they didn't post it on Twitter.
Does anybody know the ETA?
stevenh over 7 years ago
My OVH servers in Canada and Australia are running fine.
My OVH servers in France are all inaccessible.
jagermo over 7 years ago
This has to be one of the least informative status pages I have ever seen.
treo over 7 years ago
Looks like they are starting to come back up. My VPS is accessible again.
aerovistae over 7 years ago
I had never heard of this company til I saw this post. Shrugged, thought, "huh, wonder who that's affecting."
Opened up Age of Empires II....no connection. Go to website for game servers..."Our provider, OVH, is down...."
Go figure.
oron over 7 years ago
not all of them, I have some servers in Canada, working OK
thejosh over 7 years ago
Sydney is fine.
KeitIG over 7 years ago
I imagine Mr Good Guy at OVH telling some others:
"guys we have a single point of failure in our architecture with SBG, maybe we should...
- naaah it's fine, we do not have time nor resources"
Then shit happens.
edit: I have no idea what is happening exactly, but OVH being what it is, it seems extremely weird that all datacenters "can" go down at the same time, and it looks like a serious architecture problem to me (or backup systems, like generators, not being correctly tested... whatever). I am really curious about the future explanation of what happened exactly.
edit2: Why all the downvotes? Even the status page of OVH is down, do not tell me it is good design. We are not here to be charitable, but realistic.
contingencies over 7 years ago
To make error is human. To propagate error to all server in automatic way is #devops. - @devopsborat
Sami_Lehtinen over 7 years ago
Title is misleading. Only RBX and SBG were affected.
06:15 UTC SBG servers failed.
OVH network weathermap: http://weathermap.ovh.net
Btw. First post: https://news.ycombinator.com/item?id=15660524
metafunctor over 7 years ago
Someone with access might wish to update the title of this post, because all OVH datacenters are definitely not down.
Hates_ over 7 years ago
Trending on Twitter with the hashtag #OVHGATE
https://twitter.com/hashtag/OVHGATE?src=hash