TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Scaling in the presence of errors–don’t ignore them

2 pointsby r4umover 4 years ago

1 comment

jeffreygoestoover 4 years ago
&quot;That’s why almost ever network service gets run in a loop—restarting immediately upon crashing, hoping the fault was transient. It often is.&quot;<p>Same line of thoughts as <a href="https:&#x2F;&#x2F;ferd.ca&#x2F;the-zen-of-erlang.html" rel="nofollow">https:&#x2F;&#x2F;ferd.ca&#x2F;the-zen-of-erlang.html</a><p>While hardware transients exist (look up Google papers about DRAM failure frequencies in their datacenters), you more probably run into an inconsistent internal software state and restarting may be better and faster than trying to find out which part of the internal state you still can trust and which not.