TechEcho

1 comment

"That’s why almost ever network service gets run in a loop—restarting immediately upon crashing, hoping the fault was transient. It often is."<p>Same line of thoughts as <a href="https://ferd.ca/the-zen-of-erlang.html" rel="nofollow">https://ferd.ca/the-zen-of-erlang.html</a><p>While hardware transients exist (look up Google papers about DRAM failure frequencies in their datacenters), you more probably run into an inconsistent internal software state and restarting may be better and faster than trying to find out which part of the internal state you still can trust and which not.

Scaling in the presence of errors–don’t ignore them

1 comment

Scaling in the presence of errors–don’t ignore them

1 comment