Relevant: <a href="http://www.usenix.org/event/osdi04/tech/full_papers/candea/candea.pdf" rel="nofollow">http://www.usenix.org/event/osdi04/tech/full_papers/candea/c...</a><p>Microreboot – A Technique for Cheap Recovery<p>"A significant fraction of software failures in large-scale Internet systems are cured by rebooting, even when the exact failure causes are unknown. However, rebooting can be expensive, causing nontrivial service disruption or downtime even when clusters and failover are employed. In this work we use separation of process recovery from data recovery to enable microrebooting – a fine-grain technique for surgically recovering faulty application components, without disturbing the rest of the application."
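<p>To make the idea in the abstract concrete, here is a minimal Python sketch of what "separation of process recovery from data recovery" could look like. It is not taken from the paper: the names (SessionStore, CartComponent, CatalogComponent, Application, microreboot) are all hypothetical, and retrying the call once after the restart is my own simplifying assumption. The point is only that components keep their important state in a store outside themselves, so one faulty component can be thrown away and recreated without touching the others.

```python
class SessionStore:
    """Hypothetical state store that lives outside every component.

    Because the data is kept here, restarting a component loses nothing
    that matters (data recovery is separate from process recovery).
    """

    def __init__(self):
        self._data = {}

    def get(self, key, default=None):
        return self._data.get(key, default)

    def put(self, key, value):
        self._data[key] = value


class CartComponent:
    """Hypothetical component; its only private state is disposable."""

    def __init__(self, store):
        self.store = store
        self.wedged = False  # stands in for an unknown in-memory fault

    def add_item(self, user, item):
        if self.wedged:
            raise RuntimeError("cart component is in a bad state")
        cart = self.store.get(user, [])
        self.store.put(user, cart + [item])
        return len(cart) + 1


class CatalogComponent:
    """Second hypothetical component, used to show it is left undisturbed."""

    def __init__(self, store):
        self.store = store

    def describe(self, item):
        return f"item: {item}"


class Application:
    def __init__(self):
        self.store = SessionStore()
        self.factories = {"cart": CartComponent, "catalog": CatalogComponent}
        self.components = {n: f(self.store) for n, f in self.factories.items()}
        self.restarts = {n: 0 for n in self.factories}

    def microreboot(self, name):
        # Recreate only the named component; the store and the other
        # components are not touched.
        self.components[name] = self.factories[name](self.store)
        self.restarts[name] += 1

    def call(self, name, method, *args):
        try:
            return getattr(self.components[name], method)(*args)
        except Exception:
            # Assumption of this sketch: restart the component and retry once.
            self.microreboot(name)
            return getattr(self.components[name], method)(*args)
```

In this toy version, a fault in the cart component is cured by recreating just that object, while the catalog instance and the stored carts carry on as before. A real system would also need to decide which components to restart together and whether a retry is safe.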