TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Remus: possible high availability of apps through replicating virtual machines

17 pointsby ktomover 16 years ago

4 comments

glymorover 16 years ago
Brendan Cully's masters thesis has more detail <a href="http://www.cs.ubc.ca/grads/resources/thesis/Nov07/Cully_Brendan.pdf" rel="nofollow">http://www.cs.ubc.ca/grads/resources/thesis/Nov07/Cully_Bren...</a><p>This was also interesting: <i>"we believe that the high-frequency checkpointing mechanism we have engineered in support of Remus will have many other interesting applications, ranging from forensics and error recovery tools based on replayable history to software engineering applications such as concurrency-aware time-travelling debuggers."</i>
glymorover 16 years ago
The problem with these systems is they don't know what is significant state so they have to copy everything to the slave.<p>The way Remus gets round this is it bulk copies (upto 40 times a second) rather than on every change. So the master runs slightly ahead.<p>Terracotta is something similar for the JVM. I think they get round it by exploiting the fact the JVM knows what's going on so for example you could say I want only this field on a class to be replicated. (But I've never used terracotta so someone might have to correct me on that.)
hedgehogover 16 years ago
I imagine the performance hit is pretty substantial but for things like VOIP or messaging servers this will make real HA possible on commodity hardware. Pretty cool.
jacquesmover 16 years ago
someone should combine this with openmosix!