TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Scalable but Wasteful, or why fast replication protocols are slow

93 pointsby hugofirthalmost 4 years ago

7 comments

rdwalmost 4 years ago
The hypothesis in the article could be correct (that industry is not adopting new academic innovations because they fail in the real world). Based on my experience in this industry, though, it could just be that there isn't a super strong connection between academia and the people implementing these kinds of systems. I've had many conversations with my academically-minded friends where they're astonished that we haven't jumped on some latest innovation, and I have to let them down by saying that the problem that paper was addressing is super far down our list of fires to put out. Maybe there are places where teams of top-tier engineers are free to spend 6 months every year rewriting critical core systems use un-battle-scarred new algorithms that might have 20% performance improvements, but most places I've worked would achieve the same result for far less money by spending 20% more on hardware.
评论 #27827390 未加载
评论 #27830478 未加载
评论 #27830851 未加载
评论 #27830746 未加载
评论 #27827046 未加载
luhnalmost 4 years ago
Honestly I think the answer is simpler: People don&#x27;t <i>need</i> better algorithms. Paxos and Raft are generally used to build service discovery and node coordination, these are not demanding workloads and overwhelmingly read-heavy. Even the largest deployments can probably be serviced by a set of modestly-sized VMs. Paxos and Raft are well-understood algorithms with a choice of battle-tested implementations, why would anyone choose different?<p>The whole section on &quot;bin-packing Paxos&#x2F;Raft is more efficient&quot; is strange, because people don&#x27;t generally bin-pack Paxos&#x2F;Raft—The bin-packing orchestrators are built off of Paxos&#x2F;Raft!
评论 #27827294 未加载
hugofirthalmost 4 years ago
Another thing which makes the Raft&#x2F;Paxos vs new-consensus-algorithm comparisons complicated is caching.<p>If your raft state machines are doing IO via some write through cache (which they often are) then having specific machines do specific jobs can increase the cache quality. I.e. your leader node can have a better cache for your write workload, whilst your follower nodes can have better caches for your read workload.<p>This may lead to higher throughput (yay) but then also leave you vulnerable to significant slow-downs after leader elections (boo).<p>What makes sense will depend on your use case, but I personally agree with the author that multiple simple raft&#x2F;paxos groups scheduled across nodes by some workload aware component might be the best of both worlds.
toolslivealmost 4 years ago
None of this actually matters. Consensus algorithms allow you to achieve consensus. Period. There&#x27;s no requirement whatsoever on what you&#x27;re getting consensus on. A consensus value could be _one_ database update, but it doesn&#x27;t need to be. It can also consist of 666 database transactions across 42 different namespaces.
LAC-Techalmost 4 years ago
&gt; The protocol presents a leader-less solution, where any node can become an opportunistic coordinator for an operation.<p>Does leader = master here? My first reaction is that this is a multi-master system but I can&#x27;t quite unpack &quot;opportunistic coordinator&quot;.
评论 #27827672 未加载
mistralefobalmost 4 years ago
So, why?
评论 #27826590 未加载
评论 #27826444 未加载
orangepandaalmost 4 years ago
Can you not with the clickbait patterns?<p>Actual title - why fast replication protocols are slow