"If this happened our cluster would become unavailable and may have trouble re-clustering."<p>This was basically the repeated experience I had which caused me to abandon etcd for the time being.<p>If it can barely ever heal, what the fuck good is it? And I found that it could barely ever heal. A 3-node CoreOS cluster I ran _always_ crashed when it attempted a coordinated update, and rarely could be repaired with the help of #CoreOS over hours.<p>Because CoreOS pushes out updates with versions of etcd incompatible with recent versions, the etcd cluster could never survive the upgrade.<p>Add this to the fact that the CEO of CoreOS told me in person that he expected them to be the _only_ Operating System on the internet, and I'm generally not along for the ride with CoreOS any longer.<p>Consul, Mesos, and Docker are looking good.<p>Anyone interested in this space should check out:<p><pre><code> https://github.com/CiscoCloud/microservices-infrastructure</code></pre>