CouchDB and by extension Cloudant are really cool.<p>Change feeds and replication are first class citizens if you wish, not some side-feature.<p>This means 3 things for me:<p>* Can build your own cluster topology. Replicate in a star pattern, in a circle with overlapped neighbors, in a hierarchy. You decide. Replication is explicit and configurable.<p>* Ideal for real-time applications. Having client poll for changes can grind things down. CouchDB support continuous updates and changes feed. It can go via long polling, event sourcing, can still to GET's with a 'since' parameters. Up to the user.<p>* Having to handle merge conflicts explicitly. MVCC semantics is not hidden or hand-waved away with a timestamp-last-writer-wins-unless-your-ntp-is-messed-up. Conflicts are first class citizens as well and one needs to know how to handle. That is good for some case but bad in others. I like it. For example one can attach an application specific conflict resolver that knows how to solve conflicts for that particular database in an application specific manner. Riak is another database that handles explicit conflicts well.<p>BTW: Just noticed CouchDB 1.5 was just released.<p><a href="http://www.apache.org/dist/couchdb/notes/1.5.0/apache-couchdb-1.5.0.html" rel="nofollow">http://www.apache.org/dist/couchdb/notes/1.5.0/apache-couchd...</a><p>Will have to play with the new Admin UI and Node.JS query server.
I think it was Rich Hickey that said the document database is the worst of them all, because you are now married to that structure.<p>Having used Couchdb in production for two years, I have to agree with his analysis, and offer my own opinion that Couchdb is highly overrated. Not because it is not a good implementation of a document style database, but because the document store itself is not a good match for most use cases.<p>If the only requirement is a replicated JSON document store, it may work OK for you. But so would Riak, Postgres and some others.<p>If you need to update the data in those documents or ever need to query the data in ways you did not initially envision, you will quickly find yourself missing features which even traditional SQL databases are very good at. Development is slower.<p>Writing map/reduce for queries seems particularly cumbersome, particularly if you prefer not to use Javascript. And you have to plug them into a textarea in a webpage interface, or manually put them into Couchdb over http using curl or some library that abstracts this away. Either way it is a degree of separation that makes the data feel more out of reach than through a console interface like psql or mysql.<p>Consider the scenario where you want to update the value in an attribute on several thousand, or even just several documents that match some criteria. In SQL, you would simply jump in the console and in a few seconds or minutes complete that as a transaction with something like:<p>> update table set col=val where criteria.<p>There is no such feature in Couchdb. You will need to write code to filter and fetch each matching document, manipulate it as needed, then write the entire thing back. All to update a few bits that hopefully were not nested too deep as that really increases the complexity of the code you will need to write.<p>As memracom stated, the replication is not perfect. My experience even on a low latency network is the only safe way to ensure a client can immediately read back what they just wrote is to pass them through the likes of haproxy and use a sticky session. Otherwise you have a good probability of getting a 404 after a POST (create) or stale data after a PUT (update).<p>So for what it is worth, here is my advice on choosing a database from an ease of development standpoint:<p>1) has as many features as you can, even if you don't need them initially
2) has top notch libraries for your language / framework
3) has relation awareness - do not denormalize unless you must
4) supports consistency
5) supports in place updates - easily filter and change values (doesn't apply to Datomic)
6) has tools to make schema changes / reshaping data is easy, and can be done online<p>Maybe 2 years ago Couchdb was a great solution. But with memory and ssd storage being so cheap and so much innovation with traditional and NoSQL DBs, I don't foresee myself deploying Couchdb again. If I did need a place to dump some semi-structured data, I find Amazon's hosted offerings more attractive.
Meh...
Replication has been around for a long time. Couchdb was not anywhere near the first, in fact Couchdb was just leveraging capabilities that were in Erlang OTP for many years.<p>And replication is not perfect either. There are tradeoffs and corner cases that need to be understood. Riak is another cluster datastore that is written in Erlang OTP, like Couchdb, and the people who build Riak are quite open about the issues that they have to deal with and the corner cases. They often present at conferences and write blogs on these topics so if you really want to UNDERSTAND replication, google for blogs and conference presentations connected to Riak.<p>In any case, Couchdb is good, Riak is good, but even traditional RDBMSes like PostgreSQL are good and can do replication. In all these cases, the developers have wrestled with the math and computer science behind replication and have made something that mere mortals can use.
When I first read this I thought it said "Why Republican is Awesome". Now that would have been an entertaining read (not that the actual article isn't entertaining - I'm sure it is).