I wonder what the search story is. One technology that really does deliver, and has totally impressed me, is Lucene/ElasticSearch. I'm used to all sorts of hyperbolic claims, but holy shit, ElasticSearch just delivers. We tossed in about 40M documents from a SQL Server DB, and not only did it require less resources (a 30%? reduction in size), the queries are beyond anything that'd be approachable using SQL Server. And I've only touched the surface, using it as a pure plug-n-play setup.<p>With DocumentDB, not having a local version severely limits what I'd consider this for. Losing that flexibility is a big deal. Maybe this is just a limited preview and they haven't build the management side for local installs.
<a href="http://azure.microsoft.com/en-us/documentation/articles/documentdb-introduction/" rel="nofollow">http://azure.microsoft.com/en-us/documentation/articles/docu...</a><p>> Want to edit or suggest changes to this content? You can edit and submit changes to this article using GitHub.<p>Pretty remarkable given Microsoft's approach to open source in the 1990s that they're now using a service built around Linus's bespoke open source version control system to allow people to suggest changes to their documentation.
Ad hoc queries using SQL like syntax. No need to define indexes.<p>Javascript execution within database. Stored procedures, triggers and functions can be written with Javascript. "All JavaScript logic is executed within an ambient ACID transaction with snapshot isolation. During the course of its execution, if the JavaScript throws an exception, then the entire transaction is aborted."<p>Pricing is based on "capacity units". Starts with $22.50 per month (this includes 50% preview period discount). One capacity unit (CU) gives 10GB of storage and can perform 2000 reads per second, 500 insert/replace/delete, 1000 simple queries returning one doc.<p>In order to see pricing details, change the region to "US West":
<a href="http://azure.microsoft.com/en-us/pricing/details/documentdb/" rel="nofollow">http://azure.microsoft.com/en-us/pricing/details/documentdb/</a><p>Very interesting addition to Microsoft offering. I was actually just yesterday wondering if they have any plans for this kind of service. Table Storage is quite primitive and Azure SQL on the other hand gets expensive when you have lots of data.<p>One potential "problem" with this is the bundling of storage capacity and processing power. If I understand this correctly, I would need to buy 10 CUs per month to store 100GB of data even if I'm not very actively using that data.
<i>DocumentDB utilizes a highly concurrent, lock free, log structured indexing technology to automatically index all document content. This enables rich real-time queries without the need to specify schema hints, secondary indexes or views.</i><p>How does that work? Isn't that going to incur a major performance hit? If not, why don't other databases get rid of indexes?<p>Also, if anyone from MS is reading, <a href="http://azure.microsoft.com/en-us/documentation/articles/documentdb-introduction/" rel="nofollow">http://azure.microsoft.com/en-us/documentation/articles/docu...</a> links to <a href="http://azure.microsoft.com/en-us/documentation/articles/documentdb-concepts/" rel="nofollow">http://azure.microsoft.com/en-us/documentation/articles/docu...</a> which is a 404 error.
I liked everything about it until I saw the API for the Python client. What a catastrophe.<p>I pray Microsoft is looking for Python developers: <a href="https://gist.github.com/whalesalad/2142f0075c6896f4547c" rel="nofollow">https://gist.github.com/whalesalad/2142f0075c6896f4547c</a>
"All JavaScript logic is executed within an ambient ACID transaction with snapshot isolation. During the course of its execution, if the JavaScript throws an exception, then the entire transaction is aborted."<p>Have I missed something, or have MS delivered a novel and valuable feature? I'm not aware of support for transactions across documents in other NoSQL platforms. I'd be grateful if someone has any experience or better information in that regard, thanks.
A quick comparison between DocumentDB vs MongoDB: <a href="http://daprlabs.com/blog/blog/2014/08/22/azure-documentdb/" rel="nofollow">http://daprlabs.com/blog/blog/2014/08/22/azure-documentdb/</a>
If I understand correctly, their multi-document ACID transaction support is a big deal. The only other NoSQL/NewSQL systems I'm aware of with that ability are FoundationDB and Google Spanner/F1.
What are the limits of DocumentDB? You know, like max size of database, max size of document, max number of documents per database, max. number of attributes per document, max. number of databases per DocumentDB account.<p>What's the max. duration of database query, max size of query result.<p>What kind of performance can be expected, does it decrease as the size of database increases or it remains constant?<p>I'm going to wait a few days until hype settles.
What are the size limits on a collection? Docs mention transaction support is offered only within a collection. Is a collection essentially limited to a single physical machine in the background or does it span across machines? It looks like in Standard Preview, the max collection size is 10GB.
Interesting: <a href="https://github.com/Azure/azure-documentdb-python" rel="nofollow">https://github.com/Azure/azure-documentdb-python</a> (it's empty for the moment, but glad to see first-party support for Python)
The @DocumentDB twitter links to a tutorial on DocDB: <a href="http://www.documentdb.com/sql/tutorial" rel="nofollow">http://www.documentdb.com/sql/tutorial</a>
Does anyone know how this compares to AWS DynamoDB[1] ?<p>[1] <a href="https://aws.amazon.com/dynamodb/" rel="nofollow">https://aws.amazon.com/dynamodb/</a>
Another case of Not Invented Here syndrome from Microsoft. One wonders why they couldn't just take the open source and very well architected RavenDB <a href="http://ravendb.net" rel="nofollow">http://ravendb.net</a> .Net Document DB and provide first class support for that within Azure.
A new "cool", locked-in service served on a silver platter by Microsoft to the brainwashed.<p>Everybody else uses open source on premises or their cloud of choice.