I was looking for these types of sources this week to populate a document database. One I ended up using for a demonstration was the "startup company information" hosted at <a href="http://jsonstudio.com/resources/" rel="nofollow">http://jsonstudio.com/resources/</a> (apparently an extract from CrunchBase, mentioned in Jen's blog post).<p>I naively thought I could just grab a pile of tweets or something, but most public APIs require registration as a developer.<p>One quick tip, if you're dealing with JSON dumps as a series of objects (e.g., {} {} {}) that you want to wrap in an array (e.g., [{}, {}, {}]), is to "slurp" them into jq (<a href="https://stedolan.github.io/jq/" rel="nofollow">https://stedolan.github.io/jq/</a>):<p><pre><code> $ jq -s '.' companies.json > companies-array.json</code></pre>