BigQuery has two unusual technical features that make it uniquely suited to hosting public data like this:<p>1. It has a separate "storage tier" and "compute tier", so we can all run queries against the same dataset at the same time.<p>2. The "compute tier" is 100% shared, so I can use BigQuery even if I just want to run a few queries.<p>I hope we'll see more projects like this. Having public data available in SQL is a great thing. SQL is easy enough that a lot of people can figure it out, but powerful enough to do real work.
A lot of this data can also be browsed by mere mortals at <a href="http://openbeta-contracts-explorer.usaspending.gov/" rel="nofollow">http://openbeta-contracts-explorer.usaspending.gov/</a> . More information is available at <a href="https://openbeta.usaspending.gov/" rel="nofollow">https://openbeta.usaspending.gov/</a> too.
Thanks for sharing Anton!<p>Fixing the link for mobile: (GitHub doesn't render the link)<p>- <a href="http://nbviewer.jupyter.org/github/antontarasenko/gpq/blob/master/notebooks/contracts_intro.ipynb" rel="nofollow">http://nbviewer.jupyter.org/github/antontarasenko/gpq/blob/m...</a><p>Some quick charts with DataStudio (<a href="https://medium.com/@hoffa/showing-off-the-new-free-google-analytics-data-studio-with-reddit-aprils-gilded-comments-for-ebe965dbbb15" rel="nofollow">https://medium.com/@hoffa/showing-off-the-new-free-google-an...</a>):<p><a href="http://i.imgur.com/ZHOkrN2.png" rel="nofollow">http://i.imgur.com/ZHOkrN2.png</a>