Hi HN!<p>I am an undergrad student trying to build interesting things with AI.<p>Recently, I was looking for a dataset I could use for a new project. I realized that it is really frustrating to go through all the government websites (with terrible UX) just to find some usable dataset.<p>I set out to build a GitHub for datasets, named DataHub. Right now, we have more than 1000 datasets from Montréal and New York City, with more cities coming soon (and possible government agencies).<p>All of this is wrapped into a powerful search. It's a breeze to find a dataset to work on.<p>I'd be interested to know what you guys are looking at when searching for datasets and if DataHub could be of any help!<p><a href="https://datahub.now.sh/" rel="nofollow">https://datahub.now.sh/</a>
Very cool! The interface is really beautiful and I would love if data.gov was formatted like this.<p>What is your strategy for acquiring these datasets? Are you going to pull them from data.gov and other websites?<p>What happens if those datasets are changed on data.gov, will you detect that?
Nice work. You definitely should get in touch with the Dat Project folks. There are several of them on the core team and in the community who are actively scraping government websites for open data.