TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Show HN: DataHub, open source datasets for Artificial Intelligence

14 点作者 theo31超过 7 年前
Hi HN!<p>I am an undergrad student trying to build interesting things with AI.<p>Recently, I was looking for a dataset I could use for a new project. I realized that it is really frustrating to go through all the government websites (with terrible UX) just to find some usable dataset.<p>I set out to build a GitHub for datasets, named DataHub. Right now, we have more than 1000 datasets from Montréal and New York City, with more cities coming soon (and possible government agencies).<p>All of this is wrapped into a powerful search. It&#x27;s a breeze to find a dataset to work on.<p>I&#x27;d be interested to know what you guys are looking at when searching for datasets and if DataHub could be of any help!<p><a href="https:&#x2F;&#x2F;datahub.now.sh&#x2F;" rel="nofollow">https:&#x2F;&#x2F;datahub.now.sh&#x2F;</a>

4 条评论

alex_g超过 7 年前
Very cool! The interface is really beautiful and I would love if data.gov was formatted like this.<p>What is your strategy for acquiring these datasets? Are you going to pull them from data.gov and other websites?<p>What happens if those datasets are changed on data.gov, will you detect that?
评论 #15163161 未加载
colobas超过 7 年前
There&#x27;s a typo. &quot;Datahub has more then 1200 datasets&quot; should read &quot;Datahub has more than 1200 datasets&quot;
评论 #15174681 未加载
bruth超过 7 年前
Nice work. You definitely should get in touch with the Dat Project folks. There are several of them on the core team and in the community who are actively scraping government websites for open data.
评论 #15161575 未加载
maz1b超过 7 年前
Would love to use this once there&#x27;s some kind of public health &#x2F; medical data of some sort!
评论 #15174518 未加载
评论 #15161571 未加载