Theinfo.org - for people with large data sets

44 points by garret, over 17 years ago

7 comments

andreyf, over 17 years ago
Here's a data set that needs scraping, if anyone has the time to spare: speeches and PR communications, especially those from government officials. Visualizations of which talking points or ideas are spreading through which organizations (political parties, campaigns) and in which countries would be really interesting. Data mining to see which phrases or ideas "stick" in political blogs or on social websites would also be neat.
imsteve, over 17 years ago
A very useful idea. Many databases are such a pain to find that I'm sure most people don't know they exist.

Plus, I've got a ton of Python for doing complex manipulations of SQL data and schemas and for scraping the web that I've been meaning to share. Will add some to this site.
nonrecursive, over 17 years ago
Reminds me of swivel.com, a cool startup that allows you to "upload & explore" data. I talked to someone there and they mentioned comparing weather data to stock prices as an example.
kleevr, over 17 years ago
I hope their data visualization section matures.
ALee, over 17 years ago
Cool. When we created Fantasy Congress, we scraped the government in Java, and only afterwards did we find out that a friend of ours had scraped it in Perl years before and the Washington Post did it a year ago.

It would have saved us a lot of heartburn.
eVizitei, over 17 years ago
How cool is that? A cross-organization abstraction for the greater good.
btw0, over 17 years ago
I don't like the font Aaron Swartz always uses.