TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Harvard puts metadata for 12M library items into the public domain

115 点作者 vgnet大约 13 年前

10 条评论

guelo大约 13 年前
"The records consist of information describing works—including creator, title, publisher, date, language, and subject headings—as well as other descriptors usually invisible to end users, such as the equalization system used in a recording. "<p>I'm having a hard time thinking of what could be done with this data besides a library catalog.
评论 #3887732 未加载
评论 #3888830 未加载
评论 #3887746 未加载
评论 #3887553 未加载
tar大约 13 年前
Why not link directly to the official press release: <a href="http://isites.harvard.edu/icb/icb.do?keyword=k77982&#38;pageid=icb.page498373" rel="nofollow">http://isites.harvard.edu/icb/icb.do?keyword=k77982&#38;page...</a>
gvozd大约 13 年前
It's about time they did this. Harvard's was about the only major library that didn't allow Z39.50 access to their full MARC21 records. As a private individual with a large rare and antiquarian book collection, I welcome the news, since I've found that Harvard sometimes has the only other copy of a book I'm cataloging. A few other libraries require you to jump through some hoops to get to the data (British National Library, for example), but Harvard was shutting everyone other than faculty and alumni out.
cbsmith大约 13 年前
I hate that the article title says "Big data for Books".<p>Here's a hint on how you can get a sense of whether you are dealing with "Big Data": <i>IF I CAN FIT IT ON A THUMB DRIVE, IT ISN'T BIG DATA</i>.
评论 #3888546 未加载
Jun8大约 13 年前
The direct links for API access and download (3.16GB) is given in the DPLA Dev Blog: <a href="http://blogs.law.harvard.edu/dplatechdev/" rel="nofollow">http://blogs.law.harvard.edu/dplatechdev/</a>
dfc大约 13 年前
It seems that bittorrent would be the logical choice for distributing the dataset. I wonder if this is an oversight or if they are not expecting many people to download the dataset...
评论 #3887589 未加载
kveykva大约 13 年前
I'm not sure who would need to actually fill out the submission form. But wouldn't this: <a href="http://aws.amazon.com/publicdatasets/#3" rel="nofollow">http://aws.amazon.com/publicdatasets/#3</a> be convenient for working with a data set like this?
sparknlaunch12大约 13 年前
Universities are doing some pretty cool stuff with data. Every tech uni is now getting their students to work on social media data analysis. More exciting than entity relationship diagrams...
ryan-guest大约 13 年前
It's going to be interesting to see what people build and/or analysis they do with this data.
esonderegger大约 13 年前
&#62;Finally, note that Harvard asks that you respect community norms, including attributing the source of the metadata as appropriate.<p>That's not what "public domain" means. If they wanted attribution, there are licenses for that. "Public domain" means that it belongs to all of us now. In that case, attribution is meaningless.
评论 #3887563 未加载
评论 #3887572 未加载
评论 #3887554 未加载