科技回声
In memory of Aaron Swartz: a collection of PDFs from PDFtribute

74 points by inconditus, over 12 years ago

9 comments

houshuang, over 12 years ago
An important thing to remember is that many journals already permit self-archiving of publications (i.e. uploading a pre-print to a personal server or an institutional repository). In fact, about 70% of large publishers automatically allow some form of self-archiving, and for the others, many authors have succeeded by including a copyright addendum with the copyright-transfer document, retaining some rights (http://scholars.sciencecommons.org/). FAQ on self-archiving (http://www.eprints.org/openaccess/self-faq/).

At my university, we keep running workshops, and there are student staff in the library willing to help upload articles to the repository if you just e-mail them, but still, most academics won't take the five minutes to do this, even if they have the right.

This doesn't mean that the academic publishing system shouldn't change; it absolutely should. And there's also a lot of value in "liberating" academic publications that would otherwise not be free. But I hope people will become more aware of what is already possible, and legal!
houshuang, over 12 years ago
As long as these PDFs are exposed publicly (and linked to, which a tweet with or without #pdftribute will take care of), they will mostly be indexed by Google Scholar, which does a decent job of extracting metadata using heuristics etc.

Of course, it would be much better if people started embedding machine-readable metadata in PDFs (totally possible, see for example http://code.google.com/p/pdfmeat/), and if there were an agreed-upon bibliographic microformat that could be embedded in websites listing articles.

We also eventually need an open alternative to Google Scholar. GS is great, and I use it every day (and love that you can output BibTeX, for example), but it has no API (and will never have one because of deals with publishers), actively resists automatic access, is a black box in terms of how data is gathered, etc. Think of "Open Scholar" to Google Scholar as analogous to OSM vs GMaps. OSM might not look as pretty, or be as consistent in the beginning, but it enables a whole range of applications that GMaps doesn't. (And at least GMaps does have a fairly good API, even if it charges for overuse; GS has nothing.)

(These are just some thoughts I've had while experimenting with an open scholar workflow, trying to share as much of the "byproduct" of the research as possible, including rich notes and summaries, and my own bibliography with links to OA pubs where they exist: http://reganmian.net/wiki/researchr:start.)

Another thing I've found working on my project, where I try to expose OA links to as many pubs as possible and regularly rescan to see if they are still available (and still OA), is how quickly documents disappear. Hosting on private pages is convenient, but fragile. Ideally, people would upload papers to university repositories, subject repositories like arXiv.org, etc.
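
The periodic rescan described above could be sketched roughly as follows. This is a minimal illustration, not the code behind the project mentioned in the comment: the function names `http_status` and `rescan` are hypothetical, and it only checks whether a link still responds, not whether the paper is still open access.

```python
from typing import Callable, Dict, Iterable
import urllib.error
import urllib.request


def http_status(url: str, timeout: float = 10.0) -> int:
    """Return the HTTP status of a HEAD request, or 0 if the URL is unreachable."""
    try:
        request = urllib.request.Request(url, method="HEAD")
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # The server answered, but with an error status such as 404.
        return err.code
    except (urllib.error.URLError, OSError, ValueError):
        # DNS failure, timeout, refused connection, or malformed URL.
        return 0


def rescan(urls: Iterable[str],
           fetch: Callable[[str], int] = http_status) -> Dict[str, bool]:
    """Map each URL to True if it still answers with a 2xx status."""
    return {url: 200 <= fetch(url) < 300 for url in urls}
```

Passing the fetcher as a parameter keeps the sketch testable without network access; a scheduled job could call `rescan` on the stored links and flag the ones that come back `False`.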
smogzer, over 12 years ago
Cool effort.

But ... it's a score to JSTOR. It's unorganized.

But ... science is full of noise and crappy publications these days anyway. There are lots of ways to do the same thing, unproven, that exist only because everybody has to publish to stay relevant.

Now: how to really improve science? My suggestion: a big Python framework for each field of study, with implementations of the real algorithms and models for comparison, benchmarking and even real-life use.

See, as an example in the robotics field, ROS (Robot Operating System). ROS is like a basic glue framework where universities and individuals can publish their code. It's decentralized, and it has simulators, so scientists do not need to own the physical robots and can even compare (diff) results and algorithms very quickly.

The simulator can have an embedded browser + wiki + Quora that explains X.

Evolution: physical paper -> PDF -> simulator.
dutchbrit, over 12 years ago
Cool stuff, have you seen this yet?

http://pdftribute.net/
jychang, over 12 years ago
Looking through all the files that are uploaded, there are a lot more non-English documents than I expected; I randomly clicked on 2. It's amazing how there is support from around the world.
zopticity, over 12 years ago
It is a great loss to know such an entrepreneur has died because of legal problems. I myself have been in a similar situation. I feel that Aaron was a martyr for open access to academic papers. Unfortunately he will not see his impact on this modern and technology-dependent world.

R.I.P. Aaron Swartz!
houshuang, over 12 years ago
Nice short article: 10 things you can do to really support Open Access: http://phylogenomics.blogspot.de/2013/01/10-things-you-can-do-to-really-support.html
wreckimnaked, over 12 years ago
Nice idea!

Also, some metadata aggregation (title, author, tags, date published) capabilities wouldn't hurt anyone.
fgrt2, over 12 years ago
In memory of Swartz, 1 million ebooks for free download:

http://ebookoid.com