TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Ask HN: Where is Aaron Swartz's federal court record dump?

33 点作者 jsjohns2超过 3 年前
In the USA, public access to federal court records is largely behind a paywall known as PACER. In 2009, Aaron Swartz downloaded somewhere between 1% and 25% of those records (~750GB) [1][2].<p>Where are those documents?<p>[1] https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;Aaron_Swartz#PACER [2] https:&#x2F;&#x2F;www.aaronswartzday.org&#x2F;ny-times-pacer-project&#x2F;

2 条评论

schoen超过 3 年前
I don&#x27;t remember whether Aaron&#x27;s downloads were added to this database or not (which means this isn&#x27;t technically an answer to your question), but you can access a lot of PACER documents through<p><a href="https:&#x2F;&#x2F;www.courtlistener.com&#x2F;recap&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.courtlistener.com&#x2F;recap&#x2F;</a><p><a href="https:&#x2F;&#x2F;free.law&#x2F;recap" rel="nofollow">https:&#x2F;&#x2F;free.law&#x2F;recap</a>
caseyscottmckay超过 3 年前
I don&#x27;t have a source for this, but I think I remember reading a long time ago that Swartz also downloaded and released a significant amount of WestLaw legal documents.<p>A few years ago I was looking for state law databases and came across a blog post by a professor (at Stanford I believe) discussing S3 buckets with state law databases. I couldn&#x27;t figure out how to access it, so I emailed the professor and he gave me a link to a site called Public Resource Law or Free Public law or something ( domain was public.law.resource or resource.law or something like that). Anyway it had huge zip files (&lt; 100 GB) for all federal law and all law for all 50 states (opinions, statutes, rules, regulations, jury instructions etc). I played around with the data and found WestLaw editors notes in several of the documents, making me think this was part of Swartz&#x27;s WestLaw data dump. The data dump was taken down roughly a year after I found it.<p>I think this data dump is also what courtlistener.com is built on, because courtlistener.com popped up soon after the data dump disappeared from that site.