If anyone wants to use this for something public-service oriented:<p>Chicago is running for the 2016 Olympic games. About a month ago they released their official "bid book" in PDF form. The local papers gave it a look and wrote some fine stories, but a bunch of local journalists (myself among them) would like to extract the thing out into a Wiki so people could discuss and annotate it instead of just reading it in PDF form.<p>Link to the bid book: <a href="http://www.chicago2016.org/our-plan/bid-book/bid-book.aspx" rel="nofollow">http://www.chicago2016.org/our-plan/bid-book/bid-book.aspx</a><p>We were thinking of using MediaWiki as the wiki engine. One of us is currently running (the excellent) Chicago Elections Wiki over at <a href="http://chicagoelections.pbwiki.com/" rel="nofollow">http://chicagoelections.pbwiki.com/</a><p>We'd host, promote, annotate and fill out the wiki, the important thing is to move this from a pdf to an interactive, scannable, hypertext format so people can tear it apart.<p>We'd been talking about sneaking into PyCon and asking around if anyone there would be interested in working on this. It looks like this PDF miner is the start of something that could do this.