TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

DIY Book Scanner

340 点作者 bcaa7f3a8bbc将近 4 年前

30 条评论

subpar将近 4 年前
My first job out of college was scanning books for the Internet Archive down in the basement of the Library of Congress. Their scanning machines used a foot pedal to raise and lower the glass Platen, so I&#x27;d use one hand to flip the page and wiggle the cradle to get things nice and flat and the other would snap the photo. You can get pretty fast after a while, but boy is it mindless. Older books that had been rebound a couple times already were the hardest to work with as you have the least amount of margin. There&#x27;s a bunch of different sized dowels that we would put under the spine in the cradle so the glass could gain a couple millimeters of margin, just enough to avoid cutting off text. Worst case scenario the book had to be unbound in order to capture. I did get to flip through a lot of cool old illustrated catalogues like this: <a href="https:&#x2F;&#x2F;archive.org&#x2F;details&#x2F;illustratedcatal00keil&#x2F;page&#x2F;14&#x2F;mode&#x2F;2up" rel="nofollow">https:&#x2F;&#x2F;archive.org&#x2F;details&#x2F;illustratedcatal00keil&#x2F;page&#x2F;14&#x2F;m...</a>
评论 #27364676 未加载
评论 #27367717 未加载
评论 #27368010 未加载
评论 #27363996 未加载
评论 #27368089 未加载
评论 #27364926 未加载
评论 #27364193 未加载
评论 #27364297 未加载
zwayhowder将近 4 年前
I built one of these out of pine 2x4s and plywood. I thought it would be cheaper than buying one (I was wrong) but I&#x27;m also not a skilled woodworker and had to buy most of the tools.<p>It works quite well and I digitised dozens of textbooks I&#x27;d purchased and needed to reference but couldn&#x27;t carry around every day while finishing my masters. My one had 2 Nikon mirrorless cameras controlled via Pi-Scan. <a href="https:&#x2F;&#x2F;github.com&#x2F;Tenrec-Builders&#x2F;pi-scan" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;Tenrec-Builders&#x2F;pi-scan</a><p>I had a smaller toggle switch wired to the GPIO pins so I could click the scan next button without having to take my hands of the book. Once I got used to the workflow I could scan about 1000 pages per hour while watching Netflix.<p>I replaced it with a Czur scanner that isn&#x27;t as good, but is a lot smaller and is good enough for my less demanding needs now that I&#x27;m not doing a masters degree :D
评论 #27363603 未加载
评论 #27364379 未加载
flakiness将近 4 年前
At Japan in the meantime, people in book scanning community (that exists) often just cut the book spine and scan the pages using normal scanner, throw it away once all the pages are scanned.<p>People (rightly) value room spaces than books there. It&#x27;s called &quot;Ji-sui&quot; (scanning by oneself) and gear recommendation sites like [1] are abundant. Another reason of &quot;Ji-sui&quot; prevalence was the poor availability of ebooks, although that reason was less relevant today.<p>[1] <a href="http:&#x2F;&#x2F;monomania.sblo.jp&#x2F;article&#x2F;60578693.html" rel="nofollow">http:&#x2F;&#x2F;monomania.sblo.jp&#x2F;article&#x2F;60578693.html</a>
评论 #27364865 未加载
_virtu将近 4 年前
When I was in college the iPad had just come out. I was determined to save money so I snagged an iPad to use as my omnitextbook and built a scanner based upon one of the schematics on this site with a friend.<p>I would usually be the guy that made an email group for everyone to share notes and questions for classes pre all of the blackboard garbage, so I started leveraging those connections and would ask if anyone would let me borrow their book for a scanned version in return. My friends and I would have a book scanning party and would help to scan each others’ books. We’d grab some drinks, find some favorite albums and hang out all night until the wee hours taking turns scanning texts.<p>After one semester the setup paid for itself. I would supplement some texts with learning trackers like bitme before amazing resources came around like libgen. Good times.
评论 #27369421 未加载
fernly将近 4 年前
Nice to provide hardware hints and designs but geez that is almost the least of it. Cleverest hardware still only gets you a thumb drive full of page images. Now what? There needs to be a software workflow ending with a readable book in PDF, EBOOK or MOBI format, and there are many, many choices to be made along that path.<p>Edit: &quot;Finishing a book&quot; is discussed at a very superficial level here: <a href="https:&#x2F;&#x2F;vimeo.com&#x2F;user33752051" rel="nofollow">https:&#x2F;&#x2F;vimeo.com&#x2F;user33752051</a> at about 1:00:<p>&quot;In order to turn these raw images into an ebook, the very minimum you need to do is A, you need to rotate them, B you need to crop them down to use the page [?], and C you need to combine them into one document like a PDF... You can do OCR to make it searchable ... color correction... de-skewing, de-warping ...&quot;
评论 #27362769 未加载
评论 #27367155 未加载
评论 #27363428 未加载
评论 #27362690 未加载
评论 #27364253 未加载
评论 #27365448 未加载
评论 #27369258 未加载
评论 #27362827 未加载
评论 #27364159 未加载
david_allison将近 4 年前
I&#x27;m very interested in getting into archival (getting started this month after a few more conversations).<p>Your buy button[0] is broken. You&#x27;re potentially missing out on a few sales due to this.<p>Is 2x 4GB SD card sufficient for your purposes? I&#x27;ve been quoted 50MB TIFF images as a standard, and a lot of books wouldn&#x27;t fit without swapping SDs at that size.<p>[0] <a href="http:&#x2F;&#x2F;store.diybookscanner.org&#x2F;" rel="nofollow">http:&#x2F;&#x2F;store.diybookscanner.org&#x2F;</a>
评论 #27362706 未加载
评论 #27362974 未加载
timonoko将近 4 年前
If you have fast flatbed scanner, you can scan 300 pages in thirty minutes. Not worth the effort to build automation. Bigger problem was to sort out all errors and missed pages afterwards. Real-time display (from Imagemaqick) solved this problem:<p><pre><code> while true ; do for x in *.pnm ; do killall display display -rotate 90 $x &amp; done sleep 5 done</code></pre>
评论 #27366037 未加载
评论 #27368091 未加载
shard将近 4 年前
This really needs to be redesigned for ergonomics.<p>- Lever should have a button for capture<p>- Display should be visible while looking down<p>But now I see why destructive scanning (slicing the binding off and using a sheet feeding scanner) is so attractive. For any non-rare books, this is just too tedious and time consuming to go through for more than a few books.
评论 #27364339 未加载
评论 #27364368 未加载
dang将近 4 年前
One past thread, a long time ago:<p><i>DIY book scanning</i> - <a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=991897" rel="nofollow">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=991897</a> - Dec 2009 (7 comments)
djoldman将近 4 年前
Here it is in action:<p><a href="http:&#x2F;&#x2F;tenrec.builders&#x2F;quill&#x2F;guide&#x2F;scanning&#x2F;scan&#x2F;" rel="nofollow">http:&#x2F;&#x2F;tenrec.builders&#x2F;quill&#x2F;guide&#x2F;scanning&#x2F;scan&#x2F;</a>
azureel将近 4 年前
For anyone interested, there is also <a href="https:&#x2F;&#x2F;libreflip.org&#x2F;" rel="nofollow">https:&#x2F;&#x2F;libreflip.org&#x2F;</a> website about similar device.
dahart将近 4 年前
&gt; While there are some computer algorithms that can help dewarp the pages after capture, it is always more reliable to just capture flat pages in the first place.<p>I’m sure this is technically true, but curious how much it matters in practice today? Reading Google’s book scanning patents I found a description of a de-warper based on capturing a 3d depth scan of the book, which I assumed they were using in order to achieve the scale of scanning all books on earth. Capturing and de-warping a 3d depth scan would also be leagues more reliable than trying to do a purely 2d image based de-warp.<p>&gt; The lights must also be positioned to minimize glare and reflections.<p>For my personal photo scanning and archiving project, I used a polarizing filter on the light and on the camera in order to eliminate specular glare, it works amazingly well. Would that be impractical, and&#x2F;or not work as well on books for some reason?
usui将近 4 年前
These kinds of discussions need more real examples to accurately depict the tradeoffs of destructive vs non-destructive scanning, so I&#x27;ll add scans I personally made.<p>Here are two pages from Cracking the Coding Interview, 6th Edition, that I preferred over the digital versions I found online that were hard on the eyes because I disliked the black-and-white scans. Feel free to ask me about &quot;details&quot; in the process<p><a href="https:&#x2F;&#x2F;imgur.com&#x2F;2ZQFZ5p" rel="nofollow">https:&#x2F;&#x2F;imgur.com&#x2F;2ZQFZ5p</a><p>It&#x27;s entirely possible to accomplish post-processing without writing code if you have Adobe Photoshop.<p>I used a free-to-the-public bookscanner built by the Digital Archivists at Noisebridge in San Francisco to take pictures of all pages in my textbooks (it took a while). In Photoshop, you can record a macro to automatically crop to a rectangular region determined by just one or more points that are guaranteed to be on the page in every photo. The selection is made by the quick selection tool (selects similar pixels to the page color in the same region). With this macro recorded, you can run it in bulk through all files.<p>The textbook size was still large digitally (a gigabyte) because I wanted the highest quality possible for studying, but it beat having to carry heavy textbooks for sure. I also shared these files with friends and we were able to study without any physical textbooks for books that were not available digitally—it was amazing.<p>Personally I avoided all the deskewing technologies and preferred just pictures, all in color, as close to the real thing as possible, because Noisebridge&#x27;s scanner used two DSLRs and the pictures were high quality. It was better than converting everything to black-and-white for reading enjoyability. OCR through ABBYY FineReader.<p>Overall it gets more annoying the thicker the textbook is. If destructive scanning is acceptable, one can just buy the book, go to FedEx and ask them cut the spine off for $4 to convert it to loose-leaf, then run it through a document scanner such as ScanSnap ix500, which is much faster at around 25 pages&#x2F;min at its slowest<p>One really cool feature about Noisebridge&#x27;s scanner (picture below) was that you could view the camera&#x27;s viewfinder live in real-time, thereby speeding up iteration and catching errors much faster<p><a href="https:&#x2F;&#x2F;imgur.com&#x2F;4Pkdp1j" rel="nofollow">https:&#x2F;&#x2F;imgur.com&#x2F;4Pkdp1j</a>
tunesmith将近 4 年前
Are any of you part of book scanner clubs that might have a database of word counts of famous fiction books? I&#x27;ve found several lists online but it&#x27;s not a wide selection of books - I&#x27;d imagine book scanners might have more. I&#x27;d be happy to share the database I&#x27;ve cobbled together.
评论 #27363742 未加载
totetsu将近 4 年前
Last year I bought a czur book scanner that looks kind of like a lamp to try and archive some 100 year old books I had limited access to. The resolution of the camera was so low I ended up balancing my phone on top and getting better images just using it as a light.
评论 #27363048 未加载
ebr4him将近 4 年前
The store seems to be down, any idea how much it costs?
nanna将近 4 年前
Anyone have thoughts on the Easy Book Scanner design by David Landin?<p><a href="https:&#x2F;&#x2F;www.instructables.com&#x2F;Book-Scanner-Low-cost-easy-to-make-1000-pages-an-h&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.instructables.com&#x2F;Book-Scanner-Low-cost-easy-to-...</a>
评论 #27368314 未加载
评论 #27369538 未加载
Gedxx将近 4 年前
Here a homemade way to digitize a book with a compact camera <a href="https:&#x2F;&#x2F;www.ikkaro.com&#x2F;en&#x2F;como-digitalizar-libro&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.ikkaro.com&#x2F;en&#x2F;como-digitalizar-libro&#x2F;</a>
TaylorAlexander将近 4 年前
Hello if anyone is in the Bay Area and has a book scanner I’d love to scan my copy of this book which was only printed in India in 2001 and seems relatively rare:<p><a href="https:&#x2F;&#x2F;www.abebooks.com&#x2F;9780140298246&#x2F;Patents-Myths-Reality-Shiva-Vandana-014029824X&#x2F;plp" rel="nofollow">https:&#x2F;&#x2F;www.abebooks.com&#x2F;9780140298246&#x2F;Patents-Myths-Reality...</a><p>I did fill out the form for the internet archive but it talked about scanning a library and I’m not sure they want to deal with just one book.
评论 #27364658 未加载
mcguire将近 4 年前
Are the kits back in stock?<p>I fooled around with the DIY option, but realized I was incompetent. Ended up buying a cheap Czur scanner, which works surprisingly well.<p>For it, you hold the book open on the black mat on a table. The scanner uses a laser to measure and correct page curvature, and takes a picture of both pages.<p>It produces decent PDFs (I&#x27;m not sure about the comparative resolution) with (bad) OCR&#x27;ed text. (The IA re-OCR&#x27;s the book after upload, right?)
braincode将近 4 年前
I&#x27;d love to see something like this made out of entirely recycled phones and their cameras instead of going with discrete components... any leads?
评论 #27367602 未加载
ngold将近 4 年前
I have most of the entire collection of hardback national geographics from 1930 to 1970. Wonder how legal it would be to scan them. Always wondered.
评论 #27363438 未加载
评论 #27363394 未加载
评论 #27363336 未加载
Topgamer7将近 4 年前
I&#x27;ve used scan tailor in the past to convert a outboard motor manual to pdf, it&#x27;s pretty powerful. I didn&#x27;t have a proper setup, but my results still came out decently.<p><a href="https:&#x2F;&#x2F;github.com&#x2F;4lex4&#x2F;scantailor-advanced" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;4lex4&#x2F;scantailor-advanced</a>
fiftyacorn将近 4 年前
I remember reading about Larry Page spending time developing a book scanner using a scanner and a hoover to turn pages
fortran77将近 4 年前
I built a similar one of these from a kit that Dan Reetz made. (Technology has improved since I built mine.)<p>I have eliminated most printed books. I had to pass a &quot;psychological barrier&quot; before I was able to discard the books I scanned.<p>The last holdout was music scores, but I now use an iPad for music at the piano.
评论 #27364661 未加载
jbergens将近 4 年前
The link <a href="http:&#x2F;&#x2F;store.diybookscanner.org&#x2F;" rel="nofollow">http:&#x2F;&#x2F;store.diybookscanner.org&#x2F;</a> goes to a shop page that is not configured yet.
Topgamer7将近 4 年前
I remember reading about Google&#x27;s book scanner that operated automatically using vacuum pressure to gently flip pages. I&#x27;d love to see an open source variety of that.
评论 #27362627 未加载
评论 #27364224 未加载
评论 #27364397 未加载
评论 #27364768 未加载
ggm将近 4 年前
UCL-CS had one of these which was deployed in conjunction with the British Library. This is when high pixel count CCDs were super expensive back in the 1980s. Amazing device.
indiantinker将近 4 年前
Nice! A foot-pedal can improve his over-all efficiency and reduce lower back and neck pain.
failwhaleshark将近 4 年前
I need this for some vintage IBM&#x2F;PC-compatible programming books that are a zillion pages long.