Last year I wrote a couple of similar "local" PDF tools that run in the browser with no network requests. Each is just a single HTML file that will work offline:<p>- <a href="https://shreevatsa.net/pdf-pages/" rel="nofollow">https://shreevatsa.net/pdf-pages/</a> is for extracting pages, inserting blank pages, duplicating or reversing pages, etc.<p>- <a href="https://shreevatsa.net/pdf-unspread/" rel="nofollow">https://shreevatsa.net/pdf-unspread/</a> is for splitting a PDF's "wide" pages (consisting of two-page spreads) in the middle.<p>- <a href="https://shreevatsa.net/mobius-print/" rel="nofollow">https://shreevatsa.net/mobius-print/</a> is the earliest of these, and written for a niche use-case: "Möbius printing" of pages, which is printing out an article/paper two-sided in a really interesting order. (I've tried it and love it.)<p>These don't use WebAssembly, but just use the excellent "pdf-lib" JS library. To keep the file self-contained, I put the whole minified source into a <script> tag at the bottom of the (otherwise hand-written) HTML file.
Hey, Thanks for Posting it here, i built this tool, hope you like it, feel free to look at my source and contribute. <a href="https://github.com/jufabeck2202/localpdfmerger" rel="nofollow">https://github.com/jufabeck2202/localpdfmerger</a>
Is there a .pdf tool which allows compression to a defined file size? Tools like ghostscript can compress a .pdf to different levels of quality by using different setting but not a defined file size; I understand that this has to do with the compression algorithm itself and that data could be compressed only to a certain limit, but what if the file size limit is within that limit?<p>I'm asking this because an user of my problem validation platform wanted a solution for this[1], because websites requiring document upload have a file size limit and often the compressed file is either above or below the prescribed file size limit thereby loosing out on quality unnecessarily.<p>[1]'Reduce document file size to specific size' (I have added the link to it on my profile, since it's my own platform).
I'd certainly be curious why wasm ends up being 15x slower than native binary in this case, but it's not insurmountable. All of the major commercial PDF editing suites use wasm + their own C++ based pdf engine to great effect.<p>The article that this is based on is here, and a good read. It seems like it's at least non-trivial to get it working, and I'd wonder how the process looks for other compiled binaries, having not tried to do that implementation from scratch.
<a href="https://dev.to/wcchoi/browser-side-pdf-processing-with-go-and-webassembly-13hn" rel="nofollow">https://dev.to/wcchoi/browser-side-pdf-processing-with-go-an...</a>
Looks like this thread is all about sharing our own related local browser-based PDF tools. Here’s mine: <a href="https://pdftotext.github.io" rel="nofollow">https://pdftotext.github.io</a>
There are a few versions of tools like this, or similar, available. Here's mine:<p><a href="https://kc0bfv.github.io/WASM-PDF-Combiner/" rel="nofollow">https://kc0bfv.github.io/WASM-PDF-Combiner/</a><p>I used existing wasm compiles of PDF tools. This use of wasm is pretty awesome to me - I often end up working on very restricted desktop clients with little customization possible, but they always let me run a browser.
Convenient if you are on a machine where you can’t install software. (Corporate computer, school computer, library computer etc.)<p>For Linux and macOS computers that you are allowed to install software on I recommend the pdftk command line tool.<p>Ubuntu family:<p><pre><code> sudo apt install pdftk
</code></pre>
macOS with Homebrew:<p><pre><code> brew install pdftk-java</code></pre>
Can one easily install such apps as a Chrome app/PWA, and deactivate access to the internet since it doesn't need it and one can merge personal PDFs?
I created a PDF table extractor tool last year with the same idea that it should be local only. Try it here: <a href="https://pdftableutil.possiblenull.com/app/" rel="nofollow">https://pdftableutil.possiblenull.com/app/</a>
Also as a Google Docs addon (still local only) <a href="https://workspace.google.com/marketplace/app/pdf_table_importer/646940040599" rel="nofollow">https://workspace.google.com/marketplace/app/pdf_table_impor...</a><p>I had a bad case of scope creep, so the tool can also extract tables from scanned/image PDFs using OpenCV.js and tesseract OCR wasm build!
This is helpful. Generally if I need to do any pdf manipulation when I'm away from my own machine I use an android app - PDF Utils [1].<p>[1] <a href="https://play.google.com/store/apps/details?id=pdf.shash.com.pdfutility" rel="nofollow">https://play.google.com/store/apps/details?id=pdf.shash.com....</a>
Anybody knows a simple tool I can use to turn an academic two-column paper into a single column pdf (so I can read it easily on e-paper like a remarkable)?<p>(Ideally I'd like to be able to run such a tool from browser/phone)
It's great to see more PDF tools.<p>Many times I just want to clip white margins from PDFs so that it is easier to view on tablets or phones. Most viewers don't have a way to force the clipping of pages, so when you change page the zoom is lost and suddenly all the content is squished to center.<p>Last time I found cli programs to do it aprox. five years ago it was really difficult to find good tools to edit PDFs like that.<p>It's actually not trivial task, as sometimes pages have different margins, e.g. odd and even pages has different margins on folding side of page.
For something like this, how do we know that the files are not sent to a server? Am I just trusting the web app? Is there any way to be sure other than having and reading the source?
Something I still miss is a free and easy PDF tool which lets you delete, reorder and add pages from multiple PDFs. On Windows there is just Xodo but its UX is unfortunately subpar and on macOS you have Preview where the UI is better but once you have multiple PDFs from where you get the pages it can get confusing.
This is a nice usecase for pdfcpu.
If you are pdftk user give the pdfcpu CLI a spin.
It is multi platform and has some nice features baked in.
<a href="https://pdfcpu.io/" rel="nofollow">https://pdfcpu.io/</a>