TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

First Look Media Releases PDF Redact Tools

3 pointsby patrickodalmost 10 years ago

1 comment

bazzarghalmost 10 years ago
The PDFs this produces are simply collections of PNGs, and won&#x27;t be accessible. It&#x27;s always a compromise though. If you try to edit the PDF adding black boxes, and remove hidden objects, you may still leak data via the tagged pdf text; it doesn&#x27;t have to match up to what&#x27;s on the page exactly. So, converting to PNG isn&#x27;t a terrible idea, but it would be nice to combine this with something that OCRd the PNG conversion? eg<p><a href="https:&#x2F;&#x2F;github.com&#x2F;fritz-hh&#x2F;OCRmyPDF" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;fritz-hh&#x2F;OCRmyPDF</a><p>(which uses tessaract under the hood). The other thing this is missing, comparing it to commercial redacters I&#x27;ve used, is the ability to assist in the redaction: eg removing SSNs, phone numbers, all occurrences of key phrases.