科技回声

8 条评论

anonytrary超过 6 年前

You might want to include some actual pictures of the input and output in the readme. The current examples are just one-line command snippets which aren't as useful to someone who hasn't decided to use the tool yet.

评论 #18200942 未加载

danburzo超过 6 年前

I’ve been sporadically working on this over the last couple of weeks, and I think it’s now stable enough to get other people’s feedback on it. I got the idea while perusing Simon Wardley’s mapping book-in-progress (<a href="https://medium.com/wardleymaps" rel="nofollow">https://medium.com/wardleymaps</a>), and I wondered whether I can bundle all the chapters into a decent-looking PDF. (It works pretty well for that purpose). I also wanted it to be a sample app for gluing things together for the purpose of producing books in the browser.I’d love it if you gave it a spin; please let me know if you find anything nasty!

dananjaya86超过 6 年前

How is it different from, let's say:chrome --headless --disable-gpu --print-to-pdf <a href="https://www.google.com/" rel="nofollow">https://www.google.com/</a>

评论 #18200356 未加载

评论 #18201004 未加载

评论 #18200348 未加载

burtonator超过 6 年前

Polar has a similar feature if you're just wanting an archive of web pages:<a href="https://getpolarized.io/" rel="nofollow">https://getpolarized.io/</a>We support 'captured' HTML pages. Basically what we do is we fetch the full HTML of the content and store it in a PHZ file (polar HTML archive) and then we save that to disk (it's just a zip file with JSON metadata).The Polar app is an Electron app so it has full access to render HTML.We then inject our self into the network layer using protocol interceptors and if you're loading the URL you just captured we load the content from the PHZ instead of the network.You can then annotate the content, take notes on it, tag it, and keep it forever without risk of it vanishing.I use it for important documents that I can't afford to ever lose. For example, the Etherium whitepapers are in HTML , not PDF. they're also living documents so I can just capture anytime I want.HTML files don't often print properly so this way I can keep them the way they were meant to be seen.

评论 #18202514 未加载

评论 #18202281 未加载

评论 #18202208 未加载

dustingetz超过 6 年前

Examples please, and can you show me the differences made by the enhancements?

评论 #18200958 未加载

heinrichhartman超过 6 年前

Just tried it on their GitHub page:percollate pdf --output p.pdf <a href="https://github.com/danburzo/percollate" rel="nofollow">https://github.com/danburzo/percollate</a>The font is gigantic and the page tiny. Barely get to the second headline on the first page.And there is no way to tune this on the command line (yet).

评论 #18200934 未加载

dvfjsdhgfv超过 6 年前

This made me smile:> percollate html Not implemented yet

评论 #18200971 未加载

v01d4lph4超过 6 年前

Nice!

评论 #18200959 未加载

8 条评论

anonytrary超过 6 年前

评论 #18200942 未加载

danburzo超过 6 年前

dananjaya86超过 6 年前

How is it different from, let's say:chrome --headless --disable-gpu --print-to-pdf <a href="https://www.google.com/" rel="nofollow">https://www.google.com/</a>

Show HN: Percollate – a command-line tool to grab web pages as PDFs

8 条评论

Show HN: Percollate – a command-line tool to grab web pages as PDFs

8 条评论