TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Saving web pages as PDFs in 2019, a real challenge

37 pointsby lucabenazziover 5 years ago

10 comments

seppelover 5 years ago
Firefox (Mac) is not even able to print this article into a PDF. It gives you only one page and the page ends with a half sentence with the letters cropped in the middle. One third of the text is missing.<p>I cant count the number of times I had to take screenshots to save crucial information from web pages.
prashntsover 5 years ago
A tip if you&#x27;re using iOS&#x2F;iPadOS: In Safari, taking a screenshot now supports taking &quot;full page shot&quot;, which you can save as a pdf. You get this option when you tap the tiny screenshot preview.
评论 #21808866 未加载
xorandover 5 years ago
&gt;Saving pages as HTML is not ideal because a) you get an HTML file plus a folder, not very practical if you want to retrieve them later, and b) you never know how that page is going to render in future versions of your browser.<p>Yes but for my use case, which is better scientific communication means, PDF is not enough.<p>Consider for example slides for a presentation. The typical mathematician does them in TEX which outputs a PDF. Then the PDF is (sometimes) made available online. I realized that PDF slides are far inferior to HTML slides (where you can add demos and whatnot, shameless example [0]). Just put all in a github repository and anybody can take them home.<p>[0] <a href="https:&#x2F;&#x2F;mbuliga.github.io&#x2F;emergent-10-years&#x2F;presentation.html" rel="nofollow">https:&#x2F;&#x2F;mbuliga.github.io&#x2F;emergent-10-years&#x2F;presentation.htm...</a>
fbriffover 5 years ago
I gave up on the idea of reliably saving web pages in PDF.<p>I use now &quot;SingleFile&quot;, a Firefox or Chrome extension that helps to save a complete page (with CSS, images, fonts, frames, etc.) as a single HTML file.<p><a href="https:&#x2F;&#x2F;addons.mozilla.org&#x2F;en-US&#x2F;firefox&#x2F;addon&#x2F;single-file&#x2F;" rel="nofollow">https:&#x2F;&#x2F;addons.mozilla.org&#x2F;en-US&#x2F;firefox&#x2F;addon&#x2F;single-file&#x2F;</a><p><a href="https:&#x2F;&#x2F;chrome.google.com&#x2F;webstore&#x2F;detail&#x2F;singlefile&#x2F;mpiodijhokgodhhofbcjdecpffjipkle?hl=en" rel="nofollow">https:&#x2F;&#x2F;chrome.google.com&#x2F;webstore&#x2F;detail&#x2F;singlefile&#x2F;mpiodij...</a><p><a href="https:&#x2F;&#x2F;github.com&#x2F;gildas-lormeau&#x2F;SingleFile" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;gildas-lormeau&#x2F;SingleFile</a>
评论 #21795402 未加载
sturakovover 5 years ago
I recommend <a href="https:&#x2F;&#x2F;webrecorder.io&#x2F;" rel="nofollow">https:&#x2F;&#x2F;webrecorder.io&#x2F;</a> to reliably capture websites.
评论 #21795709 未加载
Garveyover 5 years ago
Shouldn&#x27;t the UI&#x2F;UX&#x2F;Whoever was responsible for the design, have supplied CSS styles for printing?<p><a href="https:&#x2F;&#x2F;www.smashingmagazine.com&#x2F;2018&#x2F;05&#x2F;print-stylesheets-in-2018&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.smashingmagazine.com&#x2F;2018&#x2F;05&#x2F;print-stylesheets-i...</a>
meehowover 5 years ago
I had similar problem and wrote a browser addon: <a href="https:&#x2F;&#x2F;2read.net&#x2F;" rel="nofollow">https:&#x2F;&#x2F;2read.net&#x2F;</a> It converts websites to &quot;readable&quot; form and if you have IPFS running, it will also &quot;pin&quot; content locally. In most cases it works better than just printing an article. Here is an example with mentioned article: <a href="https:&#x2F;&#x2F;ipfs.io&#x2F;ipfs&#x2F;QmYPkcXgKLBye3L8M1VJWsGAb2mJXkJSEncqcSCkFTdHhi&#x2F;" rel="nofollow">https:&#x2F;&#x2F;ipfs.io&#x2F;ipfs&#x2F;QmYPkcXgKLBye3L8M1VJWsGAb2mJXkJSEncqcSC...</a>
评论 #21795353 未加载
评论 #21795750 未加载
zmzrrover 5 years ago
&gt;Saving pages as HTML is not ideal because a) you get an HTML file plus a folder, not very practical if you want to retrieve them later<p>MHTML exists. <a href="https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;MHTML" rel="nofollow">https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;MHTML</a>
评论 #21791536 未加载
russellbeattieover 5 years ago
This has been a problem for years, which is too bad. No one really uses MHTML or any alternative. Hopefully Web Bundles* becomes a commonly supported spec.<p>* <a href="https:&#x2F;&#x2F;web.dev&#x2F;web-bundles&#x2F;" rel="nofollow">https:&#x2F;&#x2F;web.dev&#x2F;web-bundles&#x2F;</a>
评论 #21795448 未加载
Apocryphonover 5 years ago
Isn’t this what read mode is supposed to be for?
评论 #21791636 未加载