TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Ask HN: How does outline.com work?

2 点作者 rayvy超过 6 年前
I'm pretty amazed that I can get around a paywall just by using https://outline.com/www.[my url] . I'm sure there's nothing too crazy going on under the hood, but does anyone exactly how it works?

1 comment

brad0超过 6 年前
Just took a look at this, here&#x27;s my guess.<p>- Pretend they&#x27;re a crawler such as Google and pull down the HTML, potentially executing javascript<p>- Once it&#x27;s pulled down, clean it up using open source code such as readability <a href="https:&#x2F;&#x2F;github.com&#x2F;mozilla&#x2F;readability" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;mozilla&#x2F;readability</a><p>- Store that result as a document in a nosql database<p>Once they have pulled the article down once they don&#x27;t need to get it again.