TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

The mounting cost of stale ad blocking rules (2018)

40 点作者 fossislife将近 4 年前

5 条评论

pmoriarty将近 4 年前
<i>&quot;... we applied EasyList to both the Alexa 5k, a curated list of the 5,000 most popular sites on the web, and a random sampling of 5,000 sites from the Alexa 1,000,000 (ensuring no duplicate sites). Our measurement was in several steps:</i><p><i>1. Use Selenium and the DevTools Protocol to record every URL requested when rendering and executing a website.</i><p><i>2. Add additional automation to randomly select three distinct same-domain URLs from anchor tags on a page.</i><p><i>3. Used the above automation to visit the homepage of each site, and a maximum of three child pages, and recorded all URLs requested for images, script files, and other web resources.</i><p><i>4. Determine which of those URLs would be blocked by the version of EasyList fetched on that day, using Brave&#x27;s optimized ad-block implementation.</i><p>...<p><i>We found that the vast majority of EasyList rules are not used when browsing popular websites; 3,268 of 39,198 (~8%) of network and exception rules were used during our crawls (these measurements exclude element rules).&quot;</i><p>That doesn&#x27;t mean that EasyList is not useful for browsing the rest of the internet.
评论 #27496945 未加载
评论 #27498876 未加载
gorhill将近 4 年前
Much work has been done since this article came out to remove stale filters, see:<p><a href="https:&#x2F;&#x2F;twitter.com&#x2F;fanboynz&#x2F;status&#x2F;1344796683612299265" rel="nofollow">https:&#x2F;&#x2F;twitter.com&#x2F;fanboynz&#x2F;status&#x2F;1344796683612299265</a>
snyderp将近 4 年前
For anyone interested in this blog post here, the full conference paper version is here: <a href="https:&#x2F;&#x2F;www.peteresnyder.com&#x2F;static&#x2F;papers&#x2F;easylist-sigmetrics-2020.pdf" rel="nofollow">https:&#x2F;&#x2F;www.peteresnyder.com&#x2F;static&#x2F;papers&#x2F;easylist-sigmetri...</a>
ufo将近 4 年前
Does anyone know how EasyList normally deals with old rules? Is there a process to identify and remove stale rules?
yummypaint将近 4 年前
Can someone explain the steplike shapes in the curve in the &quot;time to filter a request&quot; plot? I was under the impression that ad blockers used hash tables or a similar structure which is agnostic of the address being checked with O(1). Are these some kind of cache misses?
评论 #27495756 未加载
评论 #27495891 未加载