TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

The mounting cost of stale ad blocking rules (2018)

40 pointsby fossislifealmost 4 years ago

5 comments

pmoriartyalmost 4 years ago
<i>&quot;... we applied EasyList to both the Alexa 5k, a curated list of the 5,000 most popular sites on the web, and a random sampling of 5,000 sites from the Alexa 1,000,000 (ensuring no duplicate sites). Our measurement was in several steps:</i><p><i>1. Use Selenium and the DevTools Protocol to record every URL requested when rendering and executing a website.</i><p><i>2. Add additional automation to randomly select three distinct same-domain URLs from anchor tags on a page.</i><p><i>3. Used the above automation to visit the homepage of each site, and a maximum of three child pages, and recorded all URLs requested for images, script files, and other web resources.</i><p><i>4. Determine which of those URLs would be blocked by the version of EasyList fetched on that day, using Brave&#x27;s optimized ad-block implementation.</i><p>...<p><i>We found that the vast majority of EasyList rules are not used when browsing popular websites; 3,268 of 39,198 (~8%) of network and exception rules were used during our crawls (these measurements exclude element rules).&quot;</i><p>That doesn&#x27;t mean that EasyList is not useful for browsing the rest of the internet.
评论 #27496945 未加载
评论 #27498876 未加载
gorhillalmost 4 years ago
Much work has been done since this article came out to remove stale filters, see:<p><a href="https:&#x2F;&#x2F;twitter.com&#x2F;fanboynz&#x2F;status&#x2F;1344796683612299265" rel="nofollow">https:&#x2F;&#x2F;twitter.com&#x2F;fanboynz&#x2F;status&#x2F;1344796683612299265</a>
snyderpalmost 4 years ago
For anyone interested in this blog post here, the full conference paper version is here: <a href="https:&#x2F;&#x2F;www.peteresnyder.com&#x2F;static&#x2F;papers&#x2F;easylist-sigmetrics-2020.pdf" rel="nofollow">https:&#x2F;&#x2F;www.peteresnyder.com&#x2F;static&#x2F;papers&#x2F;easylist-sigmetri...</a>
ufoalmost 4 years ago
Does anyone know how EasyList normally deals with old rules? Is there a process to identify and remove stale rules?
yummypaintalmost 4 years ago
Can someone explain the steplike shapes in the curve in the &quot;time to filter a request&quot; plot? I was under the impression that ad blockers used hash tables or a similar structure which is agnostic of the address being checked with O(1). Are these some kind of cache misses?
评论 #27495756 未加载
评论 #27495891 未加载