Blacklight – A Real-Time Website Privacy Inspector

207 points by chris_f over 4 years ago

25 comments

fxtentacle over 4 years ago
I feel like we should unify the copyright and privacy laws.

If I copy a Disney movie without their knowledge and then extract value from it, for example by watching the movie without paying, everyone agrees that this is theft. And punishment is generally strong to excessive.

If a website copies my private data without my knowledge or even after I decline permission by sending DNT headers, that is somehow considered completely fine.

But to me, the value of my private data is vastly higher than the value of watching a copied movie. And the potential for financial damages stemming from a stolen identity is also much higher than the real damages from someone downloading an mp4.

I suggest that we classify the secret collection of private data as theft.

Edit: With my wording, I was referring to this old pro-copyright ad: https://m.youtube.com/watch?v=HmZm8vNHBSU
BonoboIO over 4 years ago
This is just like a high-score game ...

33 Trackers | 60 Third-Party Cookies https://themarkup.org/blacklight/?url=edition.cnn.com

32 Trackers | 53 Third-Party Cookies https://themarkup.org/blacklight/?url=wsj.com

The newspaper business is digging its own grave.

"This website could be monitoring your keystrokes and mouse clicks."
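To make the "third-party cookies" figure concrete, here is a minimal sketch of how such a count could be derived. It is not Blacklight's actual method. It assumes a cookie is third-party when its domain belongs to a different site than the page, and it approximates "site" as the last two DNS labels, which is wrong for suffixes like `co.uk` (a real tool would consult the Public Suffix List). The function names and the cookie list are hypothetical.

```python
from urllib.parse import urlparse


def site_of(host: str) -> str:
    """Naive 'site' of a host: its last two DNS labels (an assumption, see lead-in)."""
    labels = host.lower().lstrip(".").split(".")
    return ".".join(labels[-2:])


def count_third_party(page_url: str, cookie_domains: list[str]) -> int:
    """Count cookies whose domain belongs to a different site than the page."""
    page_site = site_of(urlparse(page_url).hostname or "")
    return sum(1 for domain in cookie_domains if site_of(domain) != page_site)


# Hypothetical cookie domains observed while loading a news page.
cookies = [".cnn.com", "edition.cnn.com", ".doubleclick.net", "ads.example.org"]
print(count_third_party("https://edition.cnn.com/", cookies))  # prints 2
```

The two `cnn.com` cookies count as first-party; the other two do not share the page's site and are counted.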
Amorymeltzer over 4 years ago
As always, it is truly amazing that Wikipedia, a top-10 global website, has none of this stuff: https://themarkup.org/blacklight/?url=en.wikipedia.org
eli over 4 years ago
There's some good stuff in here, but they're also using a very expansive definition of "tracker" that in some cases I think is just unfair.

For example Adobe TypeKit serves fonts. It's not ad tech at all. The only thing it tracks is how many times a font was served. Adobe does also have ad tracking technology but TypeKit isn't part of it.

Likewise the tool's author seems to misunderstand AWS CloudFront, which is a CDN and does not itself do any tracking nor is it connected to any Amazon ad tech.
yalogin over 4 years ago
As per this tool, if you value privacy, visiting Pornhub is many times better than visiting TechCrunch.

I wasn't surprised that TechCrunch has so many trackers, but I was surprised that a porn site has only one.
tyingq over 4 years ago
That's really well done. I tried several ecommerce, airline, travel, etc., sites. I was surprised by the extent of fingerprinting and third-party sites.

For example, American Airlines, aa.com: 17 ad trackers, 32 third-party cookies, canvas fingerprinting, session keyboard and mouse tracking, data sent to Facebook, LinkedIn, Amazon, and more. Ouch.
unnameduser1 over 4 years ago
Looks interesting and is definitely a needed tool to show what websites are doing that average users don't realize.

Sadly the really bad players detect that this is not a normal user and redirect to an error page.

You will have to try harder to fool them into thinking your headless browser is a real user. Your test device is probably in a data center, and they detect that the IP isn't an end-user IP.

Hope you get it working.
iruoy over 4 years ago
I tried some sites I thought might get a high score. Mostly news sites and webstores since they like to know a lot about you.

Fox News [1] and Breitbart [2] got scarily high scores.

[1] https://themarkup.org/blacklight/?url=foxnews.com

[2] https://themarkup.org/blacklight/?url=breitbart.com
makepanic over 4 years ago
If any website implements this Chromium headless bug [1] then one could potentially avoid detection in Blacklight.

[1] - https://bugs.chromium.org/p/chromium/issues/detail?id=1090429
hinkley over 4 years ago
I forget sometimes how much happier I am in my philistine pig-ignorance of what really goes on.

The title reminds me of two things. The blacklight sketch on SNL where everything had gross stuff all over it, and a week ago when I was trying to figure out why a button on a website wasn't working in Firefox, and so I opened the network tab.

Every time I even moused over anything there were four network requests. To so many different origins. I did not want to see this (like the joke of turning your lights on in NYC so the roaches have a chance to scatter before you walk into the kitchen.) I kinda miss when we had a visual indicator that network traffic was going on. I might still not want to know but the social pressure of it would at least slow their roll a little bit. Maybe 1 request per mouseover.
technotarek over 4 years ago
Maybe publishers who do well would proudly let their users know? A badge of sorts? https://themarkup.org/blacklight/?url=attic.city
bookofjoe over 4 years ago
> 87 percent of websites are tracking you. This new tool will let you run a creepiness check.

https://www.washingtonpost.com/technology/2020/09/25/privacy-check-blacklight/

https://archive.vn/rOpOL
llacb47 over 4 years ago
Here are some bad ones:

https://themarkup.org/blacklight/?url=thoughtcatalog.com

https://themarkup.org/blacklight/?url=factinate.com

https://themarkup.org/blacklight/?url=sacbee.com

https://themarkup.org/blacklight/?url=mediabiasfactcheck.com

https://themarkup.org/blacklight/?url=www.thestar.com

https://themarkup.org/blacklight/?url=space.com

https://themarkup.org/blacklight/?url=laptopmag.com

https://themarkup.org/blacklight/?url=hollywoodlife.com

https://themarkup.org/blacklight/?url=nfl.com

https://themarkup.org/blacklight/?url=kyma.com

The worst so far: https://themarkup.org/blacklight/?url=thehindu.com

https://themarkup.org/blacklight/?url=m.economictimes.com
mellosouls over 4 years ago
Looks good, but unfortunate name clash - I assume this is unrelated to the established Blacklight discovery platform used (ironically, considering The Markup's domain) in the Panama Papers exposure?

https://projectblacklight.org/

https://source.opennews.org/articles/people-and-tech-behind-panama-papers/
rfreytag over 4 years ago
Check out https://themarkup.org/blacklight/?url=wapo.com

Astonishing.
mywacaday over 4 years ago
Does anyone know of a way to quantify the cost of ad-trackers in terms of CPU/memory/bandwidth/battery? How much more responsive would websites be without all the trackers? How much longer would we keep our phones if they didn't struggle under the weight of all this tracking? How much less data center capacity would be required? How much more bandwidth would be available?
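For the bandwidth part of that question only, one rough approach is to export a HAR file from the browser's network tab and sum response sizes by first-party versus third-party host. The sketch below assumes the standard HAR fields `log.entries[].request.url` and `response.bodySize` (which is -1 when unknown). The function name, the hosts, and the sizes are made up for illustration, and "third-party" here is not the same as "tracker"; it says nothing about CPU, memory, or battery.

```python
from collections import Counter
from urllib.parse import urlparse


def bytes_by_party(har: dict, page_host: str) -> Counter:
    """Sum response body bytes for first-party vs. third-party hosts in a HAR log."""
    totals = Counter()
    for entry in har["log"]["entries"]:
        host = urlparse(entry["request"]["url"]).hostname or ""
        size = entry["response"].get("bodySize", -1)
        if size < 0:  # HAR uses -1 when the size is unknown; skip those entries
            continue
        same_site = host == page_host or host.endswith("." + page_host)
        totals["first_party" if same_site else "third_party"] += size
    return totals


# Tiny hand-written HAR fragment (hypothetical hosts and sizes).
sample = {"log": {"entries": [
    {"request": {"url": "https://news.example/index.html"}, "response": {"bodySize": 40000}},
    {"request": {"url": "https://cdn.news.example/app.js"}, "response": {"bodySize": 60000}},
    {"request": {"url": "https://tracker.example.net/t.js"}, "response": {"bodySize": 150000}},
    {"request": {"url": "https://ads.example.org/pixel"}, "response": {"bodySize": -1}},
]}}

totals = bytes_by_party(sample, "news.example")
share = totals["third_party"] / sum(totals.values())
print(dict(totals), f"third-party share: {share:.0%}")
```

A real export could be loaded with `json.load` and passed in the same way; comparing totals with and without a content blocker enabled would give a crude before/after figure.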
Semaphor over 4 years ago
This does not work with GDPR-compliant sites (the few that exist). I wonder if it would make sense to use something similar to [0] to auto-accept, note in the results that there is a consent gate and then list the trackers in the accepted state?

[0]: https://www.i-dont-care-about-cookies.eu/
zie over 4 years ago
Try your bank website... it won't be a nice result.

Even banks are getting in on the action, it's gross.
amq over 4 years ago
Next step: server-side tracking. With Firefox and Safari blocking JS tracking by default, many companies feel like they don't have any other choice.
sbarre over 4 years ago
I also appreciate that you get an archive download of the results if you want one (at the bottom). Can be useful for historical comparisons.
mda over 4 years ago
Some popular tech sites:
Anandtech: 29 ad trackers, 76 third-party cookies
Arstechnica: 47 ad trackers, 110 third-party cookies
quaffapint over 4 years ago
Running Pi-hole, it's astounding what percentage of the DNS requests are for trackers/ad serving.
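The percentage mentioned here is simply blocked queries divided by total queries. A minimal sketch of that arithmetic, assuming the query log has already been reduced to hypothetical `(domain, was_blocked)` pairs; this is not Pi-hole's actual log format or API.

```python
def blocked_share(queries: list[tuple[str, bool]]) -> float:
    """Return the fraction of DNS queries that were blocked (0.0 for an empty log)."""
    if not queries:
        return 0.0
    blocked = sum(1 for _domain, was_blocked in queries if was_blocked)
    return blocked / len(queries)


# Hypothetical (domain, was_blocked) pairs; not Pi-hole's actual log format.
log = [
    ("news.example", False),
    ("tracker.example.net", True),
    ("ads.example.org", True),
    ("cdn.news.example", False),
    ("pixel.example.com", True),
]
print(f"{blocked_share(log):.0%} of queries were blocked")  # prints "60% of queries were blocked"
```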
TedDoesntTalk over 4 years ago
I entered a website that got 0 for all categories... it was not obvious that the scanner was done or that all zeros was a good thing. Hmmm
bobblywobbles over 4 years ago
This is why websites/initiatives like Scroll are so effective. I signed up and pay dollars a month to avoid paywalls and help fund news sources [that should also be avoiding ad-pixels!]

https://scroll.com/
thestepdaddy over 4 years ago
The Ghostery plugin is much more effective if you use Chrome: it tells you as you visit a site and gives you the option to block it. I don't know why this is ranked 1 on HN.