TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

A Face Is Exposed for AOL Searcher No. 4417749 (2006)

135 pointsby acqbuabout 1 year ago

10 comments

neonateabout 1 year ago
<a href="https:&#x2F;&#x2F;web.archive.org&#x2F;web&#x2F;20170715075814&#x2F;https:&#x2F;&#x2F;www.nytimes.com&#x2F;2006&#x2F;08&#x2F;09&#x2F;technology&#x2F;09aol.html" rel="nofollow">https:&#x2F;&#x2F;web.archive.org&#x2F;web&#x2F;20170715075814&#x2F;https:&#x2F;&#x2F;www.nytim...</a>
samwillisabout 1 year ago
This is important to look back on in the context of what&#x27;s happing now with AI tools. This story is obviously about the leak of the data publicly, but what it shows is the profiling that is available to corporations.<p>Search has exposed so much data about ourselves to the services we use with very little regulation on what they are permitted to do with it inside their own walls.<p>My fear with AI is that we are moving toward sending even more data to party services. Tools such a co-pilot (which I enjoy using) are a gold mine for behavioural analysis. The profiling that will be possible with these tools is extraordinary and we don&#x27;t yet fully understand the implication.<p>It&#x27;s because of this that I&#x27;m a massive proponent of &quot;Local AI&quot;. We need to be pushing for the industry to adopt a local inference architecture asap. It needs to become the standard pattern as early as possible to reduce the risk of the AI revolution being a repeat of the invasive internet search and advertising industry.
评论 #39504912 未加载
评论 #39500942 未加载
评论 #39504209 未加载
jll29about 1 year ago
That episode (releasing the AOL search query log file for research purposes and subsequent aftermath) led to some firings at the company, but some information retrieval searchers used this log to conduct important experiments.<p>The &quot;60s lady with the dog that kept peeing her sofa&quot; got her hour of fame, and the whole thing became a case study in de-anonymization.<p>A few pointers:<p><a href="https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;AOL_search_log_release" rel="nofollow">https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;AOL_search_log_release</a><p><a href="https:&#x2F;&#x2F;www.researchgate.net&#x2F;publication&#x2F;233390862_Privacy_Preserving_Web_Query_Log_Publishing_A_Survey_on_AnonymizationTechniques" rel="nofollow">https:&#x2F;&#x2F;www.researchgate.net&#x2F;publication&#x2F;233390862_Privacy_P...</a><p><a href="https:&#x2F;&#x2F;github.com&#x2F;wasiahmad&#x2F;aol_query_log_analysis">https:&#x2F;&#x2F;github.com&#x2F;wasiahmad&#x2F;aol_query_log_analysis</a><p><a href="https:&#x2F;&#x2F;www.technologyreview.com&#x2F;2006&#x2F;08&#x2F;15&#x2F;100592&#x2F;who-benefits-from-aols-released-search-logs&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.technologyreview.com&#x2F;2006&#x2F;08&#x2F;15&#x2F;100592&#x2F;who-benef...</a><p><a href="https:&#x2F;&#x2F;www.sciencedirect.com&#x2F;science&#x2F;article&#x2F;abs&#x2F;pii&#x2F;S0020025509000516" rel="nofollow">https:&#x2F;&#x2F;www.sciencedirect.com&#x2F;science&#x2F;article&#x2F;abs&#x2F;pii&#x2F;S00200...</a><p><a href="https:&#x2F;&#x2F;isquared.wordpress.com&#x2F;2014&#x2F;04&#x2F;24&#x2F;mining-search-logs-for-usage-patterns&#x2F;" rel="nofollow">https:&#x2F;&#x2F;isquared.wordpress.com&#x2F;2014&#x2F;04&#x2F;24&#x2F;mining-search-logs...</a>
评论 #39506004 未加载
DicIfTExabout 1 year ago
There was also a theatre production produced around AOL User 927: <a href="https:&#x2F;&#x2F;arstechnica.com&#x2F;uncategorized&#x2F;2008&#x2F;05&#x2F;uare-what-u-seek-new-play-sparked-by-search-queries&#x2F;" rel="nofollow">https:&#x2F;&#x2F;arstechnica.com&#x2F;uncategorized&#x2F;2008&#x2F;05&#x2F;uare-what-u-se...</a><p>And a documentary series about User 711391: <a href="https:&#x2F;&#x2F;www.imdb.com&#x2F;title&#x2F;tt1455044&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.imdb.com&#x2F;title&#x2F;tt1455044&#x2F;</a>
acqbuabout 1 year ago
<a href="https:&#x2F;&#x2F;archive.is&#x2F;sfAMv" rel="nofollow">https:&#x2F;&#x2F;archive.is&#x2F;sfAMv</a>
评论 #39501030 未加载
karaterobotabout 1 year ago
Re-identification of supposedly anonymous data was a problem twenty years ago, and is a bigger problem today. Soon it may become a crisis, as the tools needed to do it become more and more turnkey, effective, and commodified. Now we dox as in a glass, darkly, etc.
评论 #39515081 未加载
mortallywoundedabout 1 year ago
Maybe some day we&#x27;ll have a PIR-based[1] search engine-- but then you couldn&#x27;t harvest data on your users and sell personalized ads, so maybe not.<p>[1]: <a href="https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;Private_information_retrieval" rel="nofollow">https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;Private_information_retrieval</a>
mmvasqabout 1 year ago
My data is in this list someone used for &quot;research&quot;. They evidently correlated with Sony Everquest Chats, LinkedIn, Twitter, AOL, corporate email, Meta, Amazon Purchases, Natural Language Processing Data collections, some DFIR (digital forensics) data collection, HR records and so much more. I would say at the time they collected the info I was planning renaissance fest attire, studying security topics and frantically figuring out identities of some gamers who dogpiled on me as a few made genuine threats towards my life and livelihood.<p>I&#x27;d say yes, throw disinformation at at, as once I started noticing someone was really doing that level of tracking I did throw disinfo at it... a lot. The problem was with their interpretations from there of searches intentionally made to mess them up - as I neither had consent yet realized I was being stalked.<p>That went on and on and on - until people died - my father thought I was swatting him (I had a welfare check made - not a swat - as they were disclosing his financial advisor and info - he&#x27;s white and safe presumably) . Yet yes, people did die. It is evident no one understands it is never just them in research or in a channel for conversations. The result of the whole thing was my childhood best friend also being left homeless and run over (by someone opportunistic in the whole process). My brother de-housed by a gang member a party kept sending about (one actually poisoned my dog). My father alienated. My sister used as a puppet for extortion instead of being rehabbed.<p>It&#x27;s very very very ugly the string of deaths and alienation that came from what they thought was funny research into AI.<p>This old data needs to be located and disposed of or put into proper custodianship. It&#x27;s grown teeth that cost lives. Throwing misinfo at it is just crapping up the Internet more to push to web 3.0, where the same problem will thrive. Requires legislation. Not quite relaxed enough to articulate how and what right now due to the extensive harassment that came about from it all. Maybe some day. It&#x27;s been pretty terrorizing.
评论 #39559821 未加载
zx8080about 1 year ago
There&#x27;s one thing could work, but which would be very hard or impossible to implement: management personal responsibility for private data loss&#x2F;exposure.
elzbardicoabout 1 year ago
2006... If they only knew....
评论 #39501599 未加载