TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

A Face Is Exposed for AOL Searcher No. 4417749 (2006)

135 点作者 acqbu大约 1 年前

10 条评论

neonate大约 1 年前
<a href="https:&#x2F;&#x2F;web.archive.org&#x2F;web&#x2F;20170715075814&#x2F;https:&#x2F;&#x2F;www.nytimes.com&#x2F;2006&#x2F;08&#x2F;09&#x2F;technology&#x2F;09aol.html" rel="nofollow">https:&#x2F;&#x2F;web.archive.org&#x2F;web&#x2F;20170715075814&#x2F;https:&#x2F;&#x2F;www.nytim...</a>
samwillis大约 1 年前
This is important to look back on in the context of what&#x27;s happing now with AI tools. This story is obviously about the leak of the data publicly, but what it shows is the profiling that is available to corporations.<p>Search has exposed so much data about ourselves to the services we use with very little regulation on what they are permitted to do with it inside their own walls.<p>My fear with AI is that we are moving toward sending even more data to party services. Tools such a co-pilot (which I enjoy using) are a gold mine for behavioural analysis. The profiling that will be possible with these tools is extraordinary and we don&#x27;t yet fully understand the implication.<p>It&#x27;s because of this that I&#x27;m a massive proponent of &quot;Local AI&quot;. We need to be pushing for the industry to adopt a local inference architecture asap. It needs to become the standard pattern as early as possible to reduce the risk of the AI revolution being a repeat of the invasive internet search and advertising industry.
评论 #39504912 未加载
评论 #39500942 未加载
评论 #39504209 未加载
jll29大约 1 年前
That episode (releasing the AOL search query log file for research purposes and subsequent aftermath) led to some firings at the company, but some information retrieval searchers used this log to conduct important experiments.<p>The &quot;60s lady with the dog that kept peeing her sofa&quot; got her hour of fame, and the whole thing became a case study in de-anonymization.<p>A few pointers:<p><a href="https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;AOL_search_log_release" rel="nofollow">https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;AOL_search_log_release</a><p><a href="https:&#x2F;&#x2F;www.researchgate.net&#x2F;publication&#x2F;233390862_Privacy_Preserving_Web_Query_Log_Publishing_A_Survey_on_AnonymizationTechniques" rel="nofollow">https:&#x2F;&#x2F;www.researchgate.net&#x2F;publication&#x2F;233390862_Privacy_P...</a><p><a href="https:&#x2F;&#x2F;github.com&#x2F;wasiahmad&#x2F;aol_query_log_analysis">https:&#x2F;&#x2F;github.com&#x2F;wasiahmad&#x2F;aol_query_log_analysis</a><p><a href="https:&#x2F;&#x2F;www.technologyreview.com&#x2F;2006&#x2F;08&#x2F;15&#x2F;100592&#x2F;who-benefits-from-aols-released-search-logs&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.technologyreview.com&#x2F;2006&#x2F;08&#x2F;15&#x2F;100592&#x2F;who-benef...</a><p><a href="https:&#x2F;&#x2F;www.sciencedirect.com&#x2F;science&#x2F;article&#x2F;abs&#x2F;pii&#x2F;S0020025509000516" rel="nofollow">https:&#x2F;&#x2F;www.sciencedirect.com&#x2F;science&#x2F;article&#x2F;abs&#x2F;pii&#x2F;S00200...</a><p><a href="https:&#x2F;&#x2F;isquared.wordpress.com&#x2F;2014&#x2F;04&#x2F;24&#x2F;mining-search-logs-for-usage-patterns&#x2F;" rel="nofollow">https:&#x2F;&#x2F;isquared.wordpress.com&#x2F;2014&#x2F;04&#x2F;24&#x2F;mining-search-logs...</a>
评论 #39506004 未加载
DicIfTEx大约 1 年前
There was also a theatre production produced around AOL User 927: <a href="https:&#x2F;&#x2F;arstechnica.com&#x2F;uncategorized&#x2F;2008&#x2F;05&#x2F;uare-what-u-seek-new-play-sparked-by-search-queries&#x2F;" rel="nofollow">https:&#x2F;&#x2F;arstechnica.com&#x2F;uncategorized&#x2F;2008&#x2F;05&#x2F;uare-what-u-se...</a><p>And a documentary series about User 711391: <a href="https:&#x2F;&#x2F;www.imdb.com&#x2F;title&#x2F;tt1455044&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.imdb.com&#x2F;title&#x2F;tt1455044&#x2F;</a>
acqbu大约 1 年前
<a href="https:&#x2F;&#x2F;archive.is&#x2F;sfAMv" rel="nofollow">https:&#x2F;&#x2F;archive.is&#x2F;sfAMv</a>
评论 #39501030 未加载
karaterobot大约 1 年前
Re-identification of supposedly anonymous data was a problem twenty years ago, and is a bigger problem today. Soon it may become a crisis, as the tools needed to do it become more and more turnkey, effective, and commodified. Now we dox as in a glass, darkly, etc.
评论 #39515081 未加载
mortallywounded大约 1 年前
Maybe some day we&#x27;ll have a PIR-based[1] search engine-- but then you couldn&#x27;t harvest data on your users and sell personalized ads, so maybe not.<p>[1]: <a href="https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;Private_information_retrieval" rel="nofollow">https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;Private_information_retrieval</a>
mmvasq大约 1 年前
My data is in this list someone used for &quot;research&quot;. They evidently correlated with Sony Everquest Chats, LinkedIn, Twitter, AOL, corporate email, Meta, Amazon Purchases, Natural Language Processing Data collections, some DFIR (digital forensics) data collection, HR records and so much more. I would say at the time they collected the info I was planning renaissance fest attire, studying security topics and frantically figuring out identities of some gamers who dogpiled on me as a few made genuine threats towards my life and livelihood.<p>I&#x27;d say yes, throw disinformation at at, as once I started noticing someone was really doing that level of tracking I did throw disinfo at it... a lot. The problem was with their interpretations from there of searches intentionally made to mess them up - as I neither had consent yet realized I was being stalked.<p>That went on and on and on - until people died - my father thought I was swatting him (I had a welfare check made - not a swat - as they were disclosing his financial advisor and info - he&#x27;s white and safe presumably) . Yet yes, people did die. It is evident no one understands it is never just them in research or in a channel for conversations. The result of the whole thing was my childhood best friend also being left homeless and run over (by someone opportunistic in the whole process). My brother de-housed by a gang member a party kept sending about (one actually poisoned my dog). My father alienated. My sister used as a puppet for extortion instead of being rehabbed.<p>It&#x27;s very very very ugly the string of deaths and alienation that came from what they thought was funny research into AI.<p>This old data needs to be located and disposed of or put into proper custodianship. It&#x27;s grown teeth that cost lives. Throwing misinfo at it is just crapping up the Internet more to push to web 3.0, where the same problem will thrive. Requires legislation. Not quite relaxed enough to articulate how and what right now due to the extensive harassment that came about from it all. Maybe some day. It&#x27;s been pretty terrorizing.
评论 #39559821 未加载
zx8080大约 1 年前
There&#x27;s one thing could work, but which would be very hard or impossible to implement: management personal responsibility for private data loss&#x2F;exposure.
elzbardico大约 1 年前
2006... If they only knew....
评论 #39501599 未加载