TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Historical programming-language groups disappearing from Google

737 点作者 beachwood23将近 5 年前

48 条评论

jedberg将近 5 年前
It&#x27;s funny, when I took a tour of the US Geological Survey, the curator of the collection hated Google (which was just a few blocks away). He said Google is great <i>now</i>, with all their maps, which were far more accurate and had better coverage than the USGS.<p>But what happens when they get bored with map data and get rid of it?<p>He had been ordered to turn over all of their historical arial archives for scanning by Google, and then told the USGS would no longer do arial scanning since Google was doing it. But there was no agreement for Google to turn over their arial scans back to the USGS.<p>At the time we all told him not to worry, Google would never remove data it had collected. Looks like he was a lot smarter than us.
评论 #23979250 未加载
评论 #23979687 未加载
评论 #23980204 未加载
评论 #23979327 未加载
评论 #23980174 未加载
评论 #23979729 未加载
评论 #23990866 未加载
评论 #23982110 未加载
评论 #23985386 未加载
评论 #23981963 未加载
评论 #23982599 未加载
评论 #23982083 未加载
评论 #23985729 未加载
评论 #23982371 未加载
评论 #23981234 未加载
评论 #23982187 未加载
评论 #23981442 未加载
评论 #23979065 未加载
评论 #23980238 未加载
synack将近 5 年前
Just recently I collected all of the archives of comp.lang.ada I could find and imported them into a public-inbox repository. There&#x27;s a gap around 1992 that I couldn&#x27;t find a copy of, but it&#x27;s otherwise complete. It took a few days to get everything into the right format and get SpamAssassin dialed in, but it would certainly be possible to do this for the other comp.* groups if one had the patience.<p><a href="https:&#x2F;&#x2F;archive.legitdata.co&#x2F;" rel="nofollow">https:&#x2F;&#x2F;archive.legitdata.co&#x2F;</a><p><a href="https:&#x2F;&#x2F;archive.legitdata.co&#x2F;comp.lang.ada&#x2F;" rel="nofollow">https:&#x2F;&#x2F;archive.legitdata.co&#x2F;comp.lang.ada&#x2F;</a><p><a href="https:&#x2F;&#x2F;public-inbox.org&#x2F;README.html" rel="nofollow">https:&#x2F;&#x2F;public-inbox.org&#x2F;README.html</a>
评论 #23979270 未加载
kazinator将近 5 年前
The vast majority of the spam content is injected into these newsgroups via Google Groups itself, and is not even seen on other NNTP servers.<p>Blocking posting access to these newsgroups from GG is generally a good thing for those newsgroups.<p>Not being able to search the archive is the unfortunate collateral damage though. Google is not obliged to provide a Usenet archive, I suppose.<p>Formerly obtained deep links to the content also do not work!<p>If you formely cited a comp.lang.lisp article by giving a direct link into Google Groups, people navigating it now get a permission error.
评论 #23978673 未加载
_kp6z将近 5 年前
Google&#x27;s handling of these critical archives they were given is pretty abhorrent. The usenet archives should really be made public since there is no business value to them and they don&#x27;t care about usenet.
评论 #23981467 未加载
评论 #23980883 未加载
评论 #23978926 未加载
评论 #23979532 未加载
评论 #23978909 未加载
jeffbee将近 5 年前
The fact that nobody had enough fucks to give to archive these groups tells you everything you need to know about decentralized peer-to-peer proof-of-work blockchain nerd hobbies. This content exists on a completely open peer-to-peer content distribution network and here you are whining that one company -- the company that already rescued this archive in a midnight U-Haul run 20 years ago -- failed to archive it.
评论 #23983264 未加载
评论 #23983637 未加载
none10287将近 5 年前
Google has bought dejanews and has profited immensely from open source and open information.<p>So I do think they have an obligation either a) to make the whole archive available for anyone or b) maintain it properly.<p>Properly means restoring the fast UI from around 2004.
评论 #23978528 未加载
评论 #23978495 未加载
评论 #23980295 未加载
icheishvili将近 5 年前
This type of behavior is why I can never consider GCP. How many people have been burned at this point by Google randomly shutting down something they rely on?
评论 #23979220 未加载
userbinator将近 5 年前
One thing that&#x27;s become extremely clear to me over the last decade or so is that almost all tech companies simply <i>do not care about the past</i>, and I suspect at least part of that is so their narrative of progress can be subjected to fewer challenges from those who look back and compare.<p>Also, and this may be a bit of a tangential point, but the &quot;deny the past because it has something <i>bad</i>&quot; that Google has effectively done here is uncomfortably close to the set of recent and far more political events.
评论 #23983649 未加载
评论 #23984167 未加载
Animats将近 5 年前
&quot;He who controls the present controls the past. He who controls the past controls the future&quot; - Orwell, &quot;1984&quot;
fmajid将近 5 年前
&gt; Usenet predates Google&#x27;s spam handling tools<p>In fact Usenet predates spam itself, since the first spam (Canter &amp; Siegel) was on Usenet itself in 1994 (I was there).
aidenn0将近 5 年前
Anyone know if anyone not google has newsgroup archives publicly accessible (The Internet Archive maybe?)
评论 #23978459 未加载
评论 #23978405 未加载
评论 #23983894 未加载
CrankyBear将近 5 年前
No, no, no. These groups and other Usenet groups archives must be preserved. They&#x27;re our history.
imhoguy将近 5 年前
Anyone looking for a hobby? It is time to become a data hoarder <a href="https:&#x2F;&#x2F;www.reddit.com&#x2F;r&#x2F;DataHoarder&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.reddit.com&#x2F;r&#x2F;DataHoarder&#x2F;</a>
rdiddly将近 5 年前
Either those Usenet groups are not part of the world, or they don&#x27;t consist of information, or Google just failed at &quot;organizing the world&#x27;s information.&quot;
评论 #23981902 未加载
WoodenChair将近 5 年前
I read the article and I read the threads here, and maybe I missed it—but why did these groups disappear? Were they banned due to bad words or a mistaken spam filter?
评论 #23979394 未加载
summerlight将近 5 年前
<a href="https:&#x2F;&#x2F;www.lumendatabase.org&#x2F;notices&#x2F;search?utf8=%E2%9C%93&amp;term=%22comp.lang.forth%22+%22comp.lang.lisp%22&amp;sort_by=" rel="nofollow">https:&#x2F;&#x2F;www.lumendatabase.org&#x2F;notices&#x2F;search?utf8=%E2%9C%93&amp;...</a><p>Looks like there has been (likely automated, nearly all of them are the same Italian phrase) mechanical legal complaints and it probably caused this instance of automated blocking going wild.<p>As an engineer I can understand the desire to automate everything, but please at least have some heuristics to detect this kind of easy-to-detect mechanical behavior before giving the model a full authority to block anyone it doesn&#x27;t like.
评论 #23986082 未加载
评论 #23983806 未加载
jolmg将近 5 年前
&gt; since there is no other comprehensive archive after Google&#x27;s purchase of Dejanews around 20 years ago<p>Was I naive in thinking that The Internet Archive would have long archived this type of thing?
评论 #23983921 未加载
msie将近 5 年前
WTF Google? Are you now so full of young programmers who have no respect for programming history? You’ve lost all greek cred that’s for sure.
mark_l_watson将近 5 年前
Too many people and companies don’t appreciate culture enough. Maintaining a cultural record should apparently not be left to just one company.<p>Thanks for posting this, it reminded me to donate again to archive.org, which I just did.<p>I use ‘culture’ to include anything creative, anything that we experience as humans. Everything should be preserved, schools should be well funded, as should the arts.
lkirk将近 5 年前
Is this something that the internet archive would preserve?
avodonosov将近 5 年前
There is a comp.lang.lisp archive published in 2009.<p>&gt; In 2009, Ron Garret published a 700MB archive file of all of comp.lang.lisp<p><a href="https:&#x2F;&#x2F;www.xach.com&#x2F;naggum&#x2F;articles&#x2F;notes.html" rel="nofollow">https:&#x2F;&#x2F;www.xach.com&#x2F;naggum&#x2F;articles&#x2F;notes.html</a>
rurban将近 5 年前
Ridiculous. They are blaming missing moderators, but only Google would be able to solve the spam problem. They open now these old forums, and Gmail is mostly spam free. Now you cannot even browse the archives. Where is the internet police when you need them.
zxcvbn4038将近 5 年前
For a long time I&#x27;ve wanted to revisit some the old Usenet stuff. I knew someone in the who ran a commercial usenet feed service in the early 90s and their whole setup depended heavily on low level backplane configuration, number of spindles, disk rotation speed, etc. - a lot of details that AWS hides from most of us. Using everything I&#x27;ve learned about distributed systems in the last thirty years I bet I could build a really awesome news feed today.<p>Of course the downside of Usenet was most people expected conversations to disappear after a couple weeks or a month but there was always some jerk that kept everything and refused to delete anything.
DoctorNick将近 5 年前
It&#x27;s becoming clear to me that Google has become a far, far worse monopoly than Microsoft ever was. Microsoft just controlled our computers; Google controls our access to history.
评论 #23979880 未加载
评论 #23979552 未加载
评论 #23979447 未加载
评论 #23980162 未加载
评论 #23979430 未加载
LockAndLol将近 5 年前
Why are people even relying on Google to keep any product alive? It&#x27;s a business, not a charity. They don&#x27;t do a single thing out of good will. It always has the goal of getting money in the short or long term. Knowing their quarterly obligations to shareholders, that&#x27;s probably short term.<p>These groups should be putting more effort into federalisation and decentralisation. Make it possible to store all of this data in a distributed fashion and stop relying on a central authority for archiving purposes.
评论 #23980028 未加载
cptnapalm将近 5 年前
I was learning C, once upon a time, and had a bug that I couldn&#x27;t figure out. It worked fine on Linux&#x2F;x86, but was wrong on Solaris&#x2F;sparc64. Deep Google diving found a newsgroup post from 1992 or so with a very similar problem; it was an endian problem. My search-fu may have been weak, but an old newsgroup post that helped me solve my problem, not stackoverflow or any other site.
NewEntryHN将近 5 年前
Either this archive exists elsewhere, either now is not the proper time for panic -- it was when Google became sole owner of this archive.
haecceity将近 5 年前
So Google Groups archives usenet stuff? Where are the usenet stuff hosted originally? How do I connect to it without Google Groups?
评论 #23979481 未加载
评论 #23979570 未加载
smsm42将近 5 年前
I think everybody should have learned the lesson now - do not trust Google - or any other major megacorp, but especially Google - to preserve any data for longer that they are contractually obliged to. If there needs to be historic preservation, it should be done by independent organization specifically created for that purpose.
fizixer将近 5 年前
Can anyone tell me how Google got hold of the whole usenet (I know it was like 15-20 years ago) which looks to me like a community service kinda thing.<p>Like when Google decided it&#x27;s going to host comp.lang.c, can there be only one comp.lang.c on the internet, or can someone else start hosting comp.lang.c as well?
评论 #23979976 未加载
DonHopkins将近 5 年前
Since when were Forth and Lisp historical programming languages??! People still use them. <i>HARUMPH!</i>
评论 #23982423 未加载
totalforge将近 5 年前
SELF FOOT SHOOT DUP
评论 #23979597 未加载
评论 #23978800 未加载
评论 #23980794 未加载
Arjuna144将近 5 年前
They are really shooting their own feet which such moves. They confirm, validate and strengthen the already existing trend to avoid vendor lock in at all cost and move to open, possibly self-hosted and export friendly platforms!<p>This is really bad marketing
jolmg将近 5 年前
&gt; Perhaps Google can be convinced to restore the content<p>The support ticket was deleted, so I guess not.
ryanmarsh将近 5 年前
Thank god. I said some really dumb shit on those lists in my youth that I regret.
grappler将近 5 年前
This kind of thing makes it really easy to get interested, and stay interested, in decentralization tech.<p>Once you see things in this light, the new flavor of the month online service just doesn&#x27;t hold any allure.
quantified将近 5 年前
(Repeating one of the comments from the post):<p>&gt; Has anyone (EFF?) considered the aspect of destroying evidence of prior art in the public domain?<p>I think there’s a case to be made for stewardship of these groups for that reason.
Havoc将近 5 年前
I&#x27;m hearing a fair bit of chatter in SEO circles about google de-indexing pages so this certainly rings true.<p>I guess there was this unjustified assumption that google only adds &amp; never subtracts.
hosh将近 5 年前
Maybe it is something that a non-profit dedicated towards preserving knowledge and internet content (such as Internet Archive) should be handling anyways.
bawolff将近 5 年前
Maybe these types of historical archives can be turned over to internet archive. I trust them a lot more than google for this.
Igelau将近 5 年前
If an AI decided to shut off comp.lang.lisp, I&#x27;d say it&#x27;s officially too late to solve the Alignment Problem.
photon-torpedo将近 5 年前
Guess comp.lang.lisp has too many posts with (((code))) in them... ;)
ZinniaZirconium将近 5 年前
alt.sex is still there and you don&#x27;t get an adult content warning unless you choose the desktop version.
ipunchghosts将近 5 年前
i would like to find the quickbasic archives. anyone know how i can get them?
评论 #23980287 未加载
bawana将近 5 年前
is google sinking? Between their mothballing&#x2F;deletion of services and the obnoxious signup ads on youtube. I am wondering what is going on?
评论 #23978973 未加载
评论 #23978901 未加载
评论 #23978958 未加载
gnabgib将近 5 年前
This is editorialized (actual title: &quot;Some Usenet groups suspended in Goggle Groups&quot;), or on LWN[1] &quot;Historical programming-language groups disappearing from Google&quot; (basically the same content)<p>[1]: <a href="https:&#x2F;&#x2F;lwn.net&#x2F;Articles&#x2F;827233&#x2F;" rel="nofollow">https:&#x2F;&#x2F;lwn.net&#x2F;Articles&#x2F;827233&#x2F;</a>
评论 #23978603 未加载
Ijumfs将近 5 年前
It was a terrible idea to entrust <i>ANYTHING</i> to Google.<p>Time to de-Google the whole Web.
staycoolboy将近 5 年前
On the plus side, evidence of my awful usenet etiquette from the late 80&#x27;s is disappearing with some of these groups.