Historical programming-language groups disappearing from Google

737 pointsby beachwood23almost 5 years ago

48 comments

jedbergalmost 5 years ago

It's funny, when I took a tour of the US Geological Survey, the curator of the collection hated Google (which was just a few blocks away). He said Google is great now, with all their maps, which were far more accurate and had better coverage than the USGS.But what happens when they get bored with map data and get rid of it?He had been ordered to turn over all of their historical arial archives for scanning by Google, and then told the USGS would no longer do arial scanning since Google was doing it. But there was no agreement for Google to turn over their arial scans back to the USGS.At the time we all told him not to worry, Google would never remove data it had collected. Looks like he was a lot smarter than us.

评论 #23979250 未加载

评论 #23979687 未加载

评论 #23980204 未加载

评论 #23979327 未加载

评论 #23980174 未加载

评论 #23979729 未加载

评论 #23990866 未加载

评论 #23982110 未加载

评论 #23985386 未加载

评论 #23981963 未加载

评论 #23982599 未加载

评论 #23982083 未加载

评论 #23985729 未加载

评论 #23982371 未加载

评论 #23981234 未加载

评论 #23982187 未加载

评论 #23981442 未加载

评论 #23979065 未加载

评论 #23980238 未加载

synackalmost 5 years ago

Just recently I collected all of the archives of comp.lang.ada I could find and imported them into a public-inbox repository. There's a gap around 1992 that I couldn't find a copy of, but it's otherwise complete. It took a few days to get everything into the right format and get SpamAssassin dialed in, but it would certainly be possible to do this for the other comp.* groups if one had the patience.<a href="https://archive.legitdata.co/" rel="nofollow">https://archive.legitdata.co/</a><a href="https://archive.legitdata.co/comp.lang.ada/" rel="nofollow">https://archive.legitdata.co/comp.lang.ada/</a><a href="https://public-inbox.org/README.html" rel="nofollow">https://public-inbox.org/README.html</a>

评论 #23979270 未加载

kazinatoralmost 5 years ago

The vast majority of the spam content is injected into these newsgroups via Google Groups itself, and is not even seen on other NNTP servers.Blocking posting access to these newsgroups from GG is generally a good thing for those newsgroups.Not being able to search the archive is the unfortunate collateral damage though. Google is not obliged to provide a Usenet archive, I suppose.Formerly obtained deep links to the content also do not work!If you formely cited a comp.lang.lisp article by giving a direct link into Google Groups, people navigating it now get a permission error.

评论 #23978673 未加载

_kp6zalmost 5 years ago

Google's handling of these critical archives they were given is pretty abhorrent. The usenet archives should really be made public since there is no business value to them and they don't care about usenet.

评论 #23981467 未加载

评论 #23980883 未加载

评论 #23978926 未加载

评论 #23979532 未加载

评论 #23978909 未加载

jeffbeealmost 5 years ago

The fact that nobody had enough fucks to give to archive these groups tells you everything you need to know about decentralized peer-to-peer proof-of-work blockchain nerd hobbies. This content exists on a completely open peer-to-peer content distribution network and here you are whining that one company -- the company that already rescued this archive in a midnight U-Haul run 20 years ago -- failed to archive it.

评论 #23983264 未加载

评论 #23983637 未加载

none10287almost 5 years ago

Google has bought dejanews and has profited immensely from open source and open information.So I do think they have an obligation either a) to make the whole archive available for anyone or b) maintain it properly.Properly means restoring the fast UI from around 2004.

评论 #23978528 未加载

评论 #23978495 未加载

评论 #23980295 未加载

icheishvilialmost 5 years ago

This type of behavior is why I can never consider GCP. How many people have been burned at this point by Google randomly shutting down something they rely on?

评论 #23979220 未加载

userbinatoralmost 5 years ago

One thing that's become extremely clear to me over the last decade or so is that almost all tech companies simply do not care about the past, and I suspect at least part of that is so their narrative of progress can be subjected to fewer challenges from those who look back and compare.Also, and this may be a bit of a tangential point, but the "deny the past because it has something bad" that Google has effectively done here is uncomfortably close to the set of recent and far more political events.

评论 #23983649 未加载

评论 #23984167 未加载

Animatsalmost 5 years ago

"He who controls the present controls the past. He who controls the past controls the future" - Orwell, "1984"

fmajidalmost 5 years ago

> Usenet predates Google's spam handling toolsIn fact Usenet predates spam itself, since the first spam (Canter & Siegel) was on Usenet itself in 1994 (I was there).

aidenn0almost 5 years ago

Anyone know if anyone not google has newsgroup archives publicly accessible (The Internet Archive maybe?)

评论 #23978459 未加载

评论 #23978405 未加载

评论 #23983894 未加载

CrankyBearalmost 5 years ago

No, no, no. These groups and other Usenet groups archives must be preserved. They're our history.

imhoguyalmost 5 years ago

Anyone looking for a hobby? It is time to become a data hoarder <a href="https://www.reddit.com/r/DataHoarder/" rel="nofollow">https://www.reddit.com/r/DataHoarder/</a>

rdiddlyalmost 5 years ago

Either those Usenet groups are not part of the world, or they don't consist of information, or Google just failed at "organizing the world's information."

评论 #23981902 未加载

WoodenChairalmost 5 years ago

I read the article and I read the threads here, and maybe I missed it—but why did these groups disappear? Were they banned due to bad words or a mistaken spam filter?

评论 #23979394 未加载

summerlightalmost 5 years ago

<a href="https://www.lumendatabase.org/notices/search?utf8=%E2%9C%93&term=%22comp.lang.forth%22+%22comp.lang.lisp%22&sort_by=" rel="nofollow">https://www.lumendatabase.org/notices/search?utf8=%E2%9C%93&...</a>Looks like there has been (likely automated, nearly all of them are the same Italian phrase) mechanical legal complaints and it probably caused this instance of automated blocking going wild.As an engineer I can understand the desire to automate everything, but please at least have some heuristics to detect this kind of easy-to-detect mechanical behavior before giving the model a full authority to block anyone it doesn't like.

评论 #23986082 未加载

评论 #23983806 未加载

jolmgalmost 5 years ago

> since there is no other comprehensive archive after Google's purchase of Dejanews around 20 years agoWas I naive in thinking that The Internet Archive would have long archived this type of thing?

评论 #23983921 未加载

msiealmost 5 years ago

WTF Google? Are you now so full of young programmers who have no respect for programming history? You’ve lost all greek cred that’s for sure.

mark_l_watsonalmost 5 years ago

Too many people and companies don’t appreciate culture enough. Maintaining a cultural record should apparently not be left to just one company.Thanks for posting this, it reminded me to donate again to archive.org, which I just did.I use ‘culture’ to include anything creative, anything that we experience as humans. Everything should be preserved, schools should be well funded, as should the arts.

lkirkalmost 5 years ago

Is this something that the internet archive would preserve?

avodonosovalmost 5 years ago

There is a comp.lang.lisp archive published in 2009.> In 2009, Ron Garret published a 700MB archive file of all of comp.lang.lisp<a href="https://www.xach.com/naggum/articles/notes.html" rel="nofollow">https://www.xach.com/naggum/articles/notes.html</a>

rurbanalmost 5 years ago

Ridiculous. They are blaming missing moderators, but only Google would be able to solve the spam problem. They open now these old forums, and Gmail is mostly spam free. Now you cannot even browse the archives. Where is the internet police when you need them.

zxcvbn4038almost 5 years ago

For a long time I've wanted to revisit some the old Usenet stuff. I knew someone in the who ran a commercial usenet feed service in the early 90s and their whole setup depended heavily on low level backplane configuration, number of spindles, disk rotation speed, etc. - a lot of details that AWS hides from most of us. Using everything I've learned about distributed systems in the last thirty years I bet I could build a really awesome news feed today.Of course the downside of Usenet was most people expected conversations to disappear after a couple weeks or a month but there was always some jerk that kept everything and refused to delete anything.

DoctorNickalmost 5 years ago

It's becoming clear to me that Google has become a far, far worse monopoly than Microsoft ever was. Microsoft just controlled our computers; Google controls our access to history.

评论 #23979880 未加载

评论 #23979552 未加载

评论 #23979447 未加载

评论 #23980162 未加载

评论 #23979430 未加载

LockAndLolalmost 5 years ago

Why are people even relying on Google to keep any product alive? It's a business, not a charity. They don't do a single thing out of good will. It always has the goal of getting money in the short or long term. Knowing their quarterly obligations to shareholders, that's probably short term.These groups should be putting more effort into federalisation and decentralisation. Make it possible to store all of this data in a distributed fashion and stop relying on a central authority for archiving purposes.

评论 #23980028 未加载

cptnapalmalmost 5 years ago

I was learning C, once upon a time, and had a bug that I couldn't figure out. It worked fine on Linux/x86, but was wrong on Solaris/sparc64. Deep Google diving found a newsgroup post from 1992 or so with a very similar problem; it was an endian problem. My search-fu may have been weak, but an old newsgroup post that helped me solve my problem, not stackoverflow or any other site.

NewEntryHNalmost 5 years ago

Either this archive exists elsewhere, either now is not the proper time for panic -- it was when Google became sole owner of this archive.

haecceityalmost 5 years ago

So Google Groups archives usenet stuff? Where are the usenet stuff hosted originally? How do I connect to it without Google Groups?

评论 #23979481 未加载

评论 #23979570 未加载

smsm42almost 5 years ago

I think everybody should have learned the lesson now - do not trust Google - or any other major megacorp, but especially Google - to preserve any data for longer that they are contractually obliged to. If there needs to be historic preservation, it should be done by independent organization specifically created for that purpose.

fizixeralmost 5 years ago

Can anyone tell me how Google got hold of the whole usenet (I know it was like 15-20 years ago) which looks to me like a community service kinda thing.Like when Google decided it's going to host comp.lang.c, can there be only one comp.lang.c on the internet, or can someone else start hosting comp.lang.c as well?

评论 #23979976 未加载

DonHopkinsalmost 5 years ago

Since when were Forth and Lisp historical programming languages??! People still use them. HARUMPH!

评论 #23982423 未加载

totalforgealmost 5 years ago

SELF FOOT SHOOT DUP

评论 #23979597 未加载

评论 #23978800 未加载

评论 #23980794 未加载

Arjuna144almost 5 years ago

They are really shooting their own feet which such moves. They confirm, validate and strengthen the already existing trend to avoid vendor lock in at all cost and move to open, possibly self-hosted and export friendly platforms!This is really bad marketing

jolmgalmost 5 years ago

> Perhaps Google can be convinced to restore the contentThe support ticket was deleted, so I guess not.

ryanmarshalmost 5 years ago

Thank god. I said some really dumb shit on those lists in my youth that I regret.

grappleralmost 5 years ago

This kind of thing makes it really easy to get interested, and stay interested, in decentralization tech.Once you see things in this light, the new flavor of the month online service just doesn't hold any allure.

quantifiedalmost 5 years ago

(Repeating one of the comments from the post):> Has anyone (EFF?) considered the aspect of destroying evidence of prior art in the public domain?I think there’s a case to be made for stewardship of these groups for that reason.

Havocalmost 5 years ago

I'm hearing a fair bit of chatter in SEO circles about google de-indexing pages so this certainly rings true.I guess there was this unjustified assumption that google only adds & never subtracts.

hoshalmost 5 years ago

Maybe it is something that a non-profit dedicated towards preserving knowledge and internet content (such as Internet Archive) should be handling anyways.

bawolffalmost 5 years ago

Maybe these types of historical archives can be turned over to internet archive. I trust them a lot more than google for this.

Igelaualmost 5 years ago

If an AI decided to shut off comp.lang.lisp, I'd say it's officially too late to solve the Alignment Problem.

photon-torpedoalmost 5 years ago

Guess comp.lang.lisp has too many posts with (((code))) in them... ;)

ZinniaZirconiumalmost 5 years ago

alt.sex is still there and you don't get an adult content warning unless you choose the desktop version.

ipunchghostsalmost 5 years ago

i would like to find the quickbasic archives. anyone know how i can get them?

评论 #23980287 未加载

bawanaalmost 5 years ago

is google sinking? Between their mothballing/deletion of services and the obnoxious signup ads on youtube. I am wondering what is going on?

评论 #23978973 未加载

评论 #23978901 未加载

评论 #23978958 未加载

gnabgibalmost 5 years ago

This is editorialized (actual title: "Some Usenet groups suspended in Goggle Groups"), or on LWN[1] "Historical programming-language groups disappearing from Google" (basically the same content)[1]: <a href="https://lwn.net/Articles/827233/" rel="nofollow">https://lwn.net/Articles/827233/</a>

评论 #23978603 未加载

Ijumfsalmost 5 years ago

It was a terrible idea to entrust ANYTHING to Google.Time to de-Google the whole Web.

staycoolboyalmost 5 years ago

On the plus side, evidence of my awful usenet etiquette from the late 80's is disappearing with some of these groups.

48 comments

jedbergalmost 5 years ago

评论 #23979250 未加载

评论 #23979687 未加载

评论 #23980204 未加载

评论 #23979327 未加载

评论 #23980174 未加载

评论 #23979729 未加载

评论 #23990866 未加载

评论 #23982110 未加载

评论 #23985386 未加载

评论 #23981963 未加载

评论 #23982599 未加载

评论 #23982083 未加载

评论 #23985729 未加载

评论 #23982371 未加载

评论 #23981234 未加载

评论 #23982187 未加载

评论 #23981442 未加载

评论 #23979065 未加载

评论 #23980238 未加载

synackalmost 5 years ago

评论 #23979270 未加载

kazinatoralmost 5 years ago

评论 #23978673 未加载

_kp6zalmost 5 years ago

评论 #23981467 未加载

评论 #23980883 未加载

评论 #23978926 未加载

评论 #23979532 未加载

评论 #23978909 未加载

jeffbeealmost 5 years ago

评论 #23983264 未加载

评论 #23983637 未加载

none10287almost 5 years ago

评论 #23978528 未加载

评论 #23978495 未加载

评论 #23980295 未加载

icheishvilialmost 5 years ago

This type of behavior is why I can never consider GCP. How many people have been burned at this point by Google randomly shutting down something they rely on?

评论 #23979220 未加载

userbinatoralmost 5 years ago

评论 #23983649 未加载

评论 #23984167 未加载

Animatsalmost 5 years ago

"He who controls the present controls the past. He who controls the past controls the future" - Orwell, "1984"

fmajidalmost 5 years ago

> Usenet predates Google's spam handling toolsIn fact Usenet predates spam itself, since the first spam (Canter & Siegel) was on Usenet itself in 1994 (I was there).

aidenn0almost 5 years ago

Anyone know if anyone not google has newsgroup archives publicly accessible (The Internet Archive maybe?)

评论 #23978459 未加载

评论 #23978405 未加载

评论 #23983894 未加载

CrankyBearalmost 5 years ago

No, no, no. These groups and other Usenet groups archives must be preserved. They're our history.

imhoguyalmost 5 years ago

Anyone looking for a hobby? It is time to become a data hoarder <a href="https://www.reddit.com/r/DataHoarder/" rel="nofollow">https://www.reddit.com/r/DataHoarder/</a>

rdiddlyalmost 5 years ago

Either those Usenet groups are not part of the world, or they don't consist of information, or Google just failed at "organizing the world's information."

评论 #23981902 未加载

WoodenChairalmost 5 years ago

I read the article and I read the threads here, and maybe I missed it—but why did these groups disappear? Were they banned due to bad words or a mistaken spam filter?

评论 #23979394 未加载

summerlightalmost 5 years ago

评论 #23986082 未加载

评论 #23983806 未加载

jolmgalmost 5 years ago

评论 #23983921 未加载

msiealmost 5 years ago

WTF Google? Are you now so full of young programmers who have no respect for programming history? You’ve lost all greek cred that’s for sure.

mark_l_watsonalmost 5 years ago

lkirkalmost 5 years ago

Is this something that the internet archive would preserve?

avodonosovalmost 5 years ago

rurbanalmost 5 years ago

zxcvbn4038almost 5 years ago

DoctorNickalmost 5 years ago

It's becoming clear to me that Google has become a far, far worse monopoly than Microsoft ever was. Microsoft just controlled our computers; Google controls our access to history.

评论 #23979880 未加载

评论 #23979552 未加载

评论 #23979447 未加载

评论 #23980162 未加载

评论 #23979430 未加载

LockAndLolalmost 5 years ago

评论 #23980028 未加载

cptnapalmalmost 5 years ago

NewEntryHNalmost 5 years ago

Either this archive exists elsewhere, either now is not the proper time for panic -- it was when Google became sole owner of this archive.

haecceityalmost 5 years ago

So Google Groups archives usenet stuff? Where are the usenet stuff hosted originally? How do I connect to it without Google Groups?

评论 #23979481 未加载

评论 #23979570 未加载

smsm42almost 5 years ago

fizixeralmost 5 years ago

评论 #23979976 未加载

DonHopkinsalmost 5 years ago

Since when were Forth and Lisp historical programming languages??! People still use them. HARUMPH!

评论 #23982423 未加载

totalforgealmost 5 years ago

SELF FOOT SHOOT DUP

评论 #23979597 未加载

评论 #23978800 未加载

评论 #23980794 未加载

Arjuna144almost 5 years ago

jolmgalmost 5 years ago

> Perhaps Google can be convinced to restore the contentThe support ticket was deleted, so I guess not.

ryanmarshalmost 5 years ago

Thank god. I said some really dumb shit on those lists in my youth that I regret.

grappleralmost 5 years ago

quantifiedalmost 5 years ago

Havocalmost 5 years ago

I'm hearing a fair bit of chatter in SEO circles about google de-indexing pages so this certainly rings true.I guess there was this unjustified assumption that google only adds & never subtracts.

hoshalmost 5 years ago

Maybe it is something that a non-profit dedicated towards preserving knowledge and internet content (such as Internet Archive) should be handling anyways.

bawolffalmost 5 years ago

Maybe these types of historical archives can be turned over to internet archive. I trust them a lot more than google for this.

Igelaualmost 5 years ago

If an AI decided to shut off comp.lang.lisp, I'd say it's officially too late to solve the Alignment Problem.

photon-torpedoalmost 5 years ago

Guess comp.lang.lisp has too many posts with (((code))) in them... ;)

ZinniaZirconiumalmost 5 years ago

alt.sex is still there and you don't get an adult content warning unless you choose the desktop version.

ipunchghostsalmost 5 years ago

i would like to find the quickbasic archives. anyone know how i can get them?

评论 #23980287 未加载

bawanaalmost 5 years ago

is google sinking? Between their mothballing/deletion of services and the obnoxious signup ads on youtube. I am wondering what is going on?

评论 #23978973 未加载

评论 #23978901 未加载

评论 #23978958 未加载

gnabgibalmost 5 years ago

评论 #23978603 未加载

Ijumfsalmost 5 years ago

It was a terrible idea to entrust ANYTHING to Google.Time to de-Google the whole Web.

staycoolboyalmost 5 years ago

On the plus side, evidence of my awful usenet etiquette from the late 80's is disappearing with some of these groups.