I wonder if anyone remembers the stupid filter project from around 2007.<p>It was an attempt to filter out stupid comments on social media using a vector filter, initially as a plugin for WordPress. The project more or less fell through, since it turns out it's quite hard to filter out stupidity on the internet.<p>While the main source code may be outdated, the corpus may still be of some use for training modern LLM-based content moderation, or at least as a historical curio of humanity's attempt to stem the tide of stupidity.<p>I've converted the original MySQL dump into an SQLite database and a CSV file, so it should be easier for anyone interested to give this corpus a shot with today's machine learning advances.