TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Stupid Filter Corpus (2007)

1 pointsby mofosyne12 months ago

1 comment

mofosyne12 months ago
I wonder if anyone remembers the stupid filter project around 2007.<p>There was an attempt to filter out stupid comments in social media using a vector filter as a plugin for wordpress initially. Project somewhat fell though since it turns out it&#x27;s quite hard to filter out stupidity in the internet.<p>While the main source code may be outdated, the corpus may still be of some use in training modern LLMs based content moderation or at least a historical curio of humanities attempt to stem the tied of stupidity.<p>I&#x27;ve converted the original MySQL dump into an sqlite database and csv file so it should be easier for anyone interested to give this corpus a shot with new machine learning advances these days.