TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Ask YC: Bayesian filter for NSFW content ?

4 pointsby ptmabout 17 years ago
I've just launched No-NSFW (NSFW content warning system) which relies on user feedback to determine site ratings.<p>I'm now thinking of introducing a Bayesian filter to determine site content. Does this make sense ?<p>Also, where do I hunt for seed data - I'm using nsfw.reddit for NSFW data (thanks kirubakaran), what do i use for SFW data ?

3 comments

raabout 17 years ago
Also have a look at DansGuardian <a href="http://dansguardian.org/" rel="nofollow">http://dansguardian.org/</a>. Blacklist files are available here: <a href="http://urlblacklist.com/" rel="nofollow">http://urlblacklist.com/</a><p>I'm not sure what you are looking for in terms of safe for work data; maybe technorati tags?
rmsabout 17 years ago
Google safesearch might help... <a href="http://www.google.com/support/bin/static.py?page=searchguides.html&#38;ctx=preferences&#38;hl=en" rel="nofollow">http://www.google.com/support/bin/static.py?page=searchguide...</a>
xenoterracideabout 17 years ago
NSFW? I'm not familiar with the term (yes I could google it but perhaps you could enlighten those of us who aren't, so we don't all have to.)
评论 #181605 未加载