TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Ask YC: Bayesian filter for NSFW content ?

4 点作者 ptm大约 17 年前
I've just launched No-NSFW (NSFW content warning system) which relies on user feedback to determine site ratings.<p>I'm now thinking of introducing a Bayesian filter to determine site content. Does this make sense ?<p>Also, where do I hunt for seed data - I'm using nsfw.reddit for NSFW data (thanks kirubakaran), what do i use for SFW data ?

3 条评论

ra大约 17 年前
Also have a look at DansGuardian <a href="http://dansguardian.org/" rel="nofollow">http://dansguardian.org/</a>. Blacklist files are available here: <a href="http://urlblacklist.com/" rel="nofollow">http://urlblacklist.com/</a><p>I'm not sure what you are looking for in terms of safe for work data; maybe technorati tags?
rms大约 17 年前
Google safesearch might help... <a href="http://www.google.com/support/bin/static.py?page=searchguides.html&#38;ctx=preferences&#38;hl=en" rel="nofollow">http://www.google.com/support/bin/static.py?page=searchguide...</a>
xenoterracide大约 17 年前
NSFW? I'm not familiar with the term (yes I could google it but perhaps you could enlighten those of us who aren't, so we don't all have to.)
评论 #181605 未加载