I should update my sexy map finder: http://exclav.es/2016/05/20/sexy-maps/
Forgive my ignorance of ML, but the last bit, "you'll need your own porn to train on", confused me. Does this mean they're just exposing the rough topology of their neural net (e.g. depth) and not the actual weights between nodes? I'm curious to learn from an ML expert how much this actually offers.
Direct link to GitHub: https://github.com/yahoo/open_nsfw
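To answer the question above: the repo ships the full trained weights (a Caffe .caffemodel) along with the network definition, so inference works out of the box; what you can't do without your own data is retrain or fine-tune it. A minimal inference sketch, assuming the file names the repo appears to use (the canonical version is the repo's classify_nsfw.py):

    import numpy as np
    import caffe

    # Load the released network definition and trained weights.
    net = caffe.Net('nsfw_model/deploy.prototxt',
                    'nsfw_model/resnet_50_1by1_nsfw.caffemodel',
                    caffe.TEST)

    # Preprocess to match the network's training regime: CHW layout,
    # BGR channel order, mean subtraction, 0-255 pixel scale.
    transformer = caffe.io.Transformer({'data': net.blobs['data'].data.shape})
    transformer.set_transpose('data', (2, 0, 1))
    transformer.set_mean('data', np.array([104.0, 117.0, 123.0]))
    transformer.set_raw_scale('data', 255)
    transformer.set_channel_swap('data', (2, 1, 0))

    img = caffe.io.load_image('test.jpg')  # float RGB in [0, 1]
    net.blobs['data'].data[...] = transformer.preprocess('data', img)
    prob = net.forward()['prob'][0]        # softmax over [SFW, NSFW]
    print('NSFW score: %.3f' % prob[1])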
Has anyone tried taking the features that are learned at the various layers of a neural net and feeding them into something like this? https://news.ycombinator.com/item?id=12612246

I imagine we would get some really interesting images back...
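No idea what it would produce, but pulling features out of the released model is straightforward, since every intermediate blob in a Caffe net is inspectable. A sketch, reusing the loaded net from the snippet above (the layer name at the end is a placeholder I haven't checked against the repo):

    # After filling net.blobs['data'] as in the inference sketch:
    net.forward()

    # List every intermediate blob to find a layer worth visualizing.
    for name, blob in net.blobs.items():
        print(name, blob.data.shape)

    # 'pool5' is a guess at a late pooling layer; substitute a real
    # name from the listing above.
    features = net.blobs['pool5'].data.copy()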
> We are not releasing the training images or other details due to the nature of the data, but instead we open source the output model which can be used for classification by a developer.

I'm guessing the one who had to input the data/images had a fun time at work :p
They acknowledge that NSFW (or pornographic) is hard to define, a la "I know it when I see it".

But looking at the meager three sample images, I'm already confused about the scoring. Why does the one in the middle score the highest?

The question is an honest one. The two rightmost images seem interchangeable to me and are ~boring~: people at the beach. Is this network therefore already trained to include the biases of its creators?
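For what it's worth, the README frames the score with two thresholds rather than a single cutoff, which at least makes the ambiguity explicit. A sketch of that reading (the rough numbers come from the README; tune them to your own tolerance):

    def interpret(nsfw_score, safe_below=0.2, nsfw_above=0.8):
        """Map the model's NSFW probability onto the README's guidance:
        scores below 0.2 are likely safe, above 0.8 likely NSFW, and
        everything in between needs human review."""
        if nsfw_score < safe_below:
            return 'likely safe'
        if nsfw_score > nsfw_above:
            return 'likely NSFW'
        return 'ambiguous, review manually'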
My first thought was from years ago, when I was pitching open source forensic services to London police (I did not get far, bad salesman that I am).

Cataloguing and categorising seized pornography is a nasty job, and one that cops across the planet might do better with good common OSS tools.

Hopefully this will help.
My first thought: this would probably be very useful for sites cracking down on inappropriate content.

My second thought: I could probably use this to find porn in unexpected places via a web-scraping Python program.
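The scraping half is a few lines; the scoring half can wrap the pycaffe snippet above. A sketch, where nsfw_score() is a hypothetical helper that runs those inference steps on raw image bytes:

    import requests
    from bs4 import BeautifulSoup
    from urllib.parse import urljoin

    def find_nsfw(page_url, threshold=0.8):
        """Collect every <img> on a page and return the URLs whose
        NSFW score exceeds the threshold."""
        html = requests.get(page_url, timeout=10).text
        soup = BeautifulSoup(html, 'html.parser')
        hits = []
        for img in soup.find_all('img', src=True):
            img_url = urljoin(page_url, img['src'])
            image_bytes = requests.get(img_url, timeout=10).content
            if nsfw_score(image_bytes) > threshold:  # hypothetical helper
                hits.append(img_url)
        return hits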
Good to see they've automated this (beyond the initial classification of training data). In the early days of the web, such filters were typically based on manually maintained lists of sites. I actually met someone at a party once whose full-time job was to surf for porn, to maintain the filter for a provider of IT services to schools (he worked for a company now called RM Education). He said it was his ideal job for the first few days, but it soon grew tiresome (note that back in those days there wasn't really any extremely objectionable material on the web).
I'm not a deep learning person whatsoever, but I do have an interesting use case that I won't disclose publicly: is there a way to build this so it outputs detections labelled by the, uh, object it has detected?

e.g.

penis 0.94

vagina 0.01
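As far as I can tell, the released model can't do this: its output is a single softmax over two classes for the whole image, not per-object detections. A sketch of what it actually returns, continuing from the inference snippet above:

    # The whole image gets exactly two numbers that sum to 1.
    prob = net.forward()['prob'][0]
    print('SFW:  %.3f' % prob[0])
    print('NSFW: %.3f' % prob[1])

    # Per-object output like "penis 0.94" would need a detection
    # architecture (e.g. Faster R-CNN) trained on per-class bounding
    # boxes, which is exactly the labeled data Yahoo isn't releasing.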
Aren't there more important problems to work on than worrying about someone looking at naked people? This is just what we need: more effort spent on censoring and controlling people.
Reminds me of this post from hackerfactor where he describes his own porn filter based on pHash:

http://www.hackerfactor.com/blog/index.php?/archives/529-Kind-of-Like-That.html

It'd be interesting to see a direct comparison of the two. Off the cuff, I'd expect the deep neural network to be more accurate and better at generalizing, but much more expensive to train.
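The two approaches solve different problems: pHash flags near-duplicates of images you've already catalogued, while the CNN generalizes to images nobody has seen before. The pHash side is cheap to try; a minimal sketch using the imagehash library as a stand-in for hackerfactor's own implementation:

    from PIL import Image
    import imagehash

    # 64-bit perceptual hashes: visually similar images hash to
    # nearby values even after resizing or recompression.
    h1 = imagehash.phash(Image.open('known_image.jpg'))
    h2 = imagehash.phash(Image.open('candidate.jpg'))

    # Subtraction gives the Hamming distance between the hashes;
    # small distances suggest the same or a near-duplicate image.
    if h1 - h2 <= 10:
        print('probable match')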
Another work in this field: "Adult video content detection using Machine Learning Techniques". PDF: http://colorlab.no/content/download/37238/470343/file/VictorTorres_MasterThesis.pdf
Awesome!

I have been using nude.js to do this ( http://s.codepen.io/icodeforlove/debug/gMrEKV ), which is hit or miss.
To be precise, they are only releasing the already-trained model; the associated dataset is not being made public.

Thus it is meant for off-the-shelf use, rather than for tinkering with the network to produce nuanced results.
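That said, released weights are still a useful starting point: standard Caffe fine-tuning lets you continue training on your own labeled data. A hedged sketch (solver.prototxt and the training data behind it are things you'd have to build yourself):

    import caffe

    # A standard Caffe fine-tuning loop: build a solver around your own
    # train/val data, initialize from the released weights instead of
    # from scratch, then train per the solver's schedule.
    solver = caffe.SGDSolver('solver.prototxt')
    solver.net.copy_from('nsfw_model/resnet_50_1by1_nsfw.caffemodel')
    solver.solve()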
I wonder what would happen if we stopped firing people for watching NSFW images. I mean, bosses look at NSFW images all the time, and it sounds like a shallow reason to fire someone.
I would suggest that the link should go to Yahoo's blog post,

https://yahooeng.tumblr.com/post/151148689421/open-sourcing-a-deep-learning-solution-for

which contains some technical details. (And furthermore, I guess the HN crowd has enough Internet experience to come up with stupid jokes of their own design.)
The Yahoo blog post[1] is far more interesting than this TechCrunch "article". Suggest changing the URL to the Yahoo blog, please.

[1] https://yahooeng.tumblr.com/post/151148689421/open-sourcing-a-deep-learning-solution-for
So this is what Yahoo was up to for the last 10 years, instead of building any sort of security, keeping Yahoo Messenger working properly, or anything else of value? Heckuva job, Yahoo.