Anecdotally, I've noticed The Onion uses certain phrases over and over again in its articles - "area man" comes to mind.<p>Did you really train it to detect satire, or just The Onion writers' conventions? How does it perform when trained on Onion articles and tested against some non-Onion satire publication?
This has the same input data fidelity issues as the author's previous approach toward identifying fake news, which was flagged to death for being misleading: <a href="https://news.ycombinator.com/item?id=16128295" rel="nofollow">https://news.ycombinator.com/item?id=16128295</a><p>A sample size of 600 for <i>text data</i> is literally nothing for these types of models. (although at least the classes are balanced this time)
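<p>A back-of-envelope way to see how little a few hundred examples pin down: the 95% binomial interval on any accuracy measured over a small held-out set. (Illustrative numbers only; the project's exact train/test split isn't stated here.)

```python
import math

def accuracy_ci95(acc, n):
    """Normal-approximation 95% confidence interval for an accuracy
    measured on n held-out examples."""
    half = 1.96 * math.sqrt(acc * (1 - acc) / n)
    return (acc - half, acc + half)

# e.g. a hypothetical 20% test split of 600 examples = 120 test items
lo, hi = accuracy_ci95(0.90, 120)
print(f"{lo:.3f} .. {hi:.3f}")  # roughly 0.846 .. 0.954
```

So even a "90% accurate" result on a split that small is consistent with anything from mid-80s to mid-90s true accuracy.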
As always, it’s easy to apply this technology to differentiate content from one publisher vs. another, as is the case here. The Onion is satire, but satire is the easier use case: not only is it single-source content (and the larger the author pool and the more varied the writing, the harder accuracy becomes), it also doesn’t have to account for less outrageous articles built on subtler genres like parody and sarcasm. Subtlety crushes machine learning algs, ime.<p>Love the concept, but it’d be great to see a deeper exploration as a demo. Keep going!
Without diminishing the author's efforts, I would say that he has effectively taught his AI to recognize The Onion's articles, rather than satire articles in general.<p>I would be curious to see results with 3-4 news sources for each group.
This is what I loved about the Google Cloud machine learning API (or whatever mixture of those nouns it's called now). I found it during my final project as a coding bootcamp student and got it up and running within a day, telling me whether a sentence was in one of three given languages.<p>Machine learning / AI tools like this are so simple and approachable right now. Just fill a .csv and upload it, boom, trained model.
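<p>The "CSV of labeled sentences in, classifier out" workflow can be sketched with nothing but the standard library (this is not the Google Cloud API, just a classic character-trigram naive Bayes baseline for language ID; the sample rows are made up, not the bootcamp project's data):

```python
import csv, io, math
from collections import Counter, defaultdict

def trigrams(text):
    # pad so word boundaries become trigrams too
    t = f"  {text.lower()}  "
    return [t[i:i + 3] for i in range(len(t) - 2)]

def train(rows):
    """rows: (sentence, language) pairs, e.g. parsed from the uploaded CSV."""
    counts = defaultdict(Counter)
    for sentence, lang in rows:
        counts[lang].update(trigrams(sentence))
    return counts

def classify(counts, sentence):
    grams = trigrams(sentence)
    vocab = len(set().union(*(set(c) for c in counts.values())))
    def score(lang):
        total = sum(counts[lang].values())
        # add-one smoothed log-likelihood of the trigrams under each language
        return sum(math.log((counts[lang][g] + 1) / (total + vocab)) for g in grams)
    return max(counts, key=score)

# Tiny illustrative "CSV" (hypothetical data):
data = io.StringIO(
    "the cat sat on the mat,en\n"
    "where is the train station,en\n"
    "le chat est sur la table,fr\n"
    "ou est la gare s'il vous plait,fr\n"
    "el gato esta en la mesa,es\n"
    "donde esta la estacion de tren,es\n"
)
model = train(csv.reader(data))
print(classify(model, "the dog is here"))  # 'en' on this toy data
```

The hosted version obviously does far more, but the shape of the task - label a column, upload, predict - really is about this simple.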