TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

How I trained an AI to detect satire in under an hour

28 pointsby thetall0neabout 7 years ago

8 comments

colinhmitabout 7 years ago
Anecdotally, I&#x27;ve noticed the onion uses certain phrases over and over again in articles - &quot;area man&quot; comes to mind.<p>Did you really train it to detect satire, or just the onion writer&#x27;s conventions? How does it perform when trained on onion articles and tested against some non-onion satire publication?
评论 #16888153 未加载
评论 #16888247 未加载
minimaxirabout 7 years ago
This has the same input data fidelity issues as the author&#x27;s previous approach toward identifying fake news, which was flagged to death for being misleading: <a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=16128295" rel="nofollow">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=16128295</a><p>A sample size of 600 for <i>text data</i> is literally nothing for these types of models. (although atleast the classes are balanced this time)
latenightcodingabout 7 years ago
This is a thinly veiled &quot;machinebox&quot; ad. Thanks for teaching us how to overfit in under an hour OP.
评论 #16888313 未加载
happertigerabout 7 years ago
As always it’s easy to apply this technology to differentiated content from a single publisher vs another publisher, as is the case here. In addition the onion is satire, but satire is the easier use case because not only is it single source content (as mentioned, but the larger the author pool and the more differentiated the model the higher difficulty accuracy becomes) but it doesn’t have to take into account less outrageous articles built on subtler genres like parody and sarcasm. Subtle crushes machine learning algs ime.<p>Love the concept, but it’s be great to see a deeper exploration as a demo. Keep going!
MajorSauceabout 7 years ago
Without diminishing the author&#x27;s efforts, I would say that he quickly teaches his AI how to recognize The Onion articles, instead of satire articles in general.<p>I would be curious to see results with 3-4 news sources for each groups.
评论 #16888267 未加载
komali2about 7 years ago
This is what I loved about Google cloud machine learning API (or whatever mixture of the above nouns it&#x27;s called now). I found it during my final project as a coding bootcamp student and got it up and running within a day, telling me whether a sentence was in one of three given languages.<p>Machine learning &#x2F; ai things like this are so simple and approachable right now. Just fill a .CSV and upload it, boom, training model.
mozumderabout 7 years ago
Can&#x27;t tell if this is satire or not.. author needs to put his own article into his classifier.
Daniel3about 7 years ago
I think the author has used the word &quot;trained&quot; satirically.