
Perspective: Google API that detects toxic comments

92 points by blacksmythe · almost 8 years ago

45 comments

kristianc · almost 8 years ago

An API for censorship, that really is something.

One of our great facets as humans is the ability to sort through information and come to opinions based on it.

We come pre-equipped with our own API for critical reasoning, honed by thousands of years of evolution, and with a training set of everything we've ever done and seen.

Why would I want to use a copy of that, that *at best* will only replicate human biases?

And that risks sidelining perfectly valid viewpoints? Does something have to be expressed in college-educated grammar to even be *read* with this in place? What are the consequences of that?

My first instinct, and I hope yours, was to push back the slider.

EDIT: I pasted the first paragraph of the Google diversity memo into it, and it got a "10% likely to be perceived as toxic" score. Make of that what you will.
dvt · almost 8 years ago

It's an interesting concept, but falls short when comments are sarcastic or contain contextualized information.

It seems like this is more of a "word filter" than anything. I mean, sure, it's pretty obvious a comment will be toxic if it contains "asshole" in it. Here are some examples where the labeling fails. Some of these I got from real comments and articles on CNN or Breitbart.

    # Toxicity too low
    34% toxic: Obama was born in Africa and he should move back there.
    40% toxic: George Soros and his Jewish sympathizers want to destabilize our government.
    25% toxic: Breitbart is run by jews - as is all mass media. As the saying goes, "the best way to control the opposition is to lead it".
    47% toxic: Why do you expect anyone to take you seriously when you don't take yourself seriously.

    # Toxicity too high
    51% toxic: Ivanka is arguably the most powerful Jewish woman in America today.
    51% toxic: US Muslims and Jews are strengthening bonds amid acts of bigotry.
    48% toxic: The problem of income inequality is not between Black America and White America but between black haves and black have-nots.
    87% toxic: I lost my grip and fell on my ass. Not a great bike.
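The percentage scores commenters quote come from Google's Perspective API, which exposes a `comments:analyze` endpoint. A minimal sketch of building such a query and reading the score back (the endpoint path and response shape follow Perspective's public docs; the API key and the sample response values here are placeholders):

```python
import json
import urllib.request

API_URL = ("https://commentanalyzer.googleapis.com/v1alpha1/"
           "comments:analyze?key={key}")

def build_request(text):
    """Build the JSON payload Perspective expects for a TOXICITY query."""
    return {
        "comment": {"text": text},
        "languages": ["en"],
        "requestedAttributes": {"TOXICITY": {}},
    }

def toxicity_score(response):
    """Pull the summary TOXICITY probability (0.0-1.0) out of a response."""
    return response["attributeScores"]["TOXICITY"]["summaryScore"]["value"]

def analyze(text, api_key):
    """POST a comment to Perspective and return its toxicity score."""
    req = urllib.request.Request(
        API_URL.format(key=api_key),
        data=json.dumps(build_request(text)).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return toxicity_score(json.load(resp))

# A response shaped like the API returns (the 0.34 value is made up):
sample = {"attributeScores": {"TOXICITY": {"summaryScore": {"value": 0.34}}}}
```

Calling `analyze("I lost my grip and fell on my ass.", key)` would return a float like the percentages above; the demo page on the article is doing essentially this per keystroke.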
colordrops · almost 8 years ago

Really not a fan of these types of technology: there are the subtleties of language such as sarcasm and irony, then you've got approved narratives and taboo subjects, and those times where the minority is right and is under attack by the mob.

I'd only support this tech as a filter for human moderators, not as an automated system.
kccqzy · almost 8 years ago

How do you even define "toxic"?

I pasted this comment from the recent diversity manifesto:

> I'm simply stating that the distribution of preferences and abilities of men and women differ in part due to biological causes and that these differences may explain why we don't see equal representation of women in tech and leadership.

The page says it's 2% toxic. What does that mean? 2% of the population would find it toxic? There is a 2% chance someone would find it toxic? The API is 2% confident that it is toxic? And more importantly, toxic in the sense that it is verbal harassment? Or just plain illogical? Or logically sound but with an absurd premise?

I suspect that it is only able to detect more emotional comments, but will fail to detect utterly unfounded, totally disproved arguments that are communicated under the veil of reason.
brainopener · almost 8 years ago

I saw this roll through Twitter the other day: *a bot that's as good at detecting toxicity as Google is in 50 lines of code*

https://twitter.com/toxicitychecker

It came from this thread, where there are complaints that the Perspective API may not outperform a random number generator.

https://twitter.com/NoraReed/status/895498083131207681
emerged · almost 8 years ago
Google are the most literal incarnation of Big Brother I could possibly imagine at this point.
yosito · almost 8 years ago

This is cool, but it has some inherent biases. If you type only "Trump", it suggests that there's a 42% chance that your comment could be perceived as toxic. If you type only "Clinton", there's a 14% chance.

That being said, I think there's some huge potential to use AI/ML in this way to improve our ability to communicate less toxically. I've seen some research from Google investigating biases in AI/ML outcomes, so I'm excited to see what develops.
hartator · almost 8 years ago

I like how all the safest ones are the ones defending climate change.

I wish they would train their models on non-political data to avoid potential partisan bias. The current approach is a bit ridiculous.
microcolonel · almost 8 years ago

> *This model was trained by asking people to rate internet comments on a scale from "Very toxic" to "Very healthy" contribution. Toxic is defined as... "a rude, disrespectful, or unreasonable comment that is likely to make you leave a discussion."*

> *asking people*

Gotta wonder: which people?

The examples are good though; I just hope the general results are consistent with that quality level.
minimaxir · almost 8 years ago

I recently developed a neural network model which can predict the reaction to a given text/comment with reasonably low error (I'll be open-sourcing the model soon).

There are a few caveats with using these approaches:

1) Toxicity is *heavily* contextual, not just by topic (as the demo texts indicate), but also by *source*; at the risk of starting a political debate, a comment that would be considered toxic by the NYT/Guardian (i.e. the sources Google partnered with) may not be regarded as toxic on conservative sites. It makes training a model much more difficult, but it's *necessary* in order to get an unbiased, heterogeneous sample.

2) When looking at comments only, there's a selection bias toward "readable" comments, while anyone who has played online games knows that toxic commentary is often less "Your wrong" and more "lol kill urself :D"

3) Neural networks still have difficulty with *sarcastic* comments and could misconstrue sarcasm as toxic, which users on Hacker News would absolutely never believe.
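The "word filter" failure mode that several commenters describe is easy to reproduce: a bag-of-words scorer judges a comment purely by which words it contains, with no notion of context, sarcasm, or negation. A toy sketch (the four-comment training set is invented for illustration):

```python
from collections import Counter

def train(labeled_comments):
    """Count how often each word appears in toxic vs. healthy comments."""
    toxic, healthy = Counter(), Counter()
    for text, is_toxic in labeled_comments:
        (toxic if is_toxic else healthy).update(text.lower().split())
    return toxic, healthy

def score(text, toxic, healthy):
    """Fraction of words seen more often in toxic than in healthy comments."""
    words = text.lower().split()
    if not words:
        return 0.0
    hits = sum(1 for w in words if toxic[w] > healthy[w])
    return hits / len(words)

data = [
    ("you are an idiot", True),
    ("kill urself lol", True),
    ("thanks for the thoughtful reply", False),
    ("interesting point about the data", False),
]
tox, heal = train(data)

# Context-blindness in action: negating an insult barely lowers the score,
# because the model only sees the individual words.
score("you are an idiot", tox, heal)            # high
score("you are not an idiot at all", tox, heal)  # still fairly high
```

Real toxicity models are far more sophisticated than this, but the sarcasm and negation failures in the examples above suggest the same underlying limitation.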
nhebb · almost 8 years ago

    "Men" - 29% likely to be perceived as toxic
    "Women" - 34% likely to be perceived as toxic

Google gender bias confirmed.

Seriously, though, I think this tool itself is toxic. I think it's more likely to fuel disagreement than quell it.
geofft · almost 8 years ago

"What we must fight for is to safeguard the existence and reproduction of our race and our people, the sustenance of our children and the purity of our blood, the freedom and independence of the fatherland, so that our people may mature for the fulfillment of the mission allotted it by the creator of the universe." -Adolf Hitler, *Mein Kampf*... 12% likely to be perceived as toxic.

"Injustice anywhere is a threat to justice everywhere." -Martin Luther King Jr., letter from Birmingham jail... 40% likely to be perceived as toxic.

You might even argue that Hitler's statement is in fact not very toxic, that MLK is actively trying to cause problems for injustice, and that as long as nobody makes Hitler think the existence of his people is at risk he won't do anything, so the API is accurately measuring toxicity. The question is whether a non-toxic, anodyne discourse is what you want. Peace for our time!
s_kilk · almost 8 years ago

Jesus, the results are abysmally bad. "Genocide is awesome" is rated at 20% toxic, while "Genocide is awful" gets 90% toxic.

Google, step up your game.
4684499 · almost 8 years ago

    > Trying out its Writing Experiment
    > Google is evil. 70% likely to be perceived as "toxic"
    > Google is good. 4% likely to be perceived as "toxic"
    > Google is god. 21% likely to be perceived as "toxic"

The content above is considered 51% likely to be perceived as "toxic".
jfktrey · almost 8 years ago
It seems that any words with a curse in the middle automatically get ~41% toxic. Scunthorpe must be a toxic place.
jdavis703 · almost 8 years ago

These results are a bit scary. For the U.S. election category, the only comment in the "least toxic" set that really took a stand on anything said: "Too much media influence." All the other comments were either meta-comments or along the lines of let's all hold hands and sing kumbaya.

I agree we need to weed out toxic comments, but human-moderated systems are the best. Hacker News has some of the best discussions that I read online. Even when I vehemently disagree with someone's point, it's still worded in a respectful tone.
emanreus · almost 8 years ago

Tools like this will always do more harm than good. False positives will always be sky-high. On one hand it will obstruct legitimate discussion, and on the other it's trivial to game such systems. Toxicity won't be stopped but magnified, by stimulating offenders to embed it in benign words and sentences. Quick examples:

    10% Holocaust was amazing. We should do it again sometimes.
    12% Would you like to buy some knee grows?
kccqzy · almost 8 years ago

I really wonder whether hiding these comments would simply lead to even more echo chamber effects. Censoring (or "hiding") online speech is a fine line to walk.
nitwit005 · almost 8 years ago

If you let people see their toxicity rating, they'll just learn to game the system. Of course, more indirect or poetic insults might be an improvement.
hyperpape · almost 8 years ago

Yesterday, I got some lovely results:

67% "Radical Islam" is not the largest threat our nation faces.

48% There are lots of angry people on the Internet.

17% I'm open to other ideas, but I'd like to suggest that perhaps we should sterilize people whose opinions I dispute.
gt_ · almost 8 years ago

A software tool for silencing those with contrasting voices.

From a company committed to diversity.
dvfjsdhgfv · almost 8 years ago

It seems to have changed its Perspective on potatoes. From the previous discussion (https://news.ycombinator.com/item?id=13713443):

    53%: Your a potato.
    55%: your a potato
    61%: ur a potato
    36%: a potato is you

Now:

    74%: Your a potato.
    77%: your a potato
    85%: ur a potato
    66%: a potato is you

As it's based on ML, it looks like people get offended more easily.
cft · almost 8 years ago
Soon this API will be a condition for using AdSense on pages with user comments.
azr79 · almost 8 years ago

They've made an episode on that in South Park. Didn't end well.
avaer · almost 8 years ago

I really hope this ends up paid and expensive.

If you're paying for it, it's a powerful tool for you to steer discussion and truth towards what you'd like on your platform.

If you're not paying for it, it's a powerful tool for Google to steer discussion and truth towards what Google would like on everyone's platform.
tdurden · almost 8 years ago

Google deciding what is "toxic" or not is terrifying.

edit: 59% likely to be perceived as "toxic"
cyanexttuesday · almost 8 years ago
This is dangerous. The unequal treatment of protected classes and censorious nature of this is bad.<p>I can see the governments of the world regulating Google hard if they go forward with this, and honestly they will deserve it.
unityByFreedom · almost 8 years ago

Cool. I look forward to when something like this can be a plugin.

Given that we know people sell reddit (and HN?) usernames in order for others to mass-comment, it'd be nice to have something to combat the low-hanging fruit such as the examples given on this page.

I don't think either of these contributes anything to any conversation:

> If they voted for Hilary they are idiots

> Screw you trump supporters

If you do, well, we might be visiting different websites -- one that implements this tech (here?), and one that doesn't (4chan).
megous · almost 8 years ago

I can imagine similar tech is used to delete [extremist] content on YouTube. And it's probably just as precise as this.
dgudkov · almost 8 years ago

I don't see how this can work well. Toxicity strongly depends on context. What is considered toxic in the US may not be considered toxic in other countries. Some totally appropriate conversations between friends could be perceived as toxic if exposed publicly.
the8472 · almost 8 years ago
And now we need an adversarial bot that performs substitutions with a thesaurus (including urban dictionary and similar slang) until it finds a result that rates at a desired toxicity level.
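Such an adversarial bot is straightforward to sketch: walk the comment word by word, try synonym substitutions, and keep any change that lowers the score. A toy version (the blacklist scorer standing in for the API, the synonym table, and all function names here are invented; a real bot would query the API and use a full thesaurus):

```python
# Invented stand-in for the toxicity API: scores by blacklisted words.
BLACKLIST = {"idiot", "stupid", "awful"}

def score(text):
    """Fraction of words that appear on the blacklist."""
    words = text.lower().split()
    if not words:
        return 0.0
    return sum(1 for w in words if w in BLACKLIST) / len(words)

# Tiny hand-made thesaurus mapping flagged words to blander synonyms.
SYNONYMS = {
    "idiot": ["person", "individual"],
    "stupid": ["unwise", "questionable"],
}

def detoxify(text, target=0.0):
    """Greedily substitute synonyms until the score drops to the target."""
    words = text.split()
    for i in range(len(words)):
        if score(" ".join(words)) <= target:
            break
        for alt in SYNONYMS.get(words[i].lower(), []):
            candidate = words[:i] + [alt] + words[i + 1:]
            if score(" ".join(candidate)) < score(" ".join(words)):
                words = candidate
                break
    return " ".join(words)

detoxify("you are a stupid idiot")  # substitutes until the score hits zero
```

The greedy loop preserves the insult's structure while swapping out exactly the words the scorer reacts to, which is why word-level models are so easy to game.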
dvfjsdhgfv · almost 8 years ago

Really, this is nothing more than a profanity filter.

*The differences in abilities, knowledge and salaries between men and women can be attributed to biological causes.*

*2% likely to be perceived as "toxic"*
roceasta · almost 8 years ago

If you've solved toxic comments then you've solved AGI.
dvfjsdhgfv · almost 8 years ago

*The differences in abilities between men and women can be attributed to biological causes.*

*3% likely to be perceived as "toxic"*

I guess they need to train it a bit more...
christianjung · almost 8 years ago
Has anyone applied for access? How long did it take? I want to use it for a research project. I applied a couple weeks ago. No response back.
larvaetron · almost 8 years ago

I guess I'm missing the point. If this is a growing trend in communication, why pretend it doesn't exist?
octaveguin · almost 8 years ago

This is really neat, especially since they have the API results in the page so you can test out how toxic a phrase is.

It invites a game: make the most toxic comment that can fly under the radar. If they started using this in YouTube comments, Reddit, etc., at least the comments would be more original.

I got a 30% toxicity with:

"I believe the intelligence of climate change deniers is likely to be zero. Furthermore, they have the body oder of a kind of ogre."

Can you do better?
dest · almost 8 years ago
Will irony and sarcasm be detected?
golemotron · almost 8 years ago

Inherent in this is the notion that toxicity is bad. It isn't. We grow stronger through exposure to toxicity in our environment.

It may seem glib to equate chemicals and comments, but it's not. There are many people who have become hyper-fragile to speech they disagree with. That is not good mental or emotional health.
sp527 · almost 8 years ago

"Women shouldn't have rights." -> 5% likely to be toxic

Hmmmmm
letsmakeit · almost 8 years ago
Very ugly idea.
lwansbrough · almost 8 years ago

I know there's going to be a lot of pushback on this because HN is sensitive to censorship, but let's try to look at it a little more objectively than that. I'd like to draw on one example, one that is near and dear to many hearts in the US and abroad: the US election.

Throughout the course of the election, opinions and comments were being shared all over the place. Twitter, Facebook, here on HN, bathroom stalls, news broadcasts and websites, comments on blogs and videos. There was no shortage of opinions. This is great, and showcases the power of the internet in its capability to transmit and receive all types of information. But is it not important how an opinion is formed? Surely you wouldn't enjoy or find valuable a blog post that was sparse on details, proof, or a coherent line of thinking. And yet, there it was: in every corner of the internet, anyone who could operate an internet device could share their opinion on the matter. It didn't matter if they spent 1 second on their response or 1 hour. Most comments received the same amount of attention and value.

The question is, should all thoughts and opinions be valued the same when information is in incredible supply? Most of us don't think so, and we've shown that by creating voting systems which allow humans to filter out the things we find to be deconstructive. But we don't really stop there, do we? Humans are also incredibly biased on average: you see it here, and you see it a lot on reddit. People vote things down not on the merit of the level of attention the commenter gave to their response, but generally on whether or not they agree with the sentiment expressed by the commenter.

How many arguments has this bias fuelled? I wonder how many people have been pushed further away from a centrist perspective because of the shaming and bashing that goes on in online threads.

I think Hacker News is a great example of humans doing much better than average at filtering out strictly toxic comments (and the mods are certainly at least partially to thank!). We're really lucky to be able to have people engage in conversations with opposing views here, and also to see many different perspectives treated with the same level of respect. But even here, quite often we're prevented from having discussions that are truly political, because of the toxicity that arises. And I have to say I think I've noticed an increase in the past couple of years.

There aren't a lot of immediately obvious solutions to this problem, but I propose that AI intervention isn't the worst solution, and may be the best, even compared to humans. I'm going to give Google the recognition they deserve for this service. I think an increase in this approach to online conversation could dramatically change the way we choose to engage each other in conversation, and generally will lead to more positive perspectives of one another -- something we could all use a little help with.

Edit: I will say, however, that this needs to work. If it's not doing its job correctly, or well enough, it could lead to problems which I don't need to address here.
rootw0rm · almost 8 years ago
fuck no.
jamesmp98 · almost 8 years ago
They need to use that on Youtube lol
gregkerzhner · almost 8 years ago

Can we use this to filter Donald Trump's Twitter?