TechEcho

Hey everyone!I built ModerateHatespeech, an initiative that working on better understanding + building solutions to combat hate speech in online platform. We have our flagship API, which is completely free and ML-powered, and gives a lot more actionable/better results than pretty much any other similar platform out there.We've been able to leverage our partnerships with many communities to get a lot of good data + feedback, and bring our system to a lot of users (we process ~200k comments a day right now).We've also done a lot of work to better understand biases/potential abuse-cases of our API, which you can read about on our site (trying to avoid too many links getting caught in spam filters).I would definitely love to hear any thoughts/feedback! Here's a link to information about our project/API: https://moderatehatespeech.com/

8 comments

DontchaKnowitover 2 years ago

God can't wait till this is implemented everywhere and the internet turns into fucking Disneyland.

masterof0over 2 years ago

"Republicans are bigots"{ "class": normal "confidence": 0.955 }"Democrats are bigots"{ "class": flag "confidence": 0.575 }

评论 #33641528 未加载

smileybarryover 2 years ago

It doesn't seem to deal with misgendering hate or transphobia very well (I saw lots of models failing on this so I checked that right away), and I mean obvious ones regardless of your stance, e.g.:> "She is a he" => { "class": normal, "confidence": 1 }> "He will never be a woman" => { "class": normal, "confidence": 0.999 }But it does seem to identify 2nd-person targeted transphobia:> "You will never be a woman" => { "class": flag, "confidence": 0.996 }

评论 #33617880 未加载

评论 #33613274 未加载

评论 #33613106 未加载

评论 #33618440 未加载

sn0w_crashover 2 years ago

So it’s just a woke language model?

评论 #33605647 未加载

评论 #33642104 未加载

af3dover 2 years ago

Interesting. It seems to be able to detect the subtle nuances of meaning fairly well. Maybe not perfect, but I would give it at least 8 stars on a scale from 1 to 10.

itakeover 2 years ago

What is the max qps and latency this supports? What city is it hosted in?Some moderation platforms I've worked with are too slow for messaging, especially if users are in different countries.Do you have plans for image moderation?

评论 #33617849 未加载

Sujetoover 2 years ago

Works pretty well in the few tests I made

jawertyover 2 years ago

How do you define hate speech?

评论 #33641502 未加载

8 comments

DontchaKnowitover 2 years ago

God can't wait till this is implemented everywhere and the internet turns into fucking Disneyland.

masterof0over 2 years ago

"Republicans are bigots"{ "class": normal "confidence": 0.955 }"Democrats are bigots"{ "class": flag "confidence": 0.575 }

评论 #33641528 未加载

smileybarryover 2 years ago

评论 #33617880 未加载

评论 #33613274 未加载

评论 #33613106 未加载

评论 #33618440 未加载

sn0w_crashover 2 years ago

So it’s just a woke language model?

评论 #33605647 未加载

评论 #33642104 未加载

af3dover 2 years ago

Interesting. It seems to be able to detect the subtle nuances of meaning fairly well. Maybe not perfect, but I would give it at least 8 stars on a scale from 1 to 10.

itakeover 2 years ago

评论 #33617849 未加载

Sujetoover 2 years ago

Works pretty well in the few tests I made

jawertyover 2 years ago

How do you define hate speech?

评论 #33641502 未加载

Show HN: A free, AI-powered service for automating hate speech moderation

8 comments

Show HN: A free, AI-powered service for automating hate speech moderation

8 comments