Show HN: Bullshit-detector – quickly detect amount of bullshit in any text

28 points by __natty__ 9 months ago
Hi, I have been reading marketing and business books recently and found plenty of them are filled with meaningless corporate jargon. These books could often be 1/3 of the original length and much more straightforward. I wrote a tiny library to calculate the amount of meaningless jargon in any text for myself, and open-sourced it later because someone may need this.

6 comments

luke-stanley 9 months ago
Presumably, the identified phrase list could be used to fine-tune a BERT model or similar that could catch more cases, as a binary classifier. But presumably some actual semantically meaningful words would be needed too. That would be straightforward to do too, though. Someone has probably already done it. The advantage would be you could get probability metrics on a broader set of text. Good data is the key thing though.
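As a rough illustration of the "good data" step, the sketch below turns a phrase list into weakly labeled sentences that a binary classifier could later be trained on. Everything here is an assumption for illustration: the `jargon` list, the `LabeledExample` shape, and the `weakLabel` helper are hypothetical and not part of the project, and the fine-tuning itself is not shown.

```typescript
// Hypothetical phrase list standing in for the project's real one.
const jargon: string[] = ["synergy", "paradigm shift", "circle back"];

interface LabeledExample {
  text: string;
  label: 0 | 1; // 1 = sentence contains a listed phrase, 0 = it does not
}

// Naive sentence splitter: breaks after ., ! or ? followed by whitespace.
function splitSentences(text: string): string[] {
  return text
    .split(/(?<=[.!?])\s+/)
    .map((s) => s.trim())
    .filter((s) => s.length > 0);
}

// Label each sentence by case-insensitive substring match against the list.
function weakLabel(text: string, list: string[] = jargon): LabeledExample[] {
  return splitSentences(text).map((sentence): LabeledExample => {
    const lower = sentence.toLowerCase();
    const hit = list.some((p) => lower.includes(p.toLowerCase()));
    return { text: sentence, label: hit ? 1 : 0 };
  });
}
```

Labels produced this way are noisy, since substring matching only finds what is already on the list, so the negative examples would need review before training on them.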
BrandoElFollito 9 months ago
The fact that OP felt the need to add a disclaimer suggests that they expect people who write such abominations to search for detectors :)
Who knows, though. Maybe there is a marketing dude who once thought "maybe that's too much?". Naah.
ilaksh 9 months ago
Hm. I wonder how well this works versus a large LLM. Seems like something a very strong LLM should be able to handle well with the right prompting.
If you can handle it with just phrases that would save a lot of time and money though.
delichon 9 months ago
It counts these phrases that the author doesn't like:
https://github.com/pilotpirxie/bullshit-detector/blob/main/src/phrases.ts
By including this file this project should therefore correctly give itself a very high bullshit score. It's performance art really.
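For anyone wondering what phrase counting could look like in practice, here is a minimal TypeScript sketch. It is not the library's actual code: the three-entry `phrases` array, the `bullshitScore` name, and the scoring formula (words inside matched phrases divided by total words) are all assumptions made for illustration.

```typescript
// Hypothetical phrase list; the real project keeps its own in src/phrases.ts.
const phrases: string[] = ["synergy", "low-hanging fruit", "move the needle"];

interface ScoreResult {
  matches: Record<string, number>; // phrase -> number of occurrences
  totalWords: number;
  score: number; // share of words that fall inside a matched phrase, 0..1
}

// Escape regex metacharacters so a phrase is matched literally.
function escapeRegExp(s: string): string {
  return s.replace(/[.*+?^${}()|[\]\\]/g, "\\$&");
}

function bullshitScore(text: string, list: string[] = phrases): ScoreResult {
  const words = text.trim().split(/\s+/).filter((w) => w.length > 0);
  const matches: Record<string, number> = {};
  let matchedWords = 0;

  for (const phrase of list) {
    // Case-insensitive, whole-phrase match.
    const re = new RegExp(`\\b${escapeRegExp(phrase)}\\b`, "gi");
    const count = (text.match(re) || []).length;
    if (count > 0) {
      matches[phrase] = count;
      matchedWords += count * phrase.split(/\s+/).length;
    }
  }

  const score = words.length === 0 ? 0 : Math.min(1, matchedWords / words.length);
  return { matches, totalWords: words.length, score };
}
```

Under this assumed formula, scoring the list itself with `bullshitScore(phrases.join(" "))` gives a score of 1, which is the joke: a file made up entirely of listed phrases is maximal by construction.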
pacifika 9 months ago
Love this
r00tanon 9 months ago
Bullshit.