TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

ZipPy: Detect AI-generated text quickly via compression ratios

29 pointsby makeworldover 1 year ago

3 comments

bastawhizover 1 year ago
I think it&#x27;ll be difficult or perhaps even practically impossible to detect if a single blob of text is generated by AI in the future. However I think we&#x27;ll find there&#x27;s a practical means of determining if multiple blobs of text in a set of blobs was generated by AI. For example, if you have a class of students turning in reports on the same topic: an AI would produce similar outputs for roughly the same prompts because it was trained in a specific way. Those reports would have some statistical similarities.<p>But this isn&#x27;t an AI detection system, it&#x27;s a general cheating detection system. I&#x27;d expect it to also detect if one person wrote the reports for multiple students and simply rephrased each report. But if the intention is to detect cheating, that&#x27;s not a bad thing.
kejover 1 year ago
This is clever but it feels like a situation where Goodhart&#x27;s Law (when a measure becomes a target, it ceases to be a good measure) will apply. Once this becomes a known test, AI text generators can optimize for it and then it won&#x27;t be a reliable test anymore.
bArrayover 1 year ago
For student reports, you could potentially whip something up to pull out each paragraph and evaluate it - giving you a form of score to investigate further. When you&#x27;re marking 200-400 reports and you only have 5-10 minutes to mark each, something like this could be a game changer.