TE
TechEcho
Home
24h Top
Newest
Best
Ask
Show
Jobs
English
GitHub
Twitter
Home
Semhash: Fast deduplication and dataset multitool in Python
3 points
by
stephantul
4 months ago
1 comment
stephantul
4 months ago
Hello,<p>today we released Semhash! Here's a blogpost on how it works. We did a show HN yesterday, but that got deleted for some reason.<p>Let me know what you think!