TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Ask HN: How can I prevent duplicates in a crowd-sourced database of media titles?

1 点作者 wwwtyro大约 13 年前
For example, http://whatshouldireadnext.com/ will ask you to enter the name of a book you liked, and a list of books will pop up. If there are multiple entries, it will recommend selecting the top one. That'll work, but is there a more elegant way?

1 comment

cd34大约 13 年前
soundex or another method to group similarly spelled words to assign a confidence of whether they should be merged.<p>Then, keep a list of the misspellings to assign future misspelled entries to the proper spelling.