
Full Text Search Algorithms

2 points by amourgh, about 14 years ago

Hello guys,

I'm working on an ASP.NET project where a user (a student) saves a doc file (.doc, .docx) in a folder. Now that there are more than 20,000 doc files in that folder, I've been asked to implement search over those files by keywords that the user types, like "Object oriented programming", programming, networks...

I would like to know the most commonly used techniques and algorithms for this kind of search. Full text search algorithms?

2 comments

dalke, about 14 years ago

If you want to know some of the techniques, read "Managing Gigabytes: Compressing and Indexing Documents and Images" by Ian H. Witten, Alistair Moffat, and Timothy C. Bell.

If you want to actually implement search for your users, use Solr. And use Tika for extracting text from Word documents.
NonEUCitizen, about 14 years ago

Take a look at http://lucene.apache.org/java/docs/index.html
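
For a feel of the basic technique behind full text search engines such as Lucene and Solr, here is a minimal sketch of an inverted index in Python. It assumes the text has already been extracted from the Word files (for example with Tika). The function names and the sample documents are hypothetical, and it only does a plain AND match on terms: no phrase matching, stemming, or ranking, all of which a real engine would add.

```python
import re
from collections import defaultdict

def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())

def build_index(docs):
    """Map each term to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in tokenize(text):
            index[term].add(doc_id)
    return index

def search(index, query):
    """Return ids of documents containing every term in the query (AND search)."""
    terms = tokenize(query)
    if not terms:
        return set()
    # Copy the first posting set so the index itself is never mutated.
    result = set(index.get(terms[0], set()))
    for term in terms[1:]:
        result &= index.get(term, set())
    return result

# Hypothetical sample data standing in for text extracted from .doc/.docx files.
docs = {
    "a.docx": "Introduction to object oriented programming",
    "b.doc": "Computer networks and network programming",
    "c.docx": "Object storage in networks",
}
index = build_index(docs)
print(search(index, "object oriented programming"))
print(search(index, "programming"))
```

The index is built once (and updated when a file is saved), so a query only touches the posting sets for its terms instead of scanning all 20,000 files. For a production ASP.NET site, an existing engine is still likely the better choice than rolling your own.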