TechEcho
Full Text Search Algorithms

2 points by amourgh about 14 years ago

Hello guys,

I'm working on an ASP.NET project where a user (student) saves a doc file (.doc, .docx) in a folder. Now that there are more than 20,000 doc files in that folder, I've been asked to implement search over those files by keywords that the user types, such as "object oriented programming", "programming", "networks"...

I would like to know the most used techniques and algorithms for this kind of search. Full text search algorithms?

2 comments

dalke about 14 years ago

If you want to know some of the techniques, read "Managing Gigabytes: Compressing and Indexing Documents and Images" by Ian H. Witten, Alistair Moffat, and Timothy C. Bell.

If you want to actually implement search for your users, use Solr. And use Tika for extracting text from Word documents.
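
To give a rough idea of the kind of technique involved, here is a minimal sketch of a positional inverted index, the data structure that full text search engines are generally built around. It assumes the text has already been extracted from the Word files (for example with Tika) into plain strings. The function names (`tokenize`, `build_index`, `search_all`, `search_phrase`) are made up for illustration and are not the API of Solr, Lucene, or any other library.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(docs):
    """Build a positional inverted index: term -> {doc_id: [positions]}.

    `docs` is assumed to be a dict of doc_id -> already-extracted plain text.
    """
    index = defaultdict(dict)
    for doc_id, text in docs.items():
        for pos, term in enumerate(tokenize(text)):
            index[term].setdefault(doc_id, []).append(pos)
    return index


def search_all(index, query):
    """Return ids of documents that contain every query term (boolean AND)."""
    terms = tokenize(query)
    if not terms:
        return set()
    result = None
    for term in terms:
        ids = set(index.get(term, {}))
        result = ids if result is None else result & ids
    return result


def search_phrase(index, phrase):
    """Return ids of documents where the terms appear consecutively."""
    terms = tokenize(phrase)
    matches = set()
    for doc_id in search_all(index, phrase):
        # Try each position of the first term as a possible phrase start.
        for start in index[terms[0]][doc_id]:
            if all(start + i in index[t][doc_id] for i, t in enumerate(terms)):
                matches.add(doc_id)
                break
    return matches
```

With an index built once up front, a keyword query only touches the posting lists for its terms instead of rescanning all 20,000 files. A real engine adds stemming, stop words, relevance ranking, and index compression on top of this, which is why using Solr is the practical route.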
NonEUCitizen about 14 years ago

Take a look at http://lucene.apache.org/java/docs/index.html