Hello guys,<p>I'm working on an ASP.NET project where a user (a student) saves a Word file (.doc, .docx) to a folder.
Now that there are more than 20,000 files in that folder, I've been asked to implement search over them by keywords the user types, such as "Object oriented programming", programming, networks...<p>What are the most commonly used techniques and algorithms for this kind of search? Full-text search algorithms?
If you want to understand the techniques, read "Managing Gigabytes: Compressing and Indexing Documents and Images" by Ian H. Witten, Alistair Moffat, and Timothy C. Bell.<p>If you want to actually implement search for your users, use Solr, and use Apache Tika to extract the text from the Word documents.
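To give a feel for what that book covers and what an engine like Solr does internally, here is a minimal sketch of a positional inverted index, the basic data structure behind full-text search. It is only an illustration, not how Solr is actually implemented: the function names (`tokenize`, `build_index`, `search`) and the sample file names are made up, and it assumes the plain text has already been extracted from each Word file (for example with Tika).

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(docs):
    """Build a positional inverted index: term -> {doc_id: [positions]}."""
    index = defaultdict(dict)
    for doc_id, text in docs.items():
        for pos, term in enumerate(tokenize(text)):
            index[term].setdefault(doc_id, []).append(pos)
    return index


def search(index, query):
    """Return sorted ids of documents containing the query as an exact phrase.

    A one-word query simply returns every document containing that word.
    """
    terms = tokenize(query)
    if not terms:
        return []
    postings = [index.get(term, {}) for term in terms]
    # A matching document must contain every query term.
    candidates = set(postings[0])
    for posting in postings[1:]:
        candidates &= set(posting)
    hits = []
    for doc_id in candidates:
        # Phrase check: term i must appear exactly i positions after the first term.
        later = [set(posting[doc_id]) for posting in postings[1:]]
        starts = postings[0][doc_id]
        if any(all(s + i + 1 in later[i] for i in range(len(later))) for s in starts):
            hits.append(doc_id)
    return sorted(hits)


# Hypothetical sample data standing in for text extracted from the uploaded files.
docs = {
    "a.docx": "Intro to object oriented programming",
    "b.doc": "Computer networks and programming",
    "c.docx": "Programming oriented toward object stores",
}
index = build_index(docs)
print(search(index, "Object oriented programming"))
print(search(index, "programming"))
```

The index is built once (and updated when a student uploads a new file), so a query only touches the posting lists for its own words instead of scanning all 20,000 documents. A real engine adds stemming, stop words, relevance ranking and on-disk compression on top of this, which is why using Solr is the practical choice.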