I am writing a proposal for a company that has a lot of data in many forms: text, video (YouTube, I think, and others), and MP3 audio. They want to index it and make it relevant to searchers. For example, if a cook signs up, they should get all the data related to being a chef, including how to take care of burns, fires, and similar topics that don't necessarily contain the word "cook". I need some good resources or a starting point so I can research this and put something together for them. Any help would be very welcome!<p>They suggested Sitecore’s Content Management Solutions tool, but I've never heard of it.
This project seems shockingly ambitious. Whatever tool you end up with, I think you will have to write quite a lot of custom code for your specific use cases. A lot of work went into the Netflix Prize, and the discussion around it has many useful points. Also, have you checked the academic research on CiteSeer?
>"they get all the data related to being a chef including how to take care of burns, fires, or things like that that don't necessarily have the word cook in it"<p>This seems almost impossible to pull off unless you can tap into some dataset that already links cooking to burns, fires, and so on.
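To make the "dataset that links cooking to burns" idea concrete, here is a rough Python sketch of query expansion over a tiny inverted index. Everything in it is an assumption for illustration: the `RELATED_TOPICS` map stands in for whatever linking dataset you could actually obtain, and the documents and function names are made up. It only shows the mechanism, not a production design.

```python
from collections import defaultdict

# Hypothetical, hand-curated links from a profession to related topics.
# In a real system this would come from an external dataset or taxonomy.
RELATED_TOPICS = {
    "cook": {"cook", "chef", "kitchen", "burn", "fire", "knife"},
}

# Toy documents standing in for indexed text (or transcripts of audio/video).
DOCS = {
    "doc1": "How to treat a minor burn at home",
    "doc2": "Putting out a grease fire safely",
    "doc3": "Quarterly tax filing tips for freelancers",
    "doc4": "Knife skills every line cook should practice",
}


def tokenize(text):
    """Lowercase the text and split it into a set of bare words."""
    return {word.strip(".,!?").lower() for word in text.split()}


def build_index(docs):
    """Build an inverted index mapping each token to the ids of docs containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for token in tokenize(text):
            index[token].add(doc_id)
    return index


def search_for_profession(profession, index, related=RELATED_TOPICS):
    """Expand a profession into related terms, then collect every matching doc."""
    terms = related.get(profession, {profession})
    hits = set()
    for term in terms:
        hits |= index.get(term, set())
    return sorted(hits)


if __name__ == "__main__":
    index = build_index(DOCS)
    print(search_for_profession("cook", index))
```

The burn and fire documents come back for "cook" even though neither contains that word, which is the whole trick. The hard part is still where the links come from: without a dataset behind `RELATED_TOPICS`, someone has to curate it by hand.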