yid, almost 12 years ago

I can see two problems with this:

-- A naive linear scan for a lookup will not scale as the size of your database grows larger. You should be looking into space-partitioning trees, or approximate methods like locality-sensitive hashing.

-- Euclidean distance is a terrible metric for kNN on non-metric spaces, which is what your movie example is. It will also be beaten to a pulp by the Curse of Dimensionality: http://en.wikipedia.org/wiki/Curse_of_dimensionality#Distance_functions
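For readers unfamiliar with the first point, here is a minimal sketch of what a naive linear-scan kNN with Euclidean distance looks like. It is not Alike's actual API; the names `euclidean` and `knnLinearScan` and the sample points are made up for illustration. The scan computes a distance to every stored point on every query, which is the cost that space-partitioning trees or locality-sensitive hashing try to avoid.

```javascript
// Hypothetical helper names for illustration; this is not Alike's API.

// Euclidean distance between two equal-length numeric vectors.
function euclidean(a, b) {
  let sum = 0;
  for (let i = 0; i < a.length; i++) {
    const d = a[i] - b[i];
    sum += d * d;
  }
  return Math.sqrt(sum);
}

// Brute-force kNN: measures the distance from the query to every point,
// so each query does O(n * d) distance work for n points of dimension d,
// plus an O(n log n) sort of the results.
function knnLinearScan(points, query, k) {
  return points
    .map((p, index) => ({ index, distance: euclidean(p, query) }))
    .sort((x, y) => x.distance - y.distance)
    .slice(0, k);
}

// Made-up sample data: four points in two dimensions.
const points = [[0, 0], [3, 4], [1, 1], [10, 10]];
const nearest = knnLinearScan(points, [0, 0], 2);
console.log(nearest.map((n) => n.index));
```

Every query touches all of `points`, so lookup time grows linearly with the size of the dataset, which is the scaling concern raised above.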

mck-, almost 12 years ago

Alike is a versatile, lightweight kNN/similarity library that can be useful for many machine learning projects. Whether you are building a recommendation system or an optimization model, comparing objects is pervasive -- feedback welcome!

flockonus, almost 12 years ago

I've been looking for this!

Alike: light kNN library for Node

3 comments
