I don't see the innovation in this paper. Are they just running word2vec on groups of items? If so, Spotify has been doing this on playlists for years: https://erikbern.com/2013/11/02/model-benchmarks/

Also, I know the paper isn't claiming state-of-the-art, but their SVD results are horrendous. Standard collaborative filtering would produce much better artist-artist pairings with even a medium-sized dataset.

As an aside, I've run some quantitative and qualitative tests and found that the best recommendations come from combining item-item and user-item models. I recently co-presented a talk at the NYC Machine Learning meetup (https://docs.google.com/presentation/d/1S5Cizi9LFQ7l0bMYtY7gASvOPqxNsQk0-NuP5KWAl-4/pub?start=false&loop=false&delayms=3000&slide=id.p4) that shows how this works, starting at slide 20. The idea is to build a candidate list of matches using item-item similarity, then re-rank the candidates using user-item scores. In my experience the item-item stage keeps the suggestions "sensible", while the re-ranking stage truly personalizes them. You can also strip out obvious recommendations by filtering popular matches or items the user has already interacted with (I consider that a business decision rather than something inherent to the algorithm).
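
For concreteness, "word2vec on groups of items" just means treating each group (playlist, basket, listening session) as a sentence and each item ID as a word. A minimal sketch with gensim and made-up playlist data (parameter names assume gensim 4.x):

    from gensim.models import Word2Vec

    # Toy "playlists": each playlist is a sentence, each track ID is a word.
    playlists = [
        ["track_a", "track_b", "track_c"],
        ["track_b", "track_c", "track_d"],
        ["track_a", "track_d", "track_b"],
    ]

    # sg=1 selects skip-gram; vector_size/min_count are the gensim 4.x names.
    model = Word2Vec(playlists, vector_size=32, window=5, min_count=1, sg=1)

    # Tracks that co-occur across playlists end up with nearby vectors.
    print(model.wv.most_similar("track_b", topn=3))

And here's a rough sketch of the candidate-then-re-rank idea from the talk, assuming you already have item embeddings and a user factor vector in the same latent space; the names (item_vectors, user_vector, recommend, etc.) are illustrative, not taken from the slides:

    import numpy as np

    def cosine_sim(a, b):
        return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

    def recommend(seed_item, item_vectors, user_vector, interacted,
                  n_candidates=50, n_results=10):
        """item_vectors: dict of item_id -> embedding (word2vec, MF factors, ...).
        user_vector:  the user's latent vector in the same space.
        interacted:   item_ids the user has already seen (filtered out)."""
        seed_vec = item_vectors[seed_item]

        # Stage 1: candidate generation via item-item similarity ("sensible" matches).
        candidates = sorted(
            (i for i in item_vectors if i != seed_item and i not in interacted),
            key=lambda i: cosine_sim(seed_vec, item_vectors[i]),
            reverse=True,
        )[:n_candidates]

        # Stage 2: re-rank the candidates by user-item affinity (personalization).
        return sorted(
            candidates,
            key=lambda i: cosine_sim(user_vector, item_vectors[i]),
            reverse=True,
        )[:n_results]

    # Demo with random vectors just to show the plumbing; real embeddings
    # would come from word2vec or matrix factorization.
    rng = np.random.default_rng(0)
    items = {f"artist_{i}": rng.normal(size=16) for i in range(200)}
    user = rng.normal(size=16)
    print(recommend("artist_0", items, user, interacted={"artist_1"}))

The popularity filter and the already-interacted filter sit naturally in stage 1, which is why I think of them as business rules layered on top rather than part of the model itself.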