科技回声

6 comments

BillFranklin, about 4 years ago

Interesting article! Shopify's approach is cool, and it's interesting that they're using Kafka to generate datasets. I wonder if the explicit human rankings will get stale (and also be hugely outweighed by implicit judgements in the training data). The real-time feedback aspect sounds cool; I wonder if it's just for metrics or also for re-training in real time.

I worked on a Learning To Rank implementation a year or so ago. What struck me then (and now, reading about Shopify's implementation) is that the approach is often very similar across sites, but the implementation is usually rather tailored. You see the same patterns: online/offline metrics; nDCG; click models and implicit/explicit relevance judgements; re-ranking the top-k results, and so on.

Unfortunately there doesn't seem to be a technology tying all of the components of an LtR system together. A managed service like Algolia could be an answer. I wonder if industry will eventually converge on a framework, such as an extension to OpenSource Connections' Elasticsearch Learning to Rank plugin (https://diff.wikimedia.org/2017/10/17/elasticsearch-learning-to-rank-plugin/).

It's a really interesting area of theory and practice - I hope Shopify write more about their implementation!

I'd also recommend reading Airbnb's really excellent paper - https://arxiv.org/pdf/1810.09591.pdf.
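
For readers who haven't met nDCG, one of the recurring patterns named in the comment above, here is a minimal sketch of how it is commonly computed. It is not taken from the article or from Shopify's code. The function names `dcg` and `ndcg` are hypothetical, and the sketch assumes linear gain (some variants use 2^rel - 1), a log2 rank discount, and an ideal ordering built from the same judged list.

```python
import math


def dcg(relevances, k=None):
    """Discounted cumulative gain over the first k results (all if k is None).

    Assumes linear gain and a log2 discount: rel_i / log2(i + 1) for 1-based rank i.
    """
    rels = relevances[:k] if k is not None else relevances
    # enumerate() is 0-based, so rank + 2 gives log2(2) = 1 for the top result.
    return sum(rel / math.log2(rank + 2) for rank, rel in enumerate(rels))


def ndcg(relevances, k=None):
    """DCG of the given ordering divided by the DCG of the best possible ordering.

    Assumption: the ideal ordering is built from the same judged list.
    """
    ideal = sorted(relevances, reverse=True)
    ideal_dcg = dcg(ideal, k)
    if ideal_dcg == 0:
        return 0.0  # no relevant results at all; define nDCG as 0
    return dcg(relevances, k) / ideal_dcg
```

Here `relevances` is the list of graded judgements for one query's results, in the order the ranker returned them, so `ndcg([3, 2, 1])` is a perfect ordering and `ndcg([1, 2, 3])` scores lower.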

NumberCruncher, about 4 years ago

I should re-read the article, because I can't see what kind of problem they are trying to solve with MAP, NDCG and an "invented here" PageRank that couldn't be solved with tf-idf and out-of-the-box Elasticsearch functionality. It's a highly underrated piece of software.

LZ_Khan, about 4 years ago

Where do the relevance scores come from? Are they human rated? I feel like that could leave room for error as raters would probably not have the same opinion as me on what a good document is.

ntonozzi, about 4 years ago

Great article!

This seems like a fairly tricky ranking function. I wonder if they compared it to combining TF-IDF and the page popularity. This would help with the problem they explained.

It'd be interesting to see more details about how they implemented the query-specific page rank.
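
The baseline suggested in the comment above, combining TF-IDF with page popularity, could be sketched roughly as follows. This is an illustration, not anything described in the article. The names `tf_idf_score`, `combined_score` and `rank` are hypothetical, as are the smoothed IDF formula, the `log1p` damping of popularity, and the `weight` parameter.

```python
import math
from collections import Counter


def tf_idf_score(query_terms, doc_tokens, doc_freq, num_docs):
    """Sum of tf * idf over the query terms (smoothed idf; an assumed variant)."""
    if not doc_tokens:
        return 0.0
    counts = Counter(doc_tokens)
    score = 0.0
    for term in query_terms:
        tf = counts[term] / len(doc_tokens)
        idf = math.log((1 + num_docs) / (1 + doc_freq.get(term, 0))) + 1.0
        score += tf * idf
    return score


def combined_score(text_score, popularity, weight=0.5):
    """Add a damped popularity bonus to the text score.

    log1p keeps heavy-tailed counts (e.g. page views) from swamping the text match.
    """
    return text_score + weight * math.log1p(popularity)


def rank(query_terms, docs, weight=0.5):
    """Rank doc ids by combined score. docs maps doc_id -> (tokens, popularity)."""
    num_docs = len(docs)
    doc_freq = Counter()
    for tokens, _ in docs.values():
        doc_freq.update(set(tokens))  # count each term once per document
    scored = {
        doc_id: combined_score(
            tf_idf_score(query_terms, tokens, doc_freq, num_docs), popularity, weight
        )
        for doc_id, (tokens, popularity) in docs.items()
    }
    return sorted(scored, key=scored.get, reverse=True)
```

With `weight=0.0` this reduces to plain TF-IDF, so two pages with identical text tie. A positive weight breaks the tie in favour of the more popular page, but set too high it can lift popular pages that barely match the query, which is the kind of trade-off an offline metric would have to measure.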

lernerzhang, about 4 years ago

I wonder how they decide how many cases to manually label?

colesantiago, about 4 years ago

Not sure why they didn't just go with Elasticsearch?

Evaluating Search Algorithms
