TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Building the Next New York Times Recommendation Engine

129 pointsby jprobalmost 10 years ago

6 comments

flashmanalmost 10 years ago
I&#x27;m really impressed that NYT took the time to document this. It&#x27;s always interesting to see the different recommendation models evaluated and applied to real-world situations.<p>I&#x27;ve been pursuing a collaborative filtering approach to product recommendation lately (&#x27;people who bought this also bought that&#x27;), but perhaps LDA would let me model our products based on their metadata (&#x27;people who bought products broadly like this also bought products broadly like that&#x27;).
muktabhalmost 10 years ago
We make a contextual recommendation engine as a service for online publishers at our startup ParallelDots. We discovered the problem of tags not really working well for recommendations on our clients websites too. We ended up using unsupervised word embeddings and auto encoders on top of them to solve the problem. We dont still use it for personalization though, just contextually similar articles. Great seeing some of similar problems being solved at New York Times too. :)
ThomPetealmost 10 years ago
The way I see it, the primary thing to solve for any recommendation engine is to optimize for serendipity. I.e. allowing you to get information you didn&#x27;t know you wanted.<p>This means basically also finding ex. articles that are not written by NYT.<p>Newspapers problem is that their primarily omnibus approach to whats relevant isn&#x27;t really doing the waste amount of insightful information available that exist out there.<p>So the whole issue IMO with all newspapers&#x2F;media these days. They are building silos where none should really exist and this is one of the primary the reason why people don&#x27;t consider it valuable anymore.
评论 #10045127 未加载
评论 #10045069 未加载
评论 #10045597 未加载
bcainealmost 10 years ago
Fun read. Topic modeling can be fascinating to work with.<p>Curious how they measured performance of their model, and whether they found a &quot;best&quot; number of topics for LDA where their model stopped getting much benefit by having more topics.<p>I&#x27;d imagine increased number of topics would have some interesting side effects where it would create too narrow of recommendations.
doppenhealmost 10 years ago
We have built this and anybody can use it <a href="https:&#x2F;&#x2F;Algorithmia.com&#x2F;recommends" rel="nofollow">https:&#x2F;&#x2F;Algorithmia.com&#x2F;recommends</a>. 2 lines of js to implement.Currently serving the geekwire.com recs. You can also modify it further (see blog.Algorithmia.com).<p>The article is awesome though good on NYT.
评论 #10046431 未加载
ersiialmost 10 years ago
I think it&#x27;d be great if you&#x27;d have this kind of information in your help section later on, for anxious people like me who are very wary of even having a recommendation engine at a news paper. I was actually on my way to sign up for a subscription after reading &quot;A Renegade Trawler, Hunted for 10,000 Miles by Vigilantes&quot; by Ian Urbina - but held back for the moment to give it more thought.<p>That said, I guess I could see a point in it maybe retaining users &#x2F; subscribers if it&#x27;s good enough. (I&#x27;d still appreciated it a lot more if this functionality could be turned off for users who request it though).
评论 #10044437 未加载