TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Ask HN: Did bandits go out of fashion?

12 点作者 dira33 个月前
About 10 years ago, multi-armed bandits (and especially contextual bandits) were really important for optimization and business use cases.<p>But nowadays you hardly hear about it, despite optimization continuing to be a need. The only major open source library continues to be Vowpal Wabbit - and even that is not very robustly maintained or documented.<p>What are people doing for bandits or optimization needs nowadays? And what are some active, Python-centric, open-source libraries with a strong user community and stable code?

4 条评论

softwaredoug3 个月前
I think what happens is traditional, supervised ML kind of comes through and steamrolls everything due to the proliferation of tools to operationalize and train models. There&#x27;s just inertia behind one family of ML that other solutions end up with much more bespoke solutions. Mindshare builds momentum into more mindshare...
hruk3 个月前
We&#x27;ve used this library for Bayesian contextual bandits in production (we have a critical business use case supported by a ~200K feature sparse Linear UCB bandit). It&#x27;s a small community, but it&#x27;s also a small enough codebase that we&#x27;ve read through all of it and feel fine about maintaining it ourselves in case it goes inactive.<p><a href="https:&#x2F;&#x2F;github.com&#x2F;bayesianbandits&#x2F;bayesianbandits">https:&#x2F;&#x2F;github.com&#x2F;bayesianbandits&#x2F;bayesianbandits</a>
arromatic3 个月前
I have been looking for resources about contextual bandits too . But i failed finding anything good . Closest thing i found was Vowpal wabbit , same as you and same problem - unmaintained . Tried searching hn about bandits but didn&#x27;t find anything useful either .<p>I was looking for algorithms that can find new interest of users . I felt like after all these research all i learned is the ancient technique of showing x percent of random items to users .
PaulHoule3 个月前
Personally I have an RSS reader that does content based recommendation, my personal evaluation is that it is `great` so I am in no hurry to improve it. So far as bandits go I blend in 10% random items to help it learn and calibrate, I am thinking of raising that to 20% but I have no objective criteria to decide what that fraction should be.