TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Ask HN: Is there a primer on RL applied to LLMs?

2 pointsby eamag3 months ago
Want to read more on how exactly new thinking models are trained and if some old RL techniques are now applied again to LLMs

2 comments

Philpax3 months ago
<a href="https:&#x2F;&#x2F;www.interconnects.ai&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.interconnects.ai&#x2F;</a> has great writing on this; the author is currently working on <a href="https:&#x2F;&#x2F;rlhfbook.com&#x2F;" rel="nofollow">https:&#x2F;&#x2F;rlhfbook.com&#x2F;</a>.
billconan3 months ago
<a href="https:&#x2F;&#x2F;arxiv.org&#x2F;abs&#x2F;2404.00282" rel="nofollow">https:&#x2F;&#x2F;arxiv.org&#x2F;abs&#x2F;2404.00282</a>
评论 #42912155 未加载
评论 #42911750 未加载