TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

39 点作者 roboboffin4 个月前

5 条评论

s-macke4 个月前
&gt; Notably, no self-reflection training data or prompt was included, suggesting that advanced System 2 reasoning can foster intrinsic self-reflection.<p>They suggest, that self-reflection is an emergent phenomena of reasoning. Impressive. Can&#x27;t wait to see the code.
throwaway815234 个月前
Abstract is impressive. I&#x27;m surprised this post hasn&#x27;t gotten more attention.
评论 #42653983 未加载
helltone4 个月前
Off topic but how is MCTS usually implemented efficiently? It has a branching structure that doesn&#x27;t seem parallelizable (GPU).
fabmilo4 个月前
I was just about to submit this link and redirected me to this page. I am shocked that it received only four comments. If you are working in the LLMs&#x2F;Agent space ( you are, right?) and you don&#x27;t understand the significance of this paper, you are set for failure.
dantodor4 个月前
The repo gives 404?
评论 #42654473 未加载