TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Karpathy on Deepseek R1

12 点作者 sonabinu4 个月前

3 条评论

billconan4 个月前
&gt; do you think this kind of RL is enough to generalize beyond math and code? as in generalizing into domains which aren&#x27;t easily verifiable<p>&gt; 1 Exactly the right question to be asking atm imo. 2 Not obvious. 3 Probably yes.<p>But both math and code are easy to verify, they are rigorous. There are many other tasks are not. I doubt what works for math and code and be generalized to other things.
johnneville4 个月前
<a href="https:&#x2F;&#x2F;xcancel.com&#x2F;karpathy&#x2F;status&#x2F;1883941452738355376" rel="nofollow">https:&#x2F;&#x2F;xcancel.com&#x2F;karpathy&#x2F;status&#x2F;1883941452738355376</a>
mainecoder4 个月前
Deepseek kills ClosedAI now the their stocks are trash, finally the bubble is popping yes hahahahah