12 点作者 sonabinu4 个月前

3 条评论

> do you think this kind of RL is enough to generalize beyond math and code? as in generalizing into domains which aren't easily verifiable<p>> 1 Exactly the right question to be asking atm imo. 2 Not obvious. 3 Probably yes.<p>But both math and code are easy to verify, they are rigorous. There are many other tasks are not. I doubt what works for math and code and be generalized to other things.

johnneville4 个月前

<a href="https://xcancel.com/karpathy/status/1883941452738355376" rel="nofollow">https://xcancel.com/karpathy/status/1883941452738355376</a>

mainecoder4 个月前

Deepseek kills ClosedAI now the their stocks are trash, finally the bubble is popping yes hahahahah

Karpathy on Deepseek R1

3 条评论

Karpathy on Deepseek R1

3 条评论