TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Kimi K1.5: Scaling Reinforcement Learning with LLMs

203 pointsby noch4 months ago

5 comments

NitpickLawyer4 months ago
Really unfortunate timing with Deepseek-R1 and the distills coming out at basically the same time. Hard for people to pay attention to, and plus open source > API, even if the results are a bit lower.
评论 #42784666 未加载
zurfer4 months ago
Is it fair to say that 2 of the 3 leading models are from Chinese labs? It's really incredible how fast China has caught up.
评论 #42780213 未加载
asah4 months ago
The set of math&#x2F;logic problems behind AIME 2024 appears to be... <a href="https:&#x2F;&#x2F;artofproblemsolving.com&#x2F;wiki&#x2F;index.php&#x2F;2024_AIME_I_Problems" rel="nofollow">https:&#x2F;&#x2F;artofproblemsolving.com&#x2F;wiki&#x2F;index.php&#x2F;2024_AIME_I_P...</a><p>Impressive stuff! But unclear to me if it&#x27;s literally just these 15 or if there&#x27;s a large problem set...
评论 #42786210 未加载
评论 #42781023 未加载
joaohkfaria4 months ago
But wait, which LLM models were used to train Kimi? It wasn&#x27;t clear on the report.
评论 #42799049 未加载
cuuupid4 months ago
I really, really dislike when companies use GitHub to promote their product by posting a &quot;research paper&quot; and a code sample.<p>It&#x27;s not even an SDK, library, etc., it&#x27;s just advertising.<p>I&#x27;ve noticed a number of China-based labs do this; they will often post a really cool demo, some images, and then either an API or just nothing except advertising for their company (e.g. model may not even exist). Often they will also promise in some GitHub issue that they will release the weights, and never do.<p>I&#x27;d love to see some sort of study here, I wonder what % of &quot;omg really cool AI model!!!&quot; hype papers [1] never provide an API, [2] cannot be reproduced at all, and&#x2F;or [3] promise but never provide weights. If this was any other field, academics would be up in arms about likely fraud, false advertising, etc.
评论 #42778378 未加载
评论 #42781475 未加载
评论 #42780848 未加载
评论 #42782638 未加载
评论 #42781038 未加载
评论 #42778390 未加载
评论 #42784478 未加载