TechEcho

DeepSeek: Inference-Time Scaling for Generalist Reward Modeling

163 points by tim_sw about 2 months ago

4 comments

ALLTaken about 2 months ago
I'm not just impressed that every paper coming out is SOTA; they also lead the way in being open source in the pure definition of OSS, even with permissive licensing.

Let's not confuse the company with the country by over-fitting a narrative. Popular media is reinforcing hatred, or anything that sponsors them, especially toward weaker groups. Fewer repercussions and more clicks/money to be made, I guess.

While politicians may hate each other, scientists love to work with other aspiring scientists who have similar ambitions, and the only competition is in achieving measurable success and the reward it brings to the greater public.

Without any bias, it's genuinely admirable when companies release their sources to enable faster scientific progress cycles. It's ironic that this company is dedicated to finance yet shares its progress, while non-profits and companies dedicated purely to AI are locking away all knowledge about their findings.

Are there other companies like DeepSeek that you know of that regularly release great papers? I am following Mistral already, but I'd love to enrich the sources of publications that I read. Highly appreciated!
resters about 2 months ago
DeepSeek R1 is by far the best at writing prose of any model, including Grok-3, GPT-4o, o1-pro, o3, Claude, etc.

Paste in a snippet from a book and ask the model to continue the story in the style of the snippet. It's surprising how bad most of the models are.

Grok-3 comes in a close second, likely because it is actually DeepSeek R1 with a few mods behind the scenes.
mentalgear about 2 months ago
Happy to see DeepSeek using the correct (and much more idiomatic) term "inference-time scaling" instead of the grotesque construction "test-time compute" that OpenAI came up with.
bilsbie about 2 months ago
Any idea why I lost interest in DeepSeek? I used it and Grok-3 a whole bunch when they first came out, but now I've fallen back to Claude for everything.