Tech Echo (科技回声)

18 comments

LZ_Khan 4 months ago

Note: This is not an actual model but rather an announcement of an effort to reproduce the R1 model.

ipnon 4 months ago

Is this what the Web was like in the beginning? Something exciting and fascinating every week?

nutanc 4 months ago

How can we help? Can crowdsourcing help? Is there a list of tasks that we want a crowd to do? I am asking because we have run a couple of crowdsourcing efforts: we collected story data in Telugu (Chandamama Kathalu) and ASR speech data with the help of college students. Since we have access to the students, we can mobilize them and get this going. We will also be running an internship program for 100,000 students in Telangana as part of Viswam [1] in April, and can include some work as part of this effort.

[1] https://viswam.ai/

breadwinner 4 months ago

From the article: "they didn't release everything; although the model weights are open, the datasets and code used to train the model are not."

Is that true of Meta's Llama as well? Specifically, is the code used to train the model not open? (I know no one releases datasets.) If so, the label "open source" is inappropriate. "Open weights" would be more appropriate.

sunshine-o 4 months ago

Now that things are really getting wild in the LLM space and people are just running anything that comes along, it seems, I did a quick search on the threat model of hosting your own LLM.

I didn't find much, starting with llama.cpp, which just reminds you to sandbox and isolate everything if running untrusted models.

I feel we are back in the Windows 95 / early Internet era, when people would just run anything without caring about security.

fblp 4 months ago

Given DeepSeek's open philosophy I wonder what their response is to simply being asked for access to the code and data that this project intends to recreate?

drakenot 4 months ago

What are some other domains outside of Math and Coding that would be suitable for RL with automated verification?

zoobab 4 months ago

For "open source", we will wait until Debian ships them to have the guarantee that it's actually "open" and comes with "sources". Right now it's a mystery how they produce their binaries.

DeflectedFlux 4 months ago

About the training data: can't the datasets from the Tulu3 model by the Allen Institute be used? They claim to have used a fully open-source training dataset.

htrp 4 months ago

The HF team tweeted they'd be doing this over the weekend. I guess now it's an official project with headcount.

freddealmeida 4 months ago

How is this open vs. what DeepSeek did?

cadamsdotcom 4 months ago

Exciting to see this being reproduced, loving the hyper-fast movement in open source!

This is exactly why it is not “US vs China”; the battle is between heavily capitalized Silicon Valley companies and open source.

Every believer in this tech owes DeepSeek some gratitude, but even they stand on the shoulders of giants in the form of everyone else who pushed the frontier forward and chose to publish, rather than exploit, what they learned.

vinni2 4 months ago

I wonder how long it will take to reproduce it, and whether having access to the latest GPUs would speed it up?

amelius 4 months ago

Are there any other groups trying this?

Babawomba 4 months ago

Super cool to see an open initiative like this. Love the idea of replicating DeepSeek-R1 in a transparent way.

I do like the idea of making these reasoning techniques accessible to everyone. If they really manage to replicate the results of DeepSeek-R1, especially on a smaller budget, that’s a huge win for open-source AI.

I’m all for projects that push innovation and share the process with others, even if it’s messy.

But yeah, lots of hurdles. They might hit a wall because they don’t have DeepSeek’s original datasets.

fl4tul4 4 months ago

Is OpenAI open as of yesterday?

readthenotes1 4 months ago

Stopped reading when it said DeepSeek broke the stock market.

vinni2 4 months ago

Where are the evaluation numbers? Without them you can’t call it a reproduction.

Open-R1: an open reproduction of DeepSeek-R1
