Am I the only one excited for the release but not overanalyzing their words? This thread feels full of personal interpretations. DeepSeek is still a business—great release, but expectations and motivations seem inflated.
“Pure garage-energy” is a great phrase.<p>Most interested to see their inference stack; I hope that’s one of the 5. I think most people are running R1 on a single H200 node, but DeepSeek had much less memory per GPU for their inference, so they must have run some cluster-based MoE deployment.
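A rough back-of-envelope sketch of why single-node vs. cluster matters here. The GPU memory sizes are public specs; the FP8-weights assumption and the 20% overhead reserve are my own guesses, not DeepSeek's actual deployment numbers:

```python
# Back-of-envelope: does DeepSeek-R1 (671B params) fit on one 8-GPU node?
# Assumes FP8 weights (1 byte/param) and reserves a fraction of memory
# for KV cache and activations -- both assumptions, not measured figures.

PARAMS_B = 671        # total parameters, in billions
BYTES_PER_PARAM = 1   # FP8 quantized weights (assumption)

def weights_gb() -> float:
    """Approximate weight footprint in GB (1e9 params * 1 byte ~= 1 GB)."""
    return PARAMS_B * BYTES_PER_PARAM

def fits_on_node(gpu_mem_gb: float, gpus_per_node: int = 8,
                 overhead: float = 0.2) -> bool:
    """True if the weights fit on one node while leaving `overhead`
    of total memory free for KV cache and activations."""
    usable = gpu_mem_gb * gpus_per_node * (1 - overhead)
    return weights_gb() <= usable

print(fits_on_node(141))  # 8x H200, 141 GB each -> True
print(fits_on_node(80))   # 8x 80 GB GPUs       -> False
```

Under these assumptions, an 8×H200 node (~1.1 TB total) holds the FP8 weights with room to spare, while an 8×80 GB node does not, which is consistent with needing expert-parallel serving spread across multiple nodes.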
> Starting next week, we'll open-source 5 repos – one daily drop<p>Probably counts as an announcement of an announcement? Let’s wait for the actual repo drops before discussing them, especially because there are no details about what will be open-sourced other than<p>> These are humble building blocks of our online service: documented, deployed and battle-tested in production.
Deep respect for DeepSeek and what they've done with all the innovation and research they have been putting out in the open.<p>"Because every line shared becomes collective momentum that accelerates the journey. Daily unlocks begin soon. No ivory towers - just pure garage-energy and community-driven innovation" is a great phrase.
In fact they are totally dismantling OpenAI. Most likely without any intention on their part.<p>LLMs are a more legitimate version of blockchain, back when most CIO magazines were stuffed with "What's your blockchain strategy?" essays.<p>The AI bubble will burst, and it will burst hard. By the end of 2026 at the latest.
Kinda interesting to see where the moat is in the AI space. Good base models can always be distilled when you have access to the API. System prompts can get leaked, and UI tricks can be copied. In the end, the moat might be in the hardware and vertical integration.
This is great to see! Open-sourcing infrastructure tools can really accelerate innovation in the AI space. I've found that having access to well-documented repos makes it much easier to experiment and build on existing work. Are there any specific areas these repos focus on, like distributed training or model serving?
How do the valuations of foundation-model companies hold up when comparable models are being firmly open-sourced by Facebook and DeepSeek? It seems likely that building these models will not produce hundreds of billions in value, given that China and Facebook are giving them away largely for free.
Looking forward to it! I'll generally make an effort to use open models over proprietary alternatives when the use case permits: open models getting better and more popular encourages more models to become open as well, a prerequisite for a future where we can build self-hosted solutions that aren't beholden to the control of megacorps and AI monopolies.
> Why? Because every line shared becomes collective momentum that accelerates the journey.<p>Truly admirable on their part and a great paradigm for others. The reasons for this don't really matter to me, but I can't help but wonder if they were somehow obliged or otherwise <i>indebted</i> to follow this route.
Well, although R1-671b is way too expensive for me to self-host, given their past open-source (or open-weight) contributions, I DO have high expectations of them.<p>Each and every contribution to the open-source community will be helpful. Thanks, DeepSeek!
Remember when OpenAI was doing this:<p>"OpenAI threatens to revoke o1 access for asking it about its chain of thought"<p><a href="https://news.ycombinator.com/item?id=41534474">https://news.ycombinator.com/item?id=41534474</a><p>Not only did DeepSeek open-source their model, they also showed users the chain of thought right up front, which everyone else rushed to emulate when they saw how much users liked it.
> These are humble building blocks of our online service: documented, deployed and battle-tested in production. No vaporware, just code that moved our tiny moonshot forward.<p>My not-so-innocent guess is that they are looking to crowd-source their online platform (essentially the front end) in order to reduce costs. Still acceptable, though, as they made the model open-weight and partially reproducible.
DeepSeek seems to be racking up huge PR wins as the "aw shucks" modest boy genius, while the Americans come off as pouty jerks.<p>Amodei's and Hassabis's comments in particular came across as arrogant and annoying.
I really like this definition of "AGI": when <i>everyone</i> (yes, everyone) benefits from very powerful AI models released for free, not gate-kept by one company, costing $0 to use commercially or for research, and you can do whatever you want with them.<p>Unlike the other camp, which believes that "AGI" means "raising billions of dollars to return $100BN of profits to their investors." (Which is complete nonsense.)<p>While not totally "open source" by the strictest definition, it is at least better than releasing no model at all, with no mention of the architecture in the system card or paper, just vague comments about the "performance."<p>Ladies and gentlemen, this is closer to being a better "Open AI." Unlike the other alleged $157BN "non-profit" scam.<p>I think you know which one <i>really</i> is beneficial to humanity and is the <i>real</i> "Open AI."
Is it out of the realm of possibility to look at this move as a way to take down the moat of the closed-source AI companies?<p>I mean, strategically this could be the first use of open source in this way.
Tbh this just feels like the same playbook as OAI: an open start, then less so over time.<p>Mistral has been holding the line on that front remarkably well.
I really admire their mindset of striving for the betterment of humanity.
There was a time when OpenAI, Anthropic, and even Musk used to talk with that same lofty vision. But now, they've all shifted to competing for national interests instead, which is honestly quite disappointing.
Long live LLMs; I hope they infest every part of the internet with low-effort comments. The clear web, the deep, and the dark.<p>Imagine no more human interaction, just a permanent flood of meaningless, thoughtless word salad.<p>I think the Chinese are perfect to introduce such a product, very in line with what they usually produce.<p>Get ready for web3.o