科技回声

7 comments

OutOfHere, 2 months ago

DeepSeek is the real "open<something>" that the world needed. Via these three projects, DeepSeek has addressed not only efficient AI but also distributed computing:

1. smallpond: https://github.com/deepseek-ai/smallpond
2. 3FS: https://github.com/deepseek-ai/3FS
3. DeepEP: https://github.com/deepseek-ai/DeepEP

jakozaur, 2 months ago

It was already on HN recently:

https://news.ycombinator.com/item?id=43200793
https://news.ycombinator.com/item?id=43232410

ogarten, 2 months ago

Looks like we are approaching the "distributed" phase of the distributed-centralized computing cycle :)

Not saying this is bad, but it's just interesting to see after being in the industry for 8 years.

nemo44x, 2 months ago

Isn't the whole point of DuckDB that it's not distributed?

benrutter, 2 months ago

I'm not massively knowledgeable about the ins and outs of DeepSeek, but I think I'm in the right place to ask. My understanding is that DeepSeek:

- Achieved LLM performance comparable to OpenAI's for a fraction of the cost, using more off-the-shelf hardware.
- Seems to be open sourcing lots of distributed stuff.

My question is, are those two things related? Did distributed computing enable the AI model somehow? If so, how? Or is it not that simple?

maknee, 2 months ago

Does anyone have blogs with benchmarks showing the performance of running smallpond, let alone 3FS + smallpond?

A lot of blogs praise these new systems, but don't really provide any numbers :/

cmollis, 2 months ago

Spark is getting a bit long in the tooth. Interesting to see DuckDB integrated with Ray for data-access partitioning across (currently) 3FS. It's probably a matter of time before they (or someone) support S3. It should be noted that DuckDB (standalone) actually does a pretty good job scanning S3 Parquet on its own.

DeepSeek's smallpond: Bringing Distributed Computing to DuckDB
