TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

DeepSeek's smallpond: Bringing Distributed Computing to DuckDB

198 点作者 xkgt2 个月前

7 条评论

OutOfHere2 个月前
Deepseek is the real &quot;open&lt;something&gt;&quot; that the world needed. Via these three projects, Deepseek has addressed not only efficient AI but also distributed computing:<p>1. smallpond: <a href="https:&#x2F;&#x2F;github.com&#x2F;deepseek-ai&#x2F;smallpond">https:&#x2F;&#x2F;github.com&#x2F;deepseek-ai&#x2F;smallpond</a><p>2. 3fs: <a href="https:&#x2F;&#x2F;github.com&#x2F;deepseek-ai&#x2F;3FS">https:&#x2F;&#x2F;github.com&#x2F;deepseek-ai&#x2F;3FS</a><p>3. deepep: <a href="https:&#x2F;&#x2F;github.com&#x2F;deepseek-ai&#x2F;DeepEP">https:&#x2F;&#x2F;github.com&#x2F;deepseek-ai&#x2F;DeepEP</a>
评论 #43254634 未加载
评论 #43254657 未加载
jakozaur2 个月前
It was already on HN recently:<p><a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=43200793">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=43200793</a><p><a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=43232410">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=43232410</a>
ogarten2 个月前
Looks like we are approaching the &quot;distributed&quot; phase of the distributed-centralized computing cycle :)<p>Not saying this is bad, but it&#x27;s just interesting to see after being in the industry for 8 years.
评论 #43251606 未加载
nemo44x2 个月前
Isn’t the whole point of DuckDB is that it’s not distributed?
评论 #43259743 未加载
评论 #43270897 未加载
评论 #43259637 未加载
benrutter2 个月前
I&#x27;m not massively knowledgable about the ins and outs of DeepSeek, but I think I&#x27;m in the right place to ask. My understanding is DeepSeek:<p>- Created comparable LLM performance for a fraction of the cost of OpenAI using more off-the-shelf hardware.<p>- Seem to be open sourcing lots of distributed stuff.<p>My question is, are those two things related? Did distributed computing allow the AI model somehow? If so how? Or is it not that simple?
评论 #43259431 未加载
maknee2 个月前
Does anyone have blogs with benchmarks to show the performance of running smallpond let alone 3fs + smallpond?<p>A lot of blogs praise these new systems, but don&#x27;t really provide any numbers :&#x2F;
cmollis2 个月前
spark is getting a bit long in the tooth.. interesting to see duckdb integrated with Ray for data-access partitioning across (currently) 3FS. probably a matter of time before they (or someone) supports S3. It should be noted that duckdb (standalone) actually does a pretty good job scanning s3 parquet on its own.