科技回声
Build models like we build open-source software

51 points by tristanz, over 3 years ago

4 comments

mmmeff, over 3 years ago
I feel like there are several industries that are practically computer science yet don't utilize open source effectively. Data science is definitely one, but the video game industry definitely comes to mind.

You could argue game engines are notoriously complex, but the Linux kernel would like a word.
tristanz, over 3 years ago
Collaborative incremental improvement of models would be extremely disruptive. While this happens via research, it's massively inefficient, particularly as pretrained models get larger and span multiple modalities.
amznbyebyebye, over 3 years ago
There is definitely a problem re: large parameter models; the issue is I don't think throwing software dev tools at this is the right solution.

The constraint is largely hardware. The incremental post-training done via transfer learning is generally not broadly applicable to many use cases.
sharemywin, over 3 years ago
I'm curious how DeepMind's MoE models Perceiver and Switch might play into managing an open distributed model.