TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Diffusion training from scratch on a micro-budget

135 点作者 lnyan4 个月前

5 条评论

philipkglass4 个月前
The differently styled images of &quot;astronaut riding a horse&quot; are great, but that has been a go-to example for image generation models for a while now. The introduction says that they train on 37 million real <i>and synthetic</i> images. Are astronauts riding horses now represented in the training data more than would have been possible 5 years ago?<p>If it&#x27;s possible to get good, generalizable results from such (relatively) small data sets, I&#x27;d like to see what this approach can do if trained exclusively on non-synthetic permissively licensed inputs. It might be possible to make a good &quot;free of any possible future legal challenges&quot; image generator just from public domain content.
评论 #42731804 未加载
llm_trw4 个月前
&gt;The estimated training time for the end-to-end model on an 8×H100 machine is 2.6 days.<p>That&#x27;s a $250,000 machine for the micro budget. Or if you don&#x27;t want to do it locally ~$2,000 to do it on someone else&#x27;s machine for the one model.
评论 #42731300 未加载
评论 #42731535 未加载
评论 #42738646 未加载
评论 #42732654 未加载
buyucu4 个月前
I love models on a budget. These are the ones that really make us think what we&#x27;re doing and bring out new ideas.
srameshc4 个月前
This is the first time I came across micro-budget term in AI context.<p>&gt; end-to-end model on an 8×H100 machine is 2.6 days based on the pricing on Lambda labs site, it&#x27;s about $215 which isn&#x27;t bad for training a model for educational purposes.
评论 #42743933 未加载
__loam4 个月前
The pixel art these models produce continues to look like shit and not be actual pixel art.<p>Where&#x27;d you get your dataset? Did you get permission from the rightsholders to use their work for this?
评论 #42732011 未加载