TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

All You Need Is 4x 4090 GPUs to Train Your Own Model

125 点作者 sabareesh5 个月前

25 条评论

gzer05 个月前
This is a great build, thanks for sharing your learnings.<p>The best build I have seen so far had 6x4090&#x27;s. Video: <a href="https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?v=C548PLVwjHA" rel="nofollow">https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?v=C548PLVwjHA</a><p><pre><code> Specifications - GPU Accelerator - 6 x 24GB NVIDIA GeForce RTX 4090 - Processor - Intel Xeon W7-3465X, 28C&#x2F;56T, 2.5GHz - 4.8GHz - Memory - 256GB (8x32GB) DDR5 ECC 4800MHz - System Drive - 2TB Samsung 980 PRO NVMe PCIe 4.0 M.2 SSD - Storage Drive - 4TB Samsung 870 EVO SSD - Operating System - Ubuntu 20.04 </code></pre> An interesting choice to go with 256GB of DDR5 ECC; if spending so much on the 6x4090&#x27;s, might as well try to hit 1 TB of RAM as well.<p>The cost of this... not even sure. Astronomical.
评论 #42536116 未加载
评论 #42536117 未加载
keyle5 个月前
This article was written or rewritten via your model right?<p>The last paragraphs fell totally like AI.<p>Anyway I&#x27;d like a follow up on the curating, cleaning and training part which is far more interesting than how to select hardware which we&#x27;ve been doing for over 25 years.
评论 #42535950 未加载
评论 #42535760 未加载
_just7_5 个月前
I would be much more intrested in a piece on what you can train with this kind of rig, rather than the rig itself
评论 #42535770 未加载
评论 #42535777 未加载
评论 #42537245 未加载
sabareesh5 个月前
Hey HN I am sharing my experience on how i pretrained my own LLM by building a ML rig at home
评论 #42535783 未加载
magicalhippo5 个月前
On a tangent, if I wished to fine-tune one of those medium sized models like Gemma2 9B or Llama 3.2 Vision 11B, what kind of hardware would I need and how would I go about it?<p>I see a lot of guides but most focus on getting the toolchain up and running, and not much talk about what kind of dataset do I need to do a good fine tuning.<p>Any pointers appreciated.
评论 #42538252 未加载
评论 #42542386 未加载
rldjbpin5 个月前
nice writeup, but i feel that for most people, the software side of training models should be more interesting and accessible.<p>for one, &quot;full&quot; gpu utilization, one or many, remains an open topic in training workflows. spending efforts towards that, while renting from cloud, is a more accessible and fruitful to me than to finetune for marginal improvements.<p>this course was a nice source of inspiration - <a href="https:&#x2F;&#x2F;efficientml.ai&#x2F;" rel="nofollow">https:&#x2F;&#x2F;efficientml.ai&#x2F;</a> - and i highly recommend looking into this to see what to do next with whatever hardware you have to work with.
KeplerBoy5 个月前
Let&#x27;s talk riser cables. I keep encountering issues with riser connectors claiming to support PCIe 4.0, which seem to have sub-par performance. They work fine with the GPUs and NICs I tested them with, but attaching a nvme drive causes all kinds of issues and prevents the machine from booting. I guess nvme isn&#x27;t as tolerant of elevated bit-error-rates.<p>That just doesn&#x27;t inspire a lot of confidence in those risers, so now I&#x27;m contemplating mcio risers.
评论 #42536579 未加载
评论 #42537246 未加载
xena5 个月前
I&#x27;d love to read something you wrote, not something you had an AI model write for you.
abc-15 个月前
Fun for a wealthy hobbyist, but if you want to do real work, you’re better off renting from Runpod. Good blog though.
评论 #42535822 未加载
bb885 个月前
All you need is a 4x 4090 GPUs and a dedicated 30 amp circuit.
评论 #42536040 未加载
评论 #42535787 未加载
评论 #42536014 未加载
halyconWays5 个月前
Why not 3090s? Same VRAM and cheaper. With both setups you&#x27;d be limited to 1B. By contrast, you can run 4-bit quants of Llama 70B on two {3,4}090s, and it&#x27;s still pretty lobotomized by modern standards.<p>You can also train your own model even without GPUs. Just depends on parameter size.
评论 #42536496 未加载
anonytrary5 个月前
Thanks for sharing. Have you prodded the model with various inputs and written an article that show various output examples? I&#x27;d love to get an idea of what sort of &quot;end product&quot; 4x4090s is capable of producing.
评论 #42536527 未加载
NKosmatos5 个月前
Wouldn’t a cluster of M4 minis cost less and provide more VRAM? There are posts about people getting decent performance for a lot less than 12k USD.
评论 #42538023 未加载
评论 #42537129 未加载
评论 #42536490 未加载
评论 #42536172 未加载
jmward015 个月前
You can get 4060 ti 16GB cards for ~$450 or 4070 ti 16gb for ~850 instead of the $2.5k for a 4090. I wonder how well 4 of those cards would perform. The 4060 TDP is 165w instead of 450w for the 4090. The 4070 looks like the best tradeoff though for cost&#x2F;power&#x2F;etc though. You could probably set up an 8 card 4070 ti 16gb system for less than the 4 card 4090 system
评论 #42537667 未加载
评论 #42537243 未加载
评论 #42536537 未加载
AnarchismIsCool5 个月前
Couldn&#x27;t you do better with 2x AGX Orin 64gb?
jsheard5 个月前
It&#x27;s probably better to hold out for the 5090 at this point, it&#x27;s coming very soon as is expected to have 32GB of VRAM.
评论 #42537193 未加载
评论 #42535766 未加载
Bancakes5 个月前
Anyone care to publish AMD training&#x2F;inference benchmarks using ROCm? They’re hard to find.
评论 #42537231 未加载
nitred5 个月前
Can someone definitively say for sure that I can just use two independent PSUs? One for GPUs and one for GPUs and motherboard and SATA? No additional hardware?
mcdeltat5 个月前
Is anyone else concerned with the power usage of recent AI? Computational efficiency doesn&#x27;t seem to be a strong point... And for what benefit? IMO the usefulness payoff is too low
JacksonDam5 个月前
Interesting that DLSS 3 is mentioned as an advantage?
评论 #42535971 未加载
评论 #42536503 未加载
486sx335 个月前
I’d love to hear the dev story of H100 , it seemed to come out of left field !
paxys5 个月前
Where exactly do you plug in this beast?
m4635 个月前
&quot;This needs 30 AMP circuit...&quot; lol
master_crab5 个月前
All you need is 4x 4090 GPUs to Train Your Own Model -- <i>and $12000 to buy them</i>
评论 #42536064 未加载
评论 #42535877 未加载
评论 #42535792 未加载
评论 #42535875 未加载
评论 #42536293 未加载
评论 #42536067 未加载
评论 #42535830 未加载
patagonianboy5 个月前
Yeah, it&#x27;s powerful, but can it run crysis?