8 points by chadash 5 months ago

2 comments

Shout out to the video team for this super cute tower of Minis next to the Christmas tree.

https://x.com/exolabs/status/1872444906851229814

It only just occurred to me that Strix Halo could help fill the niche that Mac chips, with their huge memory bandwidth, currently enjoy. 256GB systems shouldn't be hard to build!!

The MI300A APU doesn't seem popular, but for consumers this mix of big CPU and GPU seems perhaps quite compelling!

talldayo 5 months ago

> The M4 Max has 546GB/s of memory bandwidth and ~34TFLOPS (fp16) = ~68 GB/s, a ratio of ~8.02. Whereas NVIDIA RTX 4090 has 1008GB/s memory bandwidth and ~330TFLOPS (fp16) = ~660GB/s, a ratio of ~1.52.

Why are we comparing FP16 performance when you're running inference on INT4-quantized models? It seems misleading to compare on a figure that isn't even the performance you're measuring.
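For anyone who wants to check the arithmetic in the quoted passage, here is a minimal sketch of how those ratios appear to be derived. It assumes the quote converts fp16 TFLOPS into a byte rate by multiplying by 2 bytes per value and then divides memory bandwidth by that number, keeping the quote's own units. The function name and the `bytes_per_value` parameter are illustrative, not from the article.

```python
def bandwidth_to_compute_ratio(mem_bw_gb_s, fp16_tflops, bytes_per_value=2):
    """Ratio of memory bandwidth to compute throughput expressed in bytes.

    Assumption: the quoted figures multiply TFLOPS by bytes per value
    (2 for fp16) and divide the memory bandwidth by the result.
    """
    compute_bytes = fp16_tflops * bytes_per_value
    return mem_bw_gb_s / compute_bytes


# Figures taken from the quoted passage.
m4_max = bandwidth_to_compute_ratio(546, 34)
rtx_4090 = bandwidth_to_compute_ratio(1008, 330)

print(f"M4 Max:   {m4_max:.2f}")
print(f"RTX 4090: {rtx_4090:.2f}")
```

Rounded to two places this gives about 8.03 and 1.53, so the quoted ~8.02 and ~1.52 look truncated rather than rounded. `bytes_per_value` is left as a parameter so the same calculation can be redone under a different precision assumption.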


Running DeepSeek V3 671B on M4 Mac Mini Cluster
