15 点作者 Thoreandan大约 2 个月前

4 条评论

42772827大约 2 个月前

The headline seems to imply that the full 641GB model is running at >20tok/sec on the Mac Studio, but the blog says:<p>>The model only came out a few hours ago and MLX developer Awni Hannun already has it running at >20 tokens/second on a 512GB M3 Ultra Mac Studio ($9,499 of ostensibly consumer-grade hardware) via mlx-lm and this mlx-community/DeepSeek-V3-0324-4bit 4bit quantization, which reduces the on-disk size to 352 GB.

mdaniel大约 2 个月前

the non-archive.org submission, where simonw actually is watching and commenting <a href="https://news.ycombinator.com/item?id=43462317">https://news.ycombinator.com/item?id=43462317</a>

sinenomine大约 2 个月前

Impressive use of reasoner CoT distillation method applied to deepseek R1. MIT license for the weights. Thanks, Deepseek!

ilrwbwrkhv大约 2 个月前

The only open AI in town

DeepSeek-V3-0324 released, 641GB, MIT licensed, >20tok/SEC on $<10k hardware

4 条评论

DeepSeek-V3-0324 released, 641GB, MIT licensed, >20tok/SEC on $<10k hardware

4 条评论