科技回声

skavi, almost 2 years ago

The GH200 was announced in May. What’s new is a variant with HBM3e memory which increases capacity (96GB -> 144GB) and bandwidth (4TB/s -> 5TB/s). The posted article butchers this. Here’s a much better source: <a href="https://www.anandtech.com/show/20001/nvidia-unveils-gh200-grace-hopper-gpu-with-hbm3e-memory" rel="nofollow noreferrer">https://www.anandtech.com/show/20001/nvidia-unveils-gh200-gr...</a>

jerojero, almost 2 years ago

Although this is obviously great news, it is increasingly troubling how AI innovation (from the hardware sector) is pretty much limited to Nvidia.<p>I haven't really looked into it much, so I'm not sure how Nvidia has come to absolutely dominate this market; AMD GPUs are quite good in the gaming sector... though I guess... with just two real players in the GPU market it's difficult to really get anywhere.

iFire, almost 2 years ago

Regarding NVIDIA's dominance, I want to point out that the Google project <a href="https://github.com/openxla/iree">https://github.com/openxla/iree</a> exists. IREE is a way to take TensorFlow, PyTorch, and MLIR workflows and run them on CPU, Vulkan compute, CUDA, ROCm, Metal, and other backends.<p><a href="https://github.com/RechieKho/IREE.gd">https://github.com/RechieKho/IREE.gd</a> -- RechieKho and I are collaborating on making this work for Godot Engine, but IREE.gd is still at the proof-of-concept stage.

DarthNebo, almost 2 years ago

JM2C<p>load_in_4bit=True will let you run Llama2-7B variants at 6.3GB VRAM.
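
For a rough sense of why 4-bit loading matters, here is a minimal back-of-the-envelope sketch of raw weight storage at different precisions. It assumes a nominal 7e9 parameters for a "7B" model, and the helper name `weight_memory_gib` is made up for illustration. It counts weights only; the 6.3GB figure above is the commenter's own number and also includes whatever else sits in VRAM at runtime, which this sketch does not try to break down.

```python
def weight_memory_gib(num_params: float, bits_per_param: int) -> float:
    """Raw weight storage in GiB.

    Ignores activations, KV cache, quantization metadata, and any layers
    that a real loader might keep at higher precision.
    """
    bytes_total = num_params * bits_per_param / 8
    return bytes_total / 2**30


# Assumption: a "7B" model has roughly 7e9 parameters.
PARAMS_7B = 7e9

if __name__ == "__main__":
    for bits in (16, 8, 4):
        gib = weight_memory_gib(PARAMS_7B, bits)
        print(f"{bits:>2}-bit weights: {gib:.2f} GiB")
```

Going from 16-bit to 4-bit weights cuts raw weight storage by a factor of four, which is what brings a 7B model within reach of consumer GPUs.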

Nvidia reveals new A.I. chip, says costs of running LLMs will drop significantly

4 comments
