TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Nvidia reveals new A.I. chip, says costs of running LLMs will drop significantly

73 点作者 TalktoCrystal将近 2 年前

4 条评论

skavi将近 2 年前
The GH200 was announced in May. What’s new is a variant with HBM3e memory which increases capacity (96GB -&gt; 144GB) and bandwidth (4TB&#x2F;s -&gt; 5TB&#x2F;s). The posted article butchers this. Here’s a much better source: <a href="https:&#x2F;&#x2F;www.anandtech.com&#x2F;show&#x2F;20001&#x2F;nvidia-unveils-gh200-grace-hopper-gpu-with-hbm3e-memory" rel="nofollow noreferrer">https:&#x2F;&#x2F;www.anandtech.com&#x2F;show&#x2F;20001&#x2F;nvidia-unveils-gh200-gr...</a>
jerojero将近 2 年前
Although this is obviously great news, it is increasingly troubling how AI innovation (from the hardware sector) is pretty much limited to Nvidia.<p>I haven&#x27;t really gotten too much into it, so I&#x27;m not sure how has Nvidia come to absolutely dominate this market; AMD GPUs are quite good in the gaming sector... though I guess... with just two real players in the GPU market it&#x27;s difficult to really get anywhere.
评论 #37059246 未加载
评论 #37067862 未加载
评论 #37059320 未加载
评论 #37059036 未加载
评论 #37059421 未加载
iFire将近 2 年前
Regarding NVIDIA domination, I want to promote that the Google project <a href="https:&#x2F;&#x2F;github.com&#x2F;openxla&#x2F;iree">https:&#x2F;&#x2F;github.com&#x2F;openxla&#x2F;iree</a> exists and IREE acts as a way to turn Tensorflow, Pytorch, and MLIR workflows to compute on cpu, vulkan compute, cuda, rocm, metal and others.<p><a href="https:&#x2F;&#x2F;github.com&#x2F;RechieKho&#x2F;IREE.gd">https:&#x2F;&#x2F;github.com&#x2F;RechieKho&#x2F;IREE.gd</a> -- RechieKho and I collaborate on making this work for Godot Engine, but IREE.gd is at a proof of concept stage.
DarthNebo将近 2 年前
JM2C<p>lood_in_4bit=True will let you run Llama2-7B variants at 6.3GB VRAM.