TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Groq surpasses 1,200 tokens/sec with Llama 3 8B

43 点作者 YourCupOTea12 个月前

4 条评论

LorenDB12 个月前
Groq is an insane company. SambaNova (discussed yesterday[0]) is also very promising. However, what I really want to see is local AI accelerator chips a la Tenstorrent Grayskull that can boost local generation to hundreds of tokens per second while being more efficient than GPUs.<p>[0]: <a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=40508797">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=40508797</a>
评论 #40530512 未加载
windowshopping12 个月前
Is groq related to Twitter&#x27;s grok or is that just a very unfortunate naming coincidence?
评论 #40527650 未加载
评论 #40527683 未加载
评论 #40527504 未加载
评论 #40527491 未加载
评论 #40527587 未加载
andy_xor_andrew12 个月前
When reading Hacker News you develop a signal&#x2F;noise filter, where lots of headlines make bold claims but you filter them out as embellishment or exaggeration.<p>My bullshit detector went off when I first saw Groq posted on HN - a startup is making their own chips (doubt) that performs faster than anything Nvidia has for inference (doubt) and accelerates LLMs to hundreds&#x2F;thousands of tokens per second?? Mega doubt.<p>But... then I tried their demo, and... yeah, it&#x27;s that good. Such an amazing company of talented individuals.
评论 #40527662 未加载
评论 #40531104 未加载
behnamoh12 个月前
They&#x27;re not responsive to my questions on Twitter, so I&#x27;m asking here:<p><pre><code> When will Groq support a real API (not experimental beta preview)? When will Groq support logprobs?! When will Groq actually tell us what their rate limit is?! </code></pre> Until these aren&#x27;t answered, many of us can&#x27;t actually build on Groq.<p>Edit: It seems I&#x27;m getting downvoted by Groq employees...
评论 #40527670 未加载