Transformer Inference Arithmetic (2022)

47 points by lawrencechen about 2 years ago

2 comments

thatcherc about 2 years ago
Author seems to be using billion == 10^12 instead of the common billion == 10^9. A lot of the math still works out since there's a multiply and a divide by a billion, but it is a little confusing to see passages like this:

> Given the parameter count, we can multiply by two to get bytes. So to calculate the size of the weights for a 52B model.

> 52e12⋅2 = 104e12 bytes ≈ 104GB
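
To make the arithmetic in the quoted passage concrete, here is a minimal sketch. It assumes 2 bytes per parameter (as the quote states) and decimal units, where 1 GB = 10^9 bytes and 1 TB = 10^12 bytes. The function name `weight_bytes` and the constants are hypothetical, chosen only for illustration.

```python
# Sketch of the weight-size arithmetic from the quoted passage.
# Assumptions: 2 bytes per parameter, decimal units (1 GB = 10**9 bytes).

GB = 10**9   # bytes in a decimal gigabyte
TB = 10**12  # bytes in a decimal terabyte


def weight_bytes(n_params: int, bytes_per_param: int = 2) -> int:
    """Return the storage needed for the weights, in bytes."""
    return n_params * bytes_per_param


# Common reading: "52B" means 52 * 10**9 parameters.
common = weight_bytes(52 * 10**9)
print(common // GB, "GB")

# Literal reading of the article's notation: 52e12 parameters.
literal = weight_bytes(52 * 10**12)
print(literal // TB, "TB")
```

Under the common reading the result matches the 104GB figure in the quote; taken literally, 52e12 parameters would come to 104 TB, which is why the notation reads as confusing even though the final number is right.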
gxh8N about 2 years ago
Very nicely written. I also like how it changes color every time I reload the article.