TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Optimizing AI Inference at Character.ai

63 点作者 georgehill11 个月前

3 条评论

hackernewds11 个月前
Noam has been cooking at character.ai in stealth. Their model is impressively engaging
评论 #40741617 未加载
eachro11 个月前
Training in int8 is noteable (to me). I've been out of date with ML research for a bit now but last I recall, people were mostly training at full precision and then quantizing after training and finetuning a bit on the quantized model afterwards.
评论 #40740317 未加载
janalsncm11 个月前
&gt; we implemented customized int8 kernels for matrix multiplications and attention<p>I would be curious how this differs from [1] which is supported in Huggingface’s transformers library.<p>[1] <a href="https:&#x2F;&#x2F;arxiv.org&#x2F;abs&#x2F;2208.07339" rel="nofollow">https:&#x2F;&#x2F;arxiv.org&#x2F;abs&#x2F;2208.07339</a>