63 点作者 georgehill11 个月前

3 条评论

Noam has been cooking at character.ai in stealth. Their model is impressively engaging

评论 #40741617 未加载

eachro11 个月前

Training in int8 is noteable (to me). I've been out of date with ML research for a bit now but last I recall, people were mostly training at full precision and then quantizing after training and finetuning a bit on the quantized model afterwards.

评论 #40740317 未加载

janalsncm11 个月前

> we implemented customized int8 kernels for matrix multiplications and attention<p>I would be curious how this differs from [1] which is supported in Huggingface’s transformers library.<p>[1] <a href="https://arxiv.org/abs/2208.07339" rel="nofollow">https://arxiv.org/abs/2208.07339</a>

Optimizing AI Inference at Character.ai

3 条评论

Optimizing AI Inference at Character.ai

3 条评论