27 points by langitbiru 10 months ago

3 comments

[dupe]

More discussion: https://news.ycombinator.com/item?id=41046540

https://news.ycombinator.com/item?id=41046773

sagz 10 months ago

405B is already being served on WhatsApp!

https://ibb.co/kQ2tKX5

msoad 10 months ago

MMLU-Pro is the benchmark I trust the most. I noticed they are using 5-shot prompting and CoT. Is that true for GPT-4 and Sonnet as well?

Llama 3.1: Our most capable models to date