76 points by sytelus almost 2 years ago

5 comments

bigyikes almost 2 years ago

For reference, this beat GPT-3.5, which scored 47%, but not GPT-4, which scored a massive 67%.

Beating out GPT-3.5 at *any* task with such a small model is very cool to me.

How much longer until these dumb virtual assistants (Siri, Google, Alexa) get replaced with on-device LLMs? We’ve gotta be getting close. These small, optimized models are catching up quickly in so many domains.

asicsp almost 2 years ago

Dupe: "Textbooks are all you need" https://news.ycombinator.com/item?id=36413768

RecycledEle almost 2 years ago

I wonder if learning to train Generative AIs will teach us anything about teaching humans? I mean other than the use of AI tutors. Can we determine the usefulness of text by how well it trains AI?

p0w3n3d almost 2 years ago

What is B in this context? Billion?


throwaway4good almost 2 years ago

What is HumanEval?

50% on HumanEval with just 1.3B model
