So, an LLM scoring over 80 on MMLU is no longer newsworthy on Hacker News? We've indeed come a long way.<p>Or perhaps it's just the skepticism surrounding such "claims". That's why I reckon more chatbots that don't rely on an API, similar to Inflection (or Microsoft's Copilot ?), should be directly integrated into the LMSYS Leaderboard. Isn't this what they did with Gemini/Bard?