And here's the academic article: <a href="https://arxiv.org/abs/2412.14161" rel="nofollow">https://arxiv.org/abs/2412.14161</a>
Waiting for the proponent default responses:<p>1. Next model will solve these problems.<p>2. Humans would have done worse.<p>3. Something something prompt engineering something something.