Looks like o1 performance without reasoning. Pretty good but seems reasonable that they didn’t want to call this 5 as they’ve already got a product out there that is as performant.<p>Another notable thing here is a big drop in hallucination rate as measured by their benchmarks (for whatever those are worth).