Seems like it's not a new 671B behemoth, just the V3 MoE engine with long-context RoPE + FlashAttention-2 (FA2), retargeted at theorem proving. See [1, 2]:<p>[1]: <a href="https://arxiv.org/abs/2405.14333" rel="nofollow">https://arxiv.org/abs/2405.14333</a><p>[2]: <a href="https://arxiv.org/abs/2408.08152" rel="nofollow">https://arxiv.org/abs/2408.08152</a>