TechEcho
© 2025 TechEcho. All rights reserved.

Early Evals for OpenAI O3

30 points by maurycy 5 months ago

2 comments

macawfish 5 months ago
Wow, the demo where the user asks for untraceable payments shows some pretty sophisticated reasoning. The word "crafty" comes to mind.
og_kalu 5 months ago
New SOTA's on:

SWE-Bench - 71.7

Competition Code - 2727

ARC (Semi Private Eval) - 75.7 on low, 87.5% on high compute

Frontier Math (previous SOTA was 2%) - 25% on high compute