I'm fascinated by stories like these, because I think it shows that LLM's have only shown a small amount of their potential so far.<p>In a way, we've solved the raw "intelligence" part -- the next token prediction. (At least in certain domains like text.)<p>But now we have to figure out how to structure that raw intelligence into actual useful thinking patterns. How to take a problem, analyze it, figure out ways of breaking it down, try those ways until you run into roadblocks, then start figuring out some solution ideas, thinking about them more to see if they stand up to scrutiny, etc.<p>I think there's going to be a lot of really interesting work around that in the next few years. A kind of "engineering of practical thinking". This blog post is a great example of one first step.