I get excited when I see things like this, even if they're simple because I think I can build a business on top of it. However, in complex tasks and real-life cases, it's successful in very few instances. I can't trust its stability. This makes me feel like I've been deceived. I believe agents will be used for tasks that require very little intelligence and constant repetition. It's actually an assistant in situations like this demo. I want to use agents everywhere, but they're not successful in their outputs. GPT-4 is still being used at large scales. I don't know what the situation is with high-level usage of models that do reasoning like o1 through APIs, I haven't tried it. I tried Deepseek, and I encountered stability issues with Deepseek APIs. Besides, R1 doesn't have function calls.