TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Ask HN: Qwen3 – is it ready for driving AI agents?

1 点作者 morisil7 天前
It seems that Qwen3 is not capable of driving independent reasoning - it lacks the quality needed to power fully autonomous AI agents.<p>Initially I was quite impressed with it&#x27;s problem solving capabilities, when outputting the code through the chat interface. It addressed certain problems much better than Claude or Gemini. However, as soon as I switched to Alibaba Cloud&#x27;s API to provide Dashscope based implementation of cognizer interface of my new generation of AI agents (chain of code), the whole charm was gone.<p>Qwen3 struggles with structured generation attempts, quite often falling into an infinite loop when spitting out tokens.<p>It has troubles crossing boundaries of languages, which is crucial for my agents which are &quot;thinking in code&quot; - writing Kotlin script, containing JavaScript, containing SQL, etc., therefore it will not work well as automated software engineer.<p>It is &quot;stubborn&quot; - even when the syntax error in generated code is clearly indicated, it is rather wiling to output the same error code again and again, instead of testing another hypothesis.<p>It lacks the theory of mind and understanding of the context and the environment. For example when asked to check the recent news, it is always responding by trying to use BBC API url, with non-filled API key as a part of the request, while passing this url to the Files tool instead of the WebBrowser tool, which obviously fails.<p>And the last, but not least - censorship, for example Qwen3 will refuse to search for the information on the most recent anti-governmental protests in China. I wouldn&#x27;t be surprised if these censorship blockers were partially responsible for poor quality of cognition in other areas.<p>Maybe I&#x27;m doing something wrong, and you are getting much better results with this model for fully autonomous agents with feedback loop?

暂无评论

暂无评论