"Devin" AI automates Upwork job, making inferences on a computer vision model

3 points by drubio about 1 year ago

1 comment

ben_w about 1 year ago
The SWE-bench graph (Devin: 13.85%, Claude 2: 4.8%, GPT-4: 1.74%) looks surprising, perhaps even a bit suspicious (too far above the next best). Can someone elaborate further?

Perhaps it really is this good, perhaps it's just my anxiety looking for a reason to doubt. It was only 9 hours ago I was writing:

> It's certainly still possible today that some random individual has a crucial insight that gives them an edge over the big names

- https://news.ycombinator.com/item?id=39679669