TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

LDB: Large Language Model Debugger via Verifying Runtime Execution Step by Step

2 pointsby panquecaabout 1 year ago

1 comment

panquecaabout 1 year ago
HumanEval Benchmark: 95.1 @ GPT-3.5<p>I wonder if it can be combined with projects like SWE-Agent to build powerful yet opensource coding agents.<p>- <a href="https:&#x2F;&#x2F;paperswithcode.com&#x2F;sota&#x2F;code-generation-on-humaneval" rel="nofollow">https:&#x2F;&#x2F;paperswithcode.com&#x2F;sota&#x2F;code-generation-on-humaneval</a><p>- <a href="https:&#x2F;&#x2F;github.com&#x2F;princeton-nlp&#x2F;SWE-agent">https:&#x2F;&#x2F;github.com&#x2F;princeton-nlp&#x2F;SWE-agent</a>