We created the first open source implementation of Meta's TestGen-LLM

142 points by gronky_, about 1 year ago

13 comments

data-ottawa, about 1 year ago
How do people feel about LLM generated tests?

I tried creating some on a personal project just using ChatGPT and it saved me a lot of toil on tests I probably wouldn’t have written. I did find I had low trust in refactoring my code, but higher than if I’d had no tests.

It seemed like a net positive for low risk cases.
throwanem, about 1 year ago
Per the cited real world figures, that's about 1 in 40 tests that pass human review, or a success rate of about 2.5%.

It's hard to see value in spending resources this way right now, most notably engineer time to review the generated tests. Improve the hit rate by an order of magnitude, and I suspect I'd feel differently.
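
To make the arithmetic in this comment concrete, here is a minimal sketch. The 1-in-40 figure is the one quoted above; the helper `reviews_per_accepted_test` is a hypothetical name, and it assumes each generated test is accepted independently with the same probability, so the expected number of tests a reviewer reads per accepted test is simply the reciprocal of the hit rate.

```python
from fractions import Fraction


def reviews_per_accepted_test(hit_rate: Fraction) -> Fraction:
    """Expected number of generated tests reviewed per accepted test.

    Assumption (not from the thread): acceptances are independent with a
    fixed probability, so the expectation is 1 / hit_rate.
    """
    return 1 / hit_rate


current = Fraction(1, 40)  # "about 1 in 40" from the comment
improved = current * 10    # "an order of magnitude" better

print(f"current hit rate:  {float(current):.1%}")
print(f"improved hit rate: {float(improved):.1%}")
print(
    "reviews per accepted test: "
    f"{reviews_per_accepted_test(current)} -> {reviews_per_accepted_test(improved)}"
)
```

Under these assumptions, a tenfold improvement moves the hit rate from 2.5% to 25%, which cuts the review burden from roughly 40 generated tests per accepted test to roughly 4.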
rohitpaulk, about 1 year ago
Tried this out on a Ruby codebase and it generated Python tests: https://github.com/Codium-ai/cover-agent/issues/17. Is there any data available on whether this actually works?
darknoon, about 1 year ago
Why does this webpage have auto-playing audio?
ryoshu, about 1 year ago
The audio track that starts on load, with no obvious way to stop it, prevents me from reading this content. Please don't do that.
_pdp_, about 1 year ago
Using ChatGPT to generate unit tests works great almost out of the box, but I guess this system solves the remaining 5% to make it fully automated end-to-end. I believe this will work and help us write better software, given that I have experienced numerous cases where the generated tests (even with inferior models) catch not-so-obvious bugs.
joeberg8, about 1 year ago
Seems decent enough for boilerplate. But if my code is incorrect, won’t an LLM generate a test for incorrect code?
Havoc, about 1 year ago
Interesting idea. I generally don’t run tests at all (hobbyist), so even mediocre LLM tests may actually be a win.
muglug, about 1 year ago
Don't see any actual output measurement in the conclusion; it seems like the effort may not have really borne fruit.
EGreg, about 1 year ago
To the OP:

Is your name a reference to Gronky Scripples? https://www.youtube.com/watch?v=4KG3v365mq4
yuvalkarmi, about 1 year ago
Love that you took something that Meta wrote about but didn't actually release and then... did it for them haha :)
wocka, about 1 year ago
I get redirected to an "oops" 404 page when I try to create an account using GitHub.
jrawlings, about 1 year ago
Any chance of supporting integrations with AWS, Azure, GCP APIs?