科技回声

13 条评论

How do people feel about LLM generated tests?I tried creating some on a personal project just using ChatGPT and it saved me a lot of toil on tests I probably wouldn’t have written. I did find I had low trust in refactoring my code, but higher than if I’d had no tests.It seemed like a net positive for low risk cases.

评论 #40428715 未加载

评论 #40428210 未加载

评论 #40428218 未加载

评论 #40428135 未加载

评论 #40427556 未加载

评论 #40428564 未加载

评论 #40432421 未加载

评论 #40428603 未加载

throwanem大约 1 年前

Per the cited real world figures, that's about 1 in 40 tests that pass human review, or a success rate of about 2.5%.It's hard to see value in spending resources this way right now - most notably, engineer time to review the generated tests. Improve the hit rate by an order of magnitude, and I suspect I'd feel differently.

评论 #40431634 未加载

rohitpaulk大约 1 年前

Tried this out on a Ruby codebase and it generated Python tests: <a href="https://github.com/Codium-ai/cover-agent/issues/17">https://github.com/Codium-ai/cover-agent/issues/17</a>. Is there any data available on whether this actually works?

评论 #40430696 未加载

darknoon大约 1 年前

Why does this webpage have auto-playing audio?

评论 #40427397 未加载

评论 #40428457 未加载

ryoshu大约 1 年前

The audio track on load that has no obvious way to stop playing prevents me from reading this content. Please don't do that.

_pdp_大约 1 年前

Using ChatGPT to generate unit tests works great almost out of the box, but I guess this system solves the remaining 5% to make it fully automated end-to-end. I believe this will work and help us write better software, given that I have experienced numerous cases where the generated tests (even with inferior models) catch no-so-obvious bugs.

joeberg8大约 1 年前

Seems decent enough for boilerplate. But if my code is incorrect, won’t an LLM generated a test for incorrect code?

评论 #40429077 未加载

Havoc大约 1 年前

Interesting idea. I generally don’t run tests at all (hobbyist) so even mediocre llm tests may actually be a win

muglug大约 1 年前

Don't see any actual output measurement in the conclusion — it seems like the effort may not have really borne fruit.

EGreg大约 1 年前

To the OP:Is your name a reference to Gronky Scripples? <a href="https://www.youtube.com/watch?v=4KG3v365mq4" rel="nofollow">https://www.youtube.com/watch?v=4KG3v365mq4</a>

yuvalkarmi大约 1 年前

Love that you took something that meta wrote about but didn't actually release and then... did it for them haha :)

wocka大约 1 年前

I get redirected to an oops 404 page when I try to create an account using Github.

jrawlings大约 1 年前

Any chance of supporting integrations with AWS, Azure, GCP APIs?

13 条评论

data-ottawa大约 1 年前

评论 #40428715 未加载

评论 #40428210 未加载

评论 #40428218 未加载

评论 #40428135 未加载

评论 #40427556 未加载

评论 #40428564 未加载

评论 #40432421 未加载

评论 #40428603 未加载

throwanem大约 1 年前

评论 #40431634 未加载

rohitpaulk大约 1 年前

评论 #40430696 未加载

darknoon大约 1 年前

Why does this webpage have auto-playing audio?

评论 #40427397 未加载

评论 #40428457 未加载

ryoshu大约 1 年前

The audio track on load that has no obvious way to stop playing prevents me from reading this content. Please don't do that.

_pdp_大约 1 年前

joeberg8大约 1 年前

Seems decent enough for boilerplate. But if my code is incorrect, won’t an LLM generated a test for incorrect code?

评论 #40429077 未加载

Havoc大约 1 年前

Interesting idea. I generally don’t run tests at all (hobbyist) so even mediocre llm tests may actually be a win

muglug大约 1 年前

Don't see any actual output measurement in the conclusion — it seems like the effort may not have really borne fruit.

EGreg大约 1 年前

To the OP:Is your name a reference to Gronky Scripples? <a href="https://www.youtube.com/watch?v=4KG3v365mq4" rel="nofollow">https://www.youtube.com/watch?v=4KG3v365mq4</a>

yuvalkarmi大约 1 年前

Love that you took something that meta wrote about but didn't actually release and then... did it for them haha :)

wocka大约 1 年前

I get redirected to an oops 404 page when I try to create an account using Github.

jrawlings大约 1 年前

Any chance of supporting integrations with AWS, Azure, GCP APIs?

We created the first open source implementation of Meta's TestGen–LLM

13 条评论

We created the first open source implementation of Meta's TestGen–LLM

13 条评论