TE
TechEcho
Home
24h Top
Newest
Best
Ask
Show
Jobs
English
GitHub
Twitter
Home
Evaluating RAG for large scale codebases
41 points
by
GavCo
3 months ago
3 comments
jimminyx
3 months ago
Collapse
Conceptually, LLM-as-a-judge doesn't feel like it should work — it's like asking a student to grade their own homework. it's very unintuitive for me that it actually seems to work pretty well
评论 #43046366 未加载
评论 #43051458 未加载
评论 #43047443 未加载
评论 #43047676 未加载
33a
3 months ago
If the self evaluation makes it better, then why not do the self evaluation as part of the normal RAG workflow?
namanyayg
3 months ago
Collapse
Who's data are they training on? Are they storing and using all customer data?
评论 #43047659 未加载