TE
科技回声
首页
24小时热榜
最新
最佳
问答
展示
工作
中文
GitHub
Twitter
首页
Evaluating RAG for large scale codebases
41 点
作者
GavCo
3 个月前
3 条评论
jimminyx
3 个月前
Collapse
Conceptually, LLM-as-a-judge doesn't feel like it should work — it's like asking a student to grade their own homework. it's very unintuitive for me that it actually seems to work pretty well
评论 #43046366 未加载
评论 #43051458 未加载
评论 #43047443 未加载
评论 #43047676 未加载
33a
3 个月前
If the self evaluation makes it better, then why not do the self evaluation as part of the normal RAG workflow?
namanyayg
3 个月前
Collapse
Who's data are they training on? Are they storing and using all customer data?
评论 #43047659 未加载