TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Evaluating RAG for large scale codebases

41 pointsby GavCo3 months ago

3 comments

jimminyx3 months ago
Conceptually, LLM-as-a-judge doesn't feel like it should work — it's like asking a student to grade their own homework. it's very unintuitive for me that it actually seems to work pretty well
评论 #43046366 未加载
评论 #43051458 未加载
评论 #43047443 未加载
评论 #43047676 未加载
33a3 months ago
If the self evaluation makes it better, then why not do the self evaluation as part of the normal RAG workflow?
namanyayg3 months ago
Who's data are they training on? Are they storing and using all customer data?
评论 #43047659 未加载