3 pointsby sbpayneabout 1 month ago

1 comment

sbpayneabout 1 month ago

As new model releases support longer and longer context windows, there is a lot of discussion around whether RAG is still relevant.<p>RAG is here to stay for a while:<p>(1) Enterprises have much more data than reasonably will fit in a context window any time soon (2) Even if you can technically put 1M tokens in, that does not mean the model can effectively use it all (3) Longer input = higher latency and cost for inference<p>Would love any other thoughts on the topic!

评论 #43709765 未加载

Why RAG Is (Still) Not Dead

1 comment

Why RAG Is (Still) Not Dead

1 comment