The article explains very little about how it works, other than “end to end.” Basically it just claims higher benchmarks. Is there a paper?<p>It sounds like a downside will be that you can’t mix and match. You’ll have to use their LLM and their way of creating the embeddings.