
Develop+Deploy RAG Bots with LlamaEdge: Across OSes, NPUs, GPUs Using Vector DB

1 point, by 3Sophons, over 1 year ago

1 comment

3Sophons, over 1 year ago
LlamaEdge is not just a lightweight and portable runtime for chatbots. It is an app dev platform for portable LLM apps! RAG chatbots and AI agents can be developed on Mac or Windows, compiled to Wasm, and then deployed across Nvidia / AMD / ARM devices, fully taking advantage of on-device GPUs, NPUs, etc.

Hence, besides the LLM runtime, these LLM apps also need to manage embeddings in a vector DB. This example shows how to access the Qdrant vector DB from your portable Wasm apps:

https://github.com/WasmEdge/wasmedge-db-examples/tree/main/qdrant
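
For a rough sense of what that Qdrant access involves, here is a minimal sketch of the usual REST calls a RAG app makes against Qdrant: create a collection, upsert embedding points, then search for nearest neighbours of a query embedding. It is illustrative only, not the linked example's code: the collection name, vector size, and sample embeddings are made up, and reqwest's blocking API (with its `json` feature) stands in for whatever Wasm-compatible HTTP stack the WasmEdge example actually uses.

```rust
// Sketch: talking to Qdrant over its REST API from Rust.
// Assumes a Qdrant instance at http://localhost:6333; the collection name,
// vector size, and embeddings below are placeholders for illustration.
use reqwest::blocking::Client;
use serde_json::json;

fn main() -> Result<(), Box<dyn std::error::Error>> {
    let base = "http://localhost:6333";
    let client = Client::new();

    // 1. Create a collection sized to the embedding model's output dimension.
    client
        .put(format!("{base}/collections/docs"))
        .json(&json!({ "vectors": { "size": 4, "distance": "Cosine" } }))
        .send()?
        .error_for_status()?;

    // 2. Upsert points: embedding vectors plus payload metadata (the source text).
    client
        .put(format!("{base}/collections/docs/points?wait=true"))
        .json(&json!({
            "points": [
                { "id": 1, "vector": [0.05, 0.61, 0.76, 0.74],
                  "payload": { "text": "LlamaEdge is a portable LLM runtime." } },
                { "id": 2, "vector": [0.19, 0.81, 0.75, 0.11],
                  "payload": { "text": "Wasm apps run across GPUs and NPUs." } }
            ]
        }))
        .send()?
        .error_for_status()?;

    // 3. Search for the nearest neighbours of a query embedding; the payloads
    //    of the returned hits become the context fed to the chat model.
    let hits: serde_json::Value = client
        .post(format!("{base}/collections/docs/points/search"))
        .json(&json!({ "vector": [0.2, 0.1, 0.9, 0.7], "limit": 2, "with_payload": true }))
        .send()?
        .error_for_status()?
        .json()?;

    println!("{hits:#}");
    Ok(())
}
```

In a RAG loop the same three calls appear in order: index documents once (steps 1-2), then run step 3 per user query and prepend the retrieved payload text to the LLM prompt.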