Hi all, at work I get a lot of questions about the state of the art in open source language models, and how to build chatbots on top of your own data.<p>I made a 100% open source knowledge-grounded chatbot that allows you to ask questions and chat with the Transformers docs. Powered by Flan-UL2 (which I've anecdotally found to be the most performative commercially licensed open source instruction tuned LLM), Langchain, Instructor Embeddings (STOTA in vector embeddings), and FAISS.<p>You can clone the space and play around with your own data, clone the repo locally, and take every line of code for your own projects.