I want to create an LLM UI interface for my fine-tuned Llama model, similar to a private GPT chat. The interface should allow me to upload multiple documents, add prompts to them, capture the response, and format it with Markdown tags based on one of the attached template documents. What should I use? Preferably something open-source so I can customize it. What do you recommend? I looked at Gradio, but it seems too “modest,” and OpenWebUI is too complex and extensive. I would also like the solution to support PWA technology.
I would use standard full-stack technologies. React/NextJS is popular right now, great DX. This is what I'm using and make an API call to the LLM servers like any other external service (to the full-stack app)