科技回声

Ragas is an open-source library designed for evaluating and testing RAG (Retrieval-Augmented Generation) and other LLM applications. It offers a diverse set of metrics and methods, including synthetic test data generation, to help you assess your RAG applications. Ragas was initially developed to address our own needs for evaluating RAG chatbots last year.### Problems Ragas Can Solve:- How can you select the best components for your RAG, such as the retriever, reranker, and LLM?- How can you create a test dataset without incurring significant expenses and time?We believe there's a need for an open-source standard for evaluating and testing LLM applications. Our vision is to establish this standard for the community. We're addressing this challenge by adapting ideas from the traditional ML lifecycle for LLM applications.### ML Testing Evolved for LLM ApplicationsRagas is founded on the principles of metrics-driven development. Our goal is to develop and innovate techniques inspired by the latest research to address the challenges in evaluating and testing LLM applications.We don't think that merely building a sophisticated tracing tool will solve the evaluation and testing challenges. Instead, we aim to tackle these issues from a foundational level. To this end, we're introducing methods such as automated synthetic test data curation, metrics, and feedback utilization. These approaches are inspired by lessons learned from deploying stochastic models throughout our careers as machine learning engineers.While our current focus is on RAG pipelines, we intend to expand Ragas to test a broad spectrum of compound systems. This includes systems based on RAGs, agentic workflows, and various transformations.### Try RagasExperience Ragas by trying it out in Google Colab [here](<a href="https://colab.research.google.com/github/shahules786/openai-cookbook/blob/ragas/examples/evaluation/ragas/openai-ragas-eval-cookbook.ipynb" rel="nofollow">https://colab.research.google.com/github/shahules786/openai-...</a>). For more information, read our [documentation](<a href="https://docs.ragas.io/">https://docs.ragas.io/</a>).We would love to hear feedback from the Hacker News community :)

6 条评论

jcyriac大约 1 年前

The synthetic test data generation seems very useful. Do you have any idea of the cost of running this?

评论 #39767816 未加载

diyaliza大约 1 年前

How does Ragas handle the challenge of adapting traditional ML testing methodologies to suit the intricacies of LLM applications?

kurianbenoy大约 1 年前

How is the synthetic test data generation done in ragas?Can I use custom Open source models like Mistral 7B to generate synthetic test data?

ajinmsaji大约 1 年前

Is there support to open-source models? btw love your work!!

评论 #39767750 未加载

thomaspeter1998大约 1 年前

How do you actually use models for evaluation?

donalex98大约 1 年前

Can I use OSS models like Mixtral with it?

评论 #39767721 未加载

6 条评论

jcyriac大约 1 年前

The synthetic test data generation seems very useful. Do you have any idea of the cost of running this?

评论 #39767816 未加载

diyaliza大约 1 年前

How does Ragas handle the challenge of adapting traditional ML testing methodologies to suit the intricacies of LLM applications?

kurianbenoy大约 1 年前

How is the synthetic test data generation done in ragas?Can I use custom Open source models like Mistral 7B to generate synthetic test data?

ajinmsaji大约 1 年前

Is there support to open-source models? btw love your work!!

评论 #39767750 未加载

thomaspeter1998大约 1 年前

How do you actually use models for evaluation?

donalex98大约 1 年前

Can I use OSS models like Mixtral with it?

评论 #39767721 未加载

Show HN: Ragas – Open-source library for evals and testing RAG systems

6 条评论

Show HN: Ragas – Open-source library for evals and testing RAG systems

6 条评论