2 pointsby jacky2wongover 1 year ago

1 comment

jacky2wongover 1 year ago

One complaint we heard over and over was that no one likes to write tests when evaluating LLMs.<p>We automate writing these tests by having ChatGPT generate sample queries and answers based on a given text. These test cases can then be run through the testing framework and can be run similarly to PyTest.

Auto-Evaluation of LLMs with DeepEval

1 comment

Auto-Evaluation of LLMs with DeepEval

1 comment