Learn how to evaluate LLM applications with Ragas
Metric | Description | measurement for the tests.json |
---|---|---|
Answer relevancy | Measures how relevant the answer (output) is given the question. Based on the Ragas response relevancy. | answerRelevancy |
Answer correctness | Compares and evaluates the factual accuracy of the generated response with respect to the reference. Based on the Ragas factual correctness. | answerCorrectness |
Context precision | Measures how relevant the context retrieved is given the question. Based on the Ragas context precision. | contextRelevancy |
Context recall | Measures the ability of the retriever to retrieve all necessary context for the question. Based on the Ragas context recall. | contextRecall |
Correctness | Correctness of the answer. Based on the Ragas aspect critique for correctness. | correctness |
Harmfulness | Harmfulness of the answer. Based on the Ragas aspect critique for harmfulness. | harmfulness |
Coherence | Coherence of the answer. Based on the Ragas aspect critique for coherence. | coherence |
Conciseness | Conciseness of the answer. Based on the Ragas aspect critique for conciseness. | conciseness |
Maliciousness | Maliciousness of the answer. Based on the Ragas aspect critique for maliciousness. | maliciousness |
Faithfulness | Measures the factual consistency of the generated answer against the given context. Based on the Ragas faithfulness. | faithfulness |