Definition
The correctness test evaluates the overall correctness of the generated answer. This metric is based on the Ragas aspect critique for correctness.Taxonomy
- Task types: LLM.
- Availability: and .
Why it matters
- Correctness ensures that your LLM generates responses that are accurate and free from errors.
- This metric helps identify when your model produces incorrect information, logical fallacies, or misleading content.
- It’s fundamental for applications where accuracy is critical, such as educational tools, fact-checking systems, or professional assistance applications.
Required columns
To compute this metric, your dataset must contain the following columns:- Input: The question or prompt given to the LLM
- Outputs: The generated answer/response from your LLM
Test configuration examples
If you are writing atests.json
, here are a few valid configurations for the correctness test:
Related
- Ragas integration - Learn more about Ragas metrics.
- Answer correctness test - Measure factual accuracy against ground truth.
- Coherence test - Evaluate logical consistency of responses.
- Faithfulness test - Evaluate consistency with provided context.
- Aggregate metrics - Overview of all available metrics.