Definition
The semantic similarity test assesses the similarity in meaning between sentences, by measuring their closeness in semantic space using advanced natural language processing techniques.Taxonomy
- Task types: LLM.
- Availability: and .
Why it matters
- Semantic similarity captures the meaning-based relationship between generated and reference text, going beyond surface-level string matching.
- This metric is particularly valuable when different phrasings can convey the same meaning, making it ideal for tasks like paraphrasing, summarization, or question answering.
- It provides a more nuanced evaluation than exact matching by considering the conceptual similarity rather than just textual similarity.
Required columns
To compute this metric, your dataset must contain the following columns:- Outputs: The generated text from your LLM
- Ground truths: The reference/expected text to compare against
Test configuration examples
If you are writing atests.json
, here are a few valid configurations for the semantic similarity test:
Related
- BLEU score test - Measure n-gram based text similarity.
- Quasi-exact match test - Allow partial matches and variations.
- Answer relevancy test - Measure relevance of answers to questions.
- Aggregate metrics - Overview of all available metrics.