Definition
The 95th latency percentile test ensures that the 95th percentile latency for the data is within a given range.Taxonomy
- Task types: LLM, tabular classification, tabular regression, text classification.
- Availability: .
Why it matters
- The 95th percentile latency provides insight into the performance experienced by the vast majority of users, excluding only the worst 5% of cases.
- This metric is commonly used in SLA definitions as it balances between covering most user experiences while filtering out extreme outliers.
- Monitoring the 95th percentile helps ensure that nearly all users receive acceptable performance while being practical to achieve.
Test configuration examples
If you are writing atests.json
, here are a few valid configurations for the 95th latency percentile test: