Definition
The 90th latency percentile test ensures that the 90th percentile latency for the data is within a given range.
Taxonomy
- Task types: LLM, tabular classification, tabular regression, text classification.
- Availability: .
Why it matters
- The 90th percentile latency is the value at or below which 90% of requests complete, so it reflects the performance most users experience while excluding the slowest 10% of cases.
- This metric helps identify performance issues that affect a significant portion of users while being less sensitive to extreme outliers than maximum latency.
- Monitoring the 90th percentile is crucial for maintaining consistent user experience and meeting SLA requirements.
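To make the metric concrete, here is a minimal Python sketch of how a 90th percentile latency could be computed and checked against a range. It assumes the nearest-rank percentile definition and latencies in milliseconds; the platform may use a different interpolation method, and the function names `p90_latency` and `within_range` are hypothetical, not part of any API.

```python
def p90_latency(latencies_ms):
    """Nearest-rank 90th percentile (assumed definition, for illustration)."""
    if not latencies_ms:
        raise ValueError("need at least one latency value")
    ordered = sorted(latencies_ms)
    # Nearest rank: ceil(0.9 * n), computed with integers to avoid float rounding.
    rank = (90 * len(ordered) + 99) // 100
    return ordered[rank - 1]


def within_range(value, low=None, high=None):
    """True when value respects the optional lower and upper bounds."""
    return (low is None or value >= low) and (high is None or value <= high)


# Ten requests, one extreme outlier (hypothetical data).
latencies = [100, 110, 120, 130, 140, 150, 160, 170, 180, 2000]
p90 = p90_latency(latencies)            # 180
passed = within_range(p90, high=200)    # True
```

In this sample the maximum latency is 2000 ms, but the 90th percentile is 180 ms, which shows why the metric is less sensitive to a single extreme outlier than the maximum.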
Test configuration examples
If you are writing a tests.json, here are a few valid configurations for the 90th latency percentile test: