Definition
The 99th latency percentile test ensures that the 99th percentile latency for the data is within a given range.
Taxonomy
- Task types: LLM, tabular classification, tabular regression, text classification.
- Availability: .
Why it matters
- The 99th percentile latency captures the performance experienced by nearly all users, excluding only the worst 1% of cases.
- This metric helps identify performance issues that could affect even a small percentage of users, which is important for maintaining high service quality.
- Monitoring the 99th percentile is crucial for applications where consistent performance is critical, even for edge cases.
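To make the check concrete, here is a minimal sketch of how a 99th percentile latency could be computed and compared against a range. It assumes latencies are given in milliseconds and uses the nearest-rank percentile method; the platform may use a different method (for example, linear interpolation). The function names `p99_latency` and `p99_within_range` are hypothetical and are not part of any library.

```python
import math


def p99_latency(latencies_ms):
    """Return the 99th percentile latency using the nearest-rank method (an assumption)."""
    if not latencies_ms:
        raise ValueError("latencies_ms must not be empty")
    ordered = sorted(latencies_ms)
    # 1-based rank of the smallest value that is >= 99% of all observations.
    rank = math.ceil(99 * len(ordered) / 100)
    return ordered[rank - 1]


def p99_within_range(latencies_ms, lower_ms, upper_ms):
    """Hypothetical check: pass when lower_ms <= p99 <= upper_ms."""
    return lower_ms <= p99_latency(latencies_ms) <= upper_ms


# 100 made-up request latencies: 98 fast requests and 2 slow outliers.
latencies = [120.0] * 98 + [900.0, 2500.0]

print(p99_latency(latencies))                    # 900.0
print(p99_within_range(latencies, 0.0, 1000.0))  # True
```

In this made-up sample, the single slowest request (2500 ms) falls in the worst 1% and does not affect the result, while the second-slowest (900 ms) determines the 99th percentile.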
Test configuration examples
If you are writing a tests.json, here are a few valid configurations for the 99th latency percentile test: