Definition
The median latency test ensures that the median (50th percentile) latency for the data is within a given range.Taxonomy
- Task types: LLM, tabular classification, tabular regression, text classification.
- Availability: .
Why it matters
- The median latency represents the typical user experience, as it’s the latency value that half of all requests fall below.
- This metric is less sensitive to outliers than the mean latency, providing a more stable measure of central tendency.
- Monitoring median latency helps ensure that the typical user receives acceptable performance, making it a key indicator for overall system health.
Test configuration examples
If you are writing atests.json
, here are a few valid configurations for the median latency test: