Definition
The min latency test ensures that the minimum latency for the data is within a given range.Taxonomy
- Task types: LLM, tabular classification, tabular regression, text classification.
- Availability: .
Why it matters
- The minimum latency represents the best-case performance of your model, indicating the fastest possible response time.
- This metric helps identify the baseline performance capability and can reveal issues with caching, connection overhead, or model optimization.
- Monitoring minimum latency is useful for understanding the lower bound of your system’s performance and ensuring it meets expectations for optimal conditions.
Test configuration examples
If you are writing atests.json
, here are a few valid configurations for the min latency test: