Max latency
Definition
The max latency test checks that the maximum latency recorded in the data falls within a specified range.
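To make the definition concrete, here is a minimal sketch of the check it describes. It is not the platform's implementation: the function name `max_latency_within_range`, the millisecond unit, and the inclusive bounds are all assumptions made for illustration.

```python
from typing import Iterable, Optional


def max_latency_within_range(
    latencies_ms: Iterable[float],
    lower_ms: Optional[float] = None,
    upper_ms: Optional[float] = None,
) -> bool:
    """Return True if the largest latency lies inside [lower_ms, upper_ms].

    Hypothetical helper: bounds are treated as inclusive, and a bound
    left as None is not enforced.
    """
    values = list(latencies_ms)
    if not values:
        raise ValueError("no latency values to evaluate")

    worst = max(values)  # the single slowest observation decides the outcome
    if lower_ms is not None and worst < lower_ms:
        return False
    if upper_ms is not None and worst > upper_ms:
        return False
    return True


# Made-up per-request latencies, in milliseconds.
latencies = [120.0, 340.5, 95.2, 810.0]
print(max_latency_within_range(latencies, upper_ms=1000.0))  # True
print(max_latency_within_range(latencies, upper_ms=500.0))   # False
```

Because the check depends only on the maximum, a single slow request is enough to fail it, which is what distinguishes it from a check on an average or a percentile.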
Taxonomy
- Category: Performance.
- Task types: LLM, tabular classification, tabular regression, text classification.
- Availability: .
Why it matters
- Model latency directly affects the user experience. Testing it helps keep the worst-case latency within an acceptable range.
Test configuration examples
If you are writing a tests.json, here are a few valid configurations for the max latency test: