Definition

The max latency test ensures that the maximum latency for the data is within a given range.

Taxonomy

  • Category: Performance.
  • Task types: LLM, tabular classification, tabular regression, text classification.
  • Availability: .

Why it matters

  • The latency of a model directly impacts the user experience. Setting up tests to monitor the latency ensures that the latency is kept within a given range.

Test configuration examples

If you are writing a tests.json, here are a few valid configurations for the character length test: