Min latency

Definition
Taxonomy
Why it matters
Test configuration examples
Related

Definition

The min latency test ensures that the minimum latency for the data is within a given range.

Taxonomy

Task types: LLM, tabular classification, tabular regression, text classification.
Availability: .

Why it matters

The minimum latency represents the best-case performance of your model, indicating the fastest possible response time.
This metric helps identify the baseline performance capability and can reveal issues with caching, connection overhead, or model optimization.
Monitoring minimum latency is useful for understanding the lower bound of your system’s performance and ensuring it meets expectations for optimal conditions.

Test configuration examples

If you are writing a tests.json, here are a few valid configurations for the min latency test:

[
  {
    "name": "Min latency above 100 msec",
    "description": "Make sure that the minimum latency is above 100 msec to ensure realistic measurements",
    "type": "performance",
    "subtype": "metricThreshold",
    "thresholds": [
      {
        "insightName": "metrics",
        "insightParameters": null,
        "measurement": "minLatency",
        "operator": ">",
        "value": 100
      }
    ],
    "subpopulationFilters": null,
    "mode": "development",
    "usesValidationDataset": true, // Apply test to the validation set
    "usesTrainingDataset": false,
    "usesMlModel": false,
    "syncId": "b4dee7dc-4f15-48ca-a282-63e2c04e0689" // Some unique id
  }
]

99th latency percentile

Median latency

⌘I

Get started

Workspace setup

Governance

Observability

Data quality monitoring

Offline testing

Tests

Administration

Other resources

Definition

Taxonomy

Why it matters

Test configuration examples

Get started

Workspace setup

Governance

Observability

Data quality monitoring

Offline testing

Tests

Administration

Other resources

​Definition

​Taxonomy

​Why it matters

​Test configuration examples

​Related

Definition

Taxonomy

Why it matters

Test configuration examples

Related