Exploring explainability

Local and global explanations

So far in the tutorial, we’ve been conducting error analysis mainly by identifying slices of the data where our model does not perform as well as we would like. However, we haven’t yet asked a fundamental question: why does our model behave the way it does?

We will use explainability techniques to understand some of the driving forces behind our model’s predictions.

In broad strokes, explainability techniques help justify our model’s predictions. These explanations can be local or global, each providing a distinct perspective to practitioners and businesses. Let’s explore these two layers of explainability for our urgent event classification model from the bottom up.

Local explanations

Local explanations provide insights into individual model predictions.

For our urgent event classifier, local explanations help us answer the question: why did our model classify a specific message as Urgent or Not urgent?

To look at local explanations, click on any row of the data shown below the Error analysis panel. With Openlayer, you can access local explanations for all of the model’s predictions, powered by LIME, one of the most popular model-agnostic explainability techniques.
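Under the hood, LIME perturbs the input text and fits a simple surrogate model to estimate how much each token pushed the prediction. The sketch below reproduces that idea with the open-source lime library; the toy pipeline, training data, and example message are illustrative stand-ins, not the tutorial’s actual model or the Openlayer API.

```python
# Minimal sketch of LIME token attributions for a text classifier.
# The toy pipeline and data below are stand-ins for the real model.
from lime.lime_text import LimeTextExplainer
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

texts = [
    "We need water and food urgently",
    "People are trapped, please send help",
    "Thanks for the update, all is fine here",
    "Nice weather today, nothing to report",
]
labels = [1, 1, 0, 0]  # 1 = Urgent, 0 = Not urgent
pipeline = make_pipeline(TfidfVectorizer(), LogisticRegression())
pipeline.fit(texts, labels)

# LIME perturbs the message and fits a local surrogate to score each token.
explainer = LimeTextExplainer(class_names=["Not urgent", "Urgent"])
explanation = explainer.explain_instance(
    "We need help in Port-au-Prince",
    pipeline.predict_proba,
    num_features=6,
)

# Each token gets a signed weight toward the "Urgent" class.
for token, weight in explanation.as_list():
    print(f"{token}: {weight:+.3f}")
```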

Let’s pick a row from the problematic error slice we identified in the previous section, namely, the slice of data where our model predicts Not urgent but the ground truth is Urgent.

Each token receives a score. Values shown in shades of green (not present in this data sample) indicate tokens that pushed the model’s prediction in the correct direction. Values shown in shades of red indicate tokens that pushed the model’s prediction in the wrong direction. Therefore, it is important to remember that these values are always relative to the true label.

In this specific example, the true label is Urgent, but our model predicted Not urgent.

At the end of the day, the model’s prediction is a balance between features that push it in the right direction and features that nudge it in the wrong direction.

Error analysis should be a scientific process, with hypothesizing and experimenting at its core. That is one of the roles of what-if analysis.

To conduct a what-if analysis with local explanations, click on Edit row, modify the sentence, and then click on What-if at the bottom of the page. For example, what would our model’s prediction be if we changed the location from Cul-de-Sac to Port-au-Prince?

Now we can directly compare the two explanations. Notice that if we simply change the location, the model gets it right.
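The same comparison can be sketched programmatically. Continuing the toy example above (reusing its pipeline), the snippet below swaps the location and compares the predicted probabilities; the toy model will not reproduce the real model’s location sensitivity, the point is only the workflow.

```python
# What-if comparison: swap the location and compare predictions.
# Reuses the toy `pipeline` from the LIME sketch above.
original = "We need help in Cul-de-Sac"
what_if = "We need help in Port-au-Prince"

for text in (original, what_if):
    proba_urgent = pipeline.predict_proba([text])[0][1]
    print(f"{text!r} -> P(Urgent) = {proba_urgent:.3f}")
```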

What if?

Feel free to explore some local explanations. Can you use the what-if analysis and slightly modify sentences to flip our model’s prediction in other rows?

💡

Sensitivity to the message's location

Notice in the previous example that we could completely flip the model’s prediction just by modifying the location. This is possibly a symptom of certain locations not being as well represented as others in the training set.
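One way to probe this hypothesis outside the platform is to count how often each location appears in the training messages, broken down by label. The DataFrame below is a hypothetical stand-in for the tutorial’s training set.

```python
# Rough check of location representation per label in the training set.
# `train_df` and its contents are hypothetical stand-ins.
import pandas as pd

train_df = pd.DataFrame({
    "text": [
        "Send water to Port-au-Prince now",
        "People injured in Port-au-Prince, we need help",
        "All quiet in Cul-de-Sac today",
    ],
    "label": ["Urgent", "Urgent", "Not urgent"],
})

for location in ["Port-au-Prince", "Cul-de-Sac"]:
    subset = train_df[train_df["text"].str.contains(location, case=False)]
    print(location, subset["label"].value_counts().to_dict())
```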

🚀

Actionable insights

Using local explanations, practitioners can get to the root cause of problematic predictions their models are making. Furthermore, they can build confidence that the model is considering reasonable data to make its predictions and not simply over-indexing to certain tokens.

After exploring other rows, we can see that this behavior is widespread and drives the bulk of the samples in the largest error class. Many of the Urgent messages in the training set come from the disaster response to the 2010 Haiti earthquake. Furthermore, many of them are from Haiti’s capital, Port-au-Prince. Therefore, it looks like our model is over-indexing to that location.

It is clear, then, that to boost our model’s performance and shrink the most critical error class, we need to increase our model’s robustness to locations. We will solve this problem in one of the last parts of the tutorial.

Global explanations

Now we move one layer up to global explanations.

Global explanations are built by aggregating local explanations and help us understand which tokens and stopwords contributed the most to the (mis)predictions made by the model over a data cohort or the whole dataset.
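Conceptually, the aggregation looks like the sketch below: run LIME over every row in a cohort and accumulate the signed token weights. It reuses the toy pipeline and explainer from the local-explanation sketch above and is only meant to illustrate the idea, not how Openlayer computes it internally.

```python
# Aggregate local LIME explanations into a global, per-token importance.
# Reuses the toy `pipeline` and `explainer` defined earlier.
from collections import defaultdict

cohort = [
    "I need help with my homework",
    "We need help, several buildings collapsed",
    "Please help, no water since yesterday",
]

token_weights = defaultdict(list)
for text in cohort:
    exp = explainer.explain_instance(text, pipeline.predict_proba, num_features=6)
    for token, weight in exp.as_list():
        token_weights[token].append(weight)

# Average contribution of each token toward "Urgent" across the cohort.
global_importance = {t: sum(w) / len(w) for t, w in token_weights.items()}
for token, weight in sorted(global_importance.items(), key=lambda kv: -abs(kv[1])):
    print(f"{token}: {weight:+.3f}")
```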

For example, for the messages about the weather, what were the tokens and stopwords that contributed the most to our model’s mispredictions? What about our model’s correct predictions?

These kinds of questions can be easily answered with Openlayer.

Let’s now dive into the other error class our model is making, namely, the slice of data where our model classifies a message as Urgent, but for which the ground truth is Not urgent. What token(s) are possibly behind such behavior?

The answer will be shown in the Feature importance section of the Error analysis panel. However, we first need to filter the data down to the cohort we are interested in explaining.

Identifying the most mispredictive tokens

First, filter the data to show only the rows our model predicted as Urgent, but for which the label is Not urgent. Then, head to the Feature importance section on the Error analysis panel to look at the most predictive and mispredictive tokens.

When we filter data cohorts, the Feature importance section shows the most predictive and most mispredictive tokens and stopwords for that specific cohort.
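Outside the platform, the same cohort can be built with a simple filter before feeding it to the aggregation sketch above. `val_df` and its columns are hypothetical stand-ins for the validation data shown in the platform.

```python
# Build the cohort: rows predicted "Urgent" whose ground truth is "Not urgent".
# `val_df` is a hypothetical stand-in for the validation set.
import pandas as pd

val_df = pd.DataFrame({
    "text": [
        "I need help with finding the area of shaded polygons",
        "We need help, the bridge collapsed",
        "Thanks, everything is fine",
    ],
    "label": ["Not urgent", "Urgent", "Not urgent"],
    "prediction": ["Urgent", "Urgent", "Not urgent"],
})

cohort = val_df[(val_df["prediction"] == "Urgent") & (val_df["label"] == "Not urgent")]
print(cohort["text"].tolist())  # texts to pass to the aggregation sketch above
```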

Filtering by mispredictive tokens

Click on one of the blocks shown to see what happens. Did you notice what happened to the data shown below the Error analysis panel?

When we click on one of the blocks, the data shown below the Error analysis panel is filtered down to only the rows that fall within that category.

💡

Most mispredictive expressions

The previous image tells us that in most of the rows where our model made the mistake of predicting Urgent when the true label was Not urgent, the tokens “need” and “help” were some of the main culprits.

When the model classifies a message that has the tokens “need” and “help”, it is inclined to put it into the Urgent bucket. Indeed, a lot of the messages with these tokens are urgent, from people needing help. However, this is not always the case. For example, the messages “I need help with finding the area of shaded polygons” and “I need to do an easy fast science fair project or I’ll flunk my class can y’all please help me” are clearly not urgent, but they are fooling our model.

🚀

Actionable insights

The model is, in a sense, paying too much attention to certain expressions that tend to be associated with Urgent messages. To shrink this error class, we need to increase our model’s robustness to these and other expressions, possibly making use of synthetic data with paraphrases of the problematic expressions, such as “I need to”, “I want to”, “I’d like to”, and others.
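A simple, rule-based version of that augmentation could look like the sketch below, which substitutes the problematic expression with paraphrases to create synthetic rows. A production setup might use a paraphrasing model instead; the substitution table and helper are purely illustrative.

```python
# Rule-based sketch of paraphrase augmentation for the problematic expressions.
# The substitution table and `augment` helper are illustrative assumptions.
import re

PARAPHRASES = {
    "I need to": ["I want to", "I'd like to", "I have to"],
}

def augment(message: str, label: str):
    rows = [(message, label)]
    for expression, alternatives in PARAPHRASES.items():
        if re.search(re.escape(expression), message, flags=re.IGNORECASE):
            for alternative in alternatives:
                paraphrased = re.sub(
                    re.escape(expression), alternative, message, flags=re.IGNORECASE
                )
                rows.append((paraphrased, label))
    return rows

for text, label in augment("I need to do an easy science fair project", "Not urgent"):
    print(label, "|", text)
```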

What if?

Pick one of the rows where the model makes the mistake mentioned above. Can you flip its prediction by paraphrasing the sentence?

In the next part of the tutorial, we will zoom into our model’s predictions as we conduct an error cohort analysis and document error patterns.