Openlayer supports tests based on custom metrics that you write. This guide shows how you can write a custom metric in Python and push it to your project using the Openlayer CLI.

Prerequisites

We are going to use the Openlayer CLI to push the custom metric to your project. Therefore, to follow this guide, you must:
  1. Install the Openlayer CLI.
  2. Log in to the Openlayer CLI with the command openlayer login.
  3. Link your working directory to an Openlayer project with the command openlayer link.

Write the metric

For each custom metric, you must prepare a directory structured as follows:
metric_name
├── run.py
├── requirements.txt
└── config.json
Below are instructions for each component.
The run.py file contains the custom metric logic. Make sure to create a class that inherits from the BaseMetric from Openlayer’s Python SDK and implements the compute_on_dataset method. Below is a sample run.py for a metric similar to accuracy:
run.py
from openlayer.lib.core import metrics
from openlayer.types.inference_pipelines import data_stream_params

class Metric(metrics.BaseMetric):
    """Computes my custom metric. Must inherit from metrics.BaseMetric."""

    def compute_on_dataset(self, dataset: metrics.Dataset) -> metrics.MetricReturn:
        """Method that computes a metric given a dataframe and config."""

        # NOTE: Insert any logic you want here

        dataset.df["score"] = dataset.df.apply(
            lambda x: self.compute_on_row(x, dataset.config), axis=1
        )
        score = dataset.df["score"].mean()

        return metrics.MetricReturn(
            value=score, unit=None, meta=None, added_cols={"score"}
        )

    def compute_on_row(
        self, data: dict, config: data_stream_params.ConfigLlmData
    ) -> float:
        """E.g. Simple helper function to compute exact match on each row."""
        output = data[config["outputColumnName"]]
        gt = data[config["groundTruthColumnName"]]
        score = 0.0
        if output == gt:
            score = 1.0
        return score

# Don't change this
if __name__ == "__main__":
    Metric().run()
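To sanity-check the scoring logic before wiring it into the SDK, you can reproduce the row-wise exact-match computation on a plain pandas DataFrame. This is a standalone sketch: the config keys mirror the example above, and the column names ("output", "ground_truth") are illustrative.

```python
import pandas as pd

# Same config keys as in the run.py example above
config = {"outputColumnName": "output", "groundTruthColumnName": "ground_truth"}


def compute_on_row(row, config: dict) -> float:
    """Exact match: 1.0 if the model output equals the ground truth, else 0.0."""
    output = row[config["outputColumnName"]]
    gt = row[config["groundTruthColumnName"]]
    return 1.0 if output == gt else 0.0


df = pd.DataFrame(
    {
        "output": ["yes", "no", "maybe"],
        "ground_truth": ["yes", "no", "no"],
    }
)
df["score"] = df.apply(lambda row: compute_on_row(row, config), axis=1)
score = df["score"].mean()
print(score)  # 2 of 3 rows match
```

This is exactly the computation compute_on_dataset performs on dataset.df, minus the MetricReturn wrapping.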
The requirements.txt file lists all dependencies for your run.py. Make sure to include openlayer>=0.2.0a26 as one of the requirements.
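For the sample run.py above, a minimal requirements.txt could contain just the SDK pin the guide requires; add any extra libraries your own run.py imports:

```text
openlayer>=0.2.0a26
```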
The config.json specifies how to prepare the environment and run your custom metric. It also provides additional information about the metric, which is displayed on the platform. For example, your config.json could look like:
config.json
{
  "installCommand": "pip install -r requirements.txt",
  "runCommand": "python run.py",
  "name": "My Custom Metric",
  "description": "This is my favorite custom metric",
  "lowerBound": 0,
  "upperBound": 1
}
You can also declare configurable parameters for your metric using the parameterDefinitions field. See Configurable parameters for details.

Push the metric to Openlayer

Now that you have written your custom metric, you can push it to your Openlayer project with the openlayer metrics push command:
openlayer metrics push -d metric_name
where metric_name is the directory created in the previous section. After you push your metrics to the platform, they appear in the project’s metric settings page.
In the example above, we pushed/updated a single metric: metric_name. However, you can also push multiple custom metrics at once. Assuming you have the following directory structure with all your custom metrics:
metrics
├── metric_name_1
├── metric_name_2
└── ...
where each metric_name_i is a subdirectory containing a custom metric, you can push/update them all at once with:
openlayer metrics push -d metrics

Configurable parameters

Custom metrics can declare configurable parameters that users can adjust when creating tests — without modifying the metric code.

Defining parameters

Add a parameterDefinitions array to your config.json:
config.json
{
  "installCommand": "pip install -r requirements.txt",
  "runCommand": "python run.py",
  "name": "My Custom Metric",
  "description": "This is my favorite custom metric",
  "lowerBound": 0,
  "upperBound": 1,
  "parameterDefinitions": [
    {
      "name": "threshold",
      "type": "float",
      "required": false,
      "defaultValue": 0.5
    },
    {
      "name": "column_name",
      "type": "string",
      "required": true,
      "defaultValue": "output"
    }
  ]
}
Each parameter definition has the following fields:

| Field | Type | Description |
| --- | --- | --- |
| name | string | The parameter name (used as the key in params.json) |
| type | string | One of "string", "number", or "float" |
| required | boolean | Whether the parameter is required when creating a test |
| defaultValue | string \| number \| null | The default value used when no value is provided |
Once pushed, the parameter inputs will appear in the test creation modal, allowing users to configure the metric’s behavior per test.
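For the parameterDefinitions above, the params.json that Openlayer writes for a configured test would look something like this (the values shown are illustrative):

```json
{
  "threshold": 0.8,
  "column_name": "output"
}
```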

Reading parameters in your metric

When Openlayer runs your metric, it writes a params.json file in the same directory as your run.py. This file contains the parameter values configured for the current test (or the defaults if no values were set). Here’s how to read parameters in your metric:
run.py
import json
import os
from openlayer.lib.core import metrics


def load_params() -> dict:
    """Load parameters from params.json if it exists."""
    params_path = os.path.join(os.path.dirname(__file__), "params.json")
    if os.path.exists(params_path):
        with open(params_path, "r") as f:
            return json.load(f)
    return {}


class Metric(metrics.BaseMetric):
    """Custom metric with configurable parameters."""

    def __init__(self):
        super().__init__()
        params = load_params()
        self.threshold = float(params.get("threshold", 0.5))
        self.column_name = params.get("column_name", "output")

    def compute_on_dataset(self, dataset: metrics.Dataset) -> metrics.MetricReturn:
        # Use self.threshold and self.column_name in your logic
        ...

# Don't change this
if __name__ == "__main__":
    Metric().run()
For local development, you can create a params.json file manually in your metric directory to test different parameter values before pushing.
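The fallback behavior can be exercised end to end without the platform. This sketch uses a standalone variant of load_params that takes a directory argument instead of __file__, so it can run against a temporary directory; the keys and defaults mirror the parameterDefinitions example:

```python
import json
import os
import tempfile


def load_params(directory: str) -> dict:
    """Load parameters from params.json if it exists (same pattern as run.py)."""
    params_path = os.path.join(directory, "params.json")
    if os.path.exists(params_path):
        with open(params_path, "r") as f:
            return json.load(f)
    return {}


with tempfile.TemporaryDirectory() as d:
    # Before params.json exists, the metric falls back to its defaults
    default_threshold = float(load_params(d).get("threshold", 0.5))
    print(default_threshold)  # 0.5

    # Simulate the file Openlayer writes for a configured test
    with open(os.path.join(d, "params.json"), "w") as f:
        json.dump({"threshold": 0.8, "column_name": "output"}, f)
    configured = load_params(d)
    print(configured["threshold"])  # 0.8
```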

Development mode

Once you have pushed your metrics to Openlayer, every new commit you push will run your selected custom metrics. You can see which metrics are selected in the project’s metric settings page; newly pushed custom metrics are selected by default. You can view the logs for your custom metric computation in the commit overview page.

Monitoring mode

Once you’ve pushed your metrics to Openlayer, you can create tests in monitoring mode on your custom metrics. Any custom metric with at least one test associated with it will be run. You will see the option to create tests on your metric in the “Create tests” page.

(Optional) Pre-compute the metric

Typically, Openlayer runs your custom metrics using the information you provide in the config.json (namely, the installCommand and the runCommand). However, in development mode, you can also run your custom metrics before pushing to Openlayer. In that case, Openlayer uses the results you computed instead of executing your code, which lets you run your code in any environment. The only requirement is that you have already generated outputs for the current model + dataset pair (using openlayer batch). To run your custom metrics on a commit, make sure your metrics directory is in the same place as your openlayer.json.
# if you don't have metrics/ alongside your dev commit, fetch them:
openlayer metrics pull --selected

# from the directory containing your openlayer.json
openlayer metrics run