Definition

The PPS is a metric that measures the predictive power of a feature with respect to the label/target. It has similarities with the Pearson correlation coefficient, but can also capture non-linear relationships. The PPS tests allows you to set a threshold on the PPS for a specific feature.

Taxonomy

  • Category: Integrity.
  • Task types: Tabular classification, tabular regression.
  • Availability: and .

Why it matters

  • Features with a low PPS score are likely to be redundant and can be removed from the dataset.

Test configuration examples

If you are writing a tests.json, here are a few valid configurations for the character length test: