Integrity
Predictive power score (PPS)
Definition
The PPS is a metric that measures the predictive power of a feature with respect to the label/target. It has similarities with the Pearson correlation coefficient, but can also capture non-linear relationships. The PPS tests allows you to set a threshold on the PPS for a specific feature.
Taxonomy
- Category: Integrity.
- Task types: Tabular classification, tabular regression.
- Availability: and .
Why it matters
- Features with a low PPS score are likely to be redundant and can be removed from the dataset.
Test configuration examples
If you are writing a tests.json
, here are a few valid configurations for the character length test:
Related
Was this page helpful?