Learn how to use the column drift test to detect drift in your data
Method | Application | Score |
---|---|---|
Anderson-Darling | Applies only to numerical columns. | Returns a p-value. If p-value < threshold, drift is detected. Recommended threshold: 0.05. |
Chi-Square | Applies only to categorical columns. | Returns a p-value. If p-value < threshold, drift is detected. Recommended threshold: 0.05. |
Cramer-Von-Mises | Applies only to numerical columns. | Returns a p-value. If p-value < threshold, drift is detected. Recommended threshold: 0.05. |
Energy Distance | Applies only to numerical columns. | Returns a distance. If distance >= threshold, drift is detected. Recommended threshold: 0.1. |
Epps-Singleton | Applies only to numerical columns. | Returns a p-value. If p-value < threshold, drift is detected. Recommended threshold: 0.05. |
Fisher Exact Test | Applies only to categorical columns. | Returns a p-value. If p-value < threshold, drift is detected. Recommended threshold: 0.05. |
G-test | Applies only to categorical columns. | Returns a p-value. If p-value < threshold, drift is detected. Recommended threshold: 0.05. |
Hellinger Distance | Applies to categorical and numerical columns. | Returns a distance. If distance >= threshold, drift is detected. Recommended threshold: 0.1. |
Jensen-Shannon Distance | Applies to categorical and numerical columns. | Returns a distance. If distance >= threshold, drift is detected. Recommended threshold: 0.1. |
Kullback-Leibler Divergence | Applies to categorical and numerical columns. | Returns the divergence. If divergence >= threshold, drift is detected. Recommended threshold: 0.1. |
K-S Test | Applies only to numerical columns. | Returns a p-value. If p-value < threshold, drift is detected. Recommended threshold: 0.05. |
Mann-Whitney U-Rank Test | Applies only to numerical columns. | Returns a p-value. If p-value < threshold, drift is detected. Recommended threshold: 0.05. |
Population Stability Index | Applies to categorical and numerical columns. | Returns the computed PSI value. If PSI >= threshold, drift is detected. Recommended threshold: 0.1. |
Student’s t-test | Applies only to numerical columns. | Returns a p-value. If p-value < threshold, drift is detected. Recommended threshold: 0.05. |
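Text Content Drift | Applies only to text columns. | Returns the ROC AUC of a binary classifier trained to distinguish text from the current and reference data. Drift is detected when the ROC AUC is high: a value near 0.5 means the classifier cannot tell the two datasets apart, while a value near 1 means they are easy to separate. Recommended threshold range: 0.5 to 1. |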
Total Variation Distance | Applies only to categorical columns. | Returns a p-value. If p-value < threshold, drift is detected. Recommended threshold: 0.05. |
Wasserstein Distance | Applies only to numerical columns. | Returns a distance. If distance >= threshold, drift is detected. Recommended threshold: 0.1. |
Z-test | Applies only to categorical columns. | Returns a p-value. If p-value < threshold, drift is detected. Recommended threshold: 0.05. |
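
The table uses two opposite decision rules: methods that return a p-value flag drift when the score falls below the threshold, while methods that return a distance, divergence, or PSI flag drift when the score reaches or exceeds it. The sketch below illustrates both rules, plus one common formulation of the Population Stability Index for a categorical column. The names `is_drift`, `psi_categorical`, and the `eps` floor are hypothetical and chosen for illustration; the actual implementation behind the test may bin, smooth, or normalize differently.

```python
import math
from collections import Counter


def is_drift(score, threshold, score_kind):
    """Apply the decision rule from the table (hypothetical helper).

    score_kind is "p_value" for statistical tests, or "distance" for
    distance, divergence, and PSI methods.
    """
    if score_kind == "p_value":
        return score < threshold   # small p-value: distributions differ
    if score_kind == "distance":
        return score >= threshold  # large distance: distributions differ
    raise ValueError(f"unknown score_kind: {score_kind!r}")


def psi_categorical(reference, current, eps=1e-4):
    """Population Stability Index over category shares.

    Assumption: shares are floored at eps so that a category missing
    from one dataset does not produce log(0) or a division by zero.
    """
    ref_counts = Counter(reference)
    cur_counts = Counter(current)
    psi = 0.0
    for category in set(ref_counts) | set(cur_counts):
        ref_share = max(ref_counts[category] / len(reference), eps)
        cur_share = max(cur_counts[category] / len(current), eps)
        psi += (cur_share - ref_share) * math.log(cur_share / ref_share)
    return psi


# Illustrative data: the share of "a" moves from 50% to 80%.
reference = ["a"] * 50 + ["b"] * 50
current = ["a"] * 80 + ["b"] * 20

psi = psi_categorical(reference, current)
print("PSI:", round(psi, 4), "drift:", is_drift(psi, 0.1, "distance"))
```

Keeping the comparison direction in a single helper avoids the most common mistake with this table, which is applying the `<` rule to a distance or the `>=` rule to a p-value.
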
In `tests.json`, here are a few valid configurations for the column drift test: