Personally identifiable information (PII)
Definition
The PII test asserts that the data contains no personally identifiable information (PII). Currently, the test checks for credit card numbers and Social Security numbers (SSNs).
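To illustrate the kind of check described above, here is a minimal sketch in Python. It is not the platform's actual implementation. The function names (find_pii, passes_pii_test), the regular expressions, and the use of the Luhn checksum to filter card-like numbers are all assumptions made for this example, and it only covers US-style SSNs written as AAA-GG-SSSS.

```python
import re

# Assumed formats (not taken from the platform): US SSNs written as
# AAA-GG-SSSS, and card numbers of 13 to 19 digits optionally separated
# by single spaces or hyphens.
SSN_RE = re.compile(r"\b(?!000|666|9\d\d)\d{3}-(?!00)\d{2}-(?!0000)\d{4}\b")
CARD_RE = re.compile(r"\b\d(?:[ -]?\d){12,18}\b")


def luhn_valid(digits: str) -> bool:
    """Return True if the digit string passes the Luhn checksum."""
    total = 0
    for i, ch in enumerate(reversed(digits)):
        d = int(ch)
        if i % 2 == 1:  # double every second digit, counting from the right
            d *= 2
            if d > 9:
                d -= 9
        total += d
    return total % 10 == 0


def find_pii(text: str) -> list[tuple[str, str]]:
    """Return (kind, match) pairs for SSN-like and card-like strings."""
    hits = [("ssn", m.group()) for m in SSN_RE.finditer(text)]
    for m in CARD_RE.finditer(text):
        digits = re.sub(r"[ -]", "", m.group())
        # Keep only digit runs that pass the checksum, to reduce false
        # positives from order numbers, phone numbers, and similar.
        if luhn_valid(digits):
            hits.append(("credit_card", m.group()))
    return hits


def passes_pii_test(rows: list[str]) -> bool:
    """Hypothetical test: passes only if no row contains detected PII."""
    return not any(find_pii(row) for row in rows)
```

For example, passes_pii_test(["order 12345"]) passes, while a row containing a string such as 123-45-6789 makes the test fail. Pattern matching like this is a heuristic: it can miss PII written in other formats and can flag digit strings that only look like PII.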
Taxonomy
- Category: Integrity.
- Task types: LLM.
- Availability: and .
Why it matters
- If the dataset is not anonymized, the PII it contains can be exposed in a data breach or lead to biased models.
- LLMs are prone to hallucinating (or leaking) PII.
Test configuration examples
If you are writing a tests.json, here are a few valid configurations for the PII test: