Definition

The PII test asserts that no personal identifiable information (PII) is in the data. Currently, the test can check for credit card numbers and social security numbers (SSN).

Taxonomy

  • Category: Integrity.
  • Task types: LLM.
  • Availability: and .

Why it matters

  • If the dataset is not anonymized, it can lead to a data breach or biased models.
  • LLMs are prone to hallucinating (or leaking) PII.

Test configuration examples

If you are writing a tests.json, here are a few valid configurations for the character length test: