Supported PII & PHI entities

Syntho includes predefined recognizers for Personally Identifiable Information (PII) and Protected Health Information (PHI) entities. This page lists the entities Syntho can detect and the method it uses to detect each one.

List of supported entities

Global

| Entity Type | Description | Detection Method |
| --- | --- | --- |
| CARDINALITY | Any unique identifying number, characteristic, or code. | Cardinality / uniqueness scan |
| CREDIT_CARD | | Pattern match and checksum |
| DATE_TIME | Absolute or relative dates, periods, or times shorter than a day. | Pattern match and context |
| EMAIL_ADDRESS | An email address identifies a mailbox to which email messages are delivered. | Pattern match, context and RFC-822 validation |
| IBAN_CODE | The International Bank Account Number (IBAN) is an internationally agreed system for identifying bank accounts across national borders. It facilitates the communication and processing of cross-border transactions with a reduced risk of transcription errors. | Pattern match, context and checksum |
| IP_ADDRESS | An Internet Protocol (IP) address (either IPv4 or IPv6). | Pattern match, context and checksum |
| NRP | A person’s nationality, religious or political group. | Custom logic and context |
| LOCATION | Name of a politically or geographically defined location (cities, provinces, countries, postcodes, international regions, bodies of water, mountains). | Custom logic and context |
| PERSON | A full person name, which can include first names, middle names or initials, and last names. | Custom logic and context |
| PHONE_NUMBER | A telephone number. | Custom logic, pattern match and context |
| MEDICAL_LICENSE | Common medical license numbers. | Pattern match, context and checksum |
| URL | A URL (Uniform Resource Locator), a unique identifier used to locate a resource on the Internet. | Pattern match, context and top-level URL validation |
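Several of the entities above pair a pattern match with a checksum. This page does not say which checksums Syntho applies, so the following is only a sketch under two assumptions: that card numbers are checked with the Luhn algorithm and IBANs with the mod-97 check, which are the standard schemes for those identifiers. The regular expression and the names `CARD_PATTERN`, `luhn_valid`, `iban_valid` and `find_card_numbers` are hypothetical and not part of Syntho's API.

```python
import re

# Hypothetical pattern: 12 to 19 digits, optionally separated by single spaces or hyphens.
CARD_PATTERN = re.compile(r"\b\d(?:[ -]?\d){11,18}\b")


def luhn_valid(number: str) -> bool:
    """Return True if the digits in `number` pass the Luhn checksum."""
    digits = [int(ch) for ch in number if ch in "0123456789"]
    if not digits:
        return False
    total = 0
    # Walk from the rightmost digit and double every second digit.
    for i, d in enumerate(reversed(digits)):
        if i % 2 == 1:
            d *= 2
            if d > 9:
                d -= 9
        total += d
    return total % 10 == 0


def iban_valid(iban: str) -> bool:
    """Return True if `iban` passes the mod-97 check used for IBANs."""
    s = iban.replace(" ", "").upper()
    # Country code, two check digits, then 11 to 30 alphanumeric characters.
    if not re.fullmatch(r"[A-Z]{2}[0-9]{2}[A-Z0-9]{11,30}", s):
        return False
    # Move the first four characters to the end, then map letters to numbers (A=10 ... Z=35).
    rearranged = s[4:] + s[:4]
    numeric = "".join(str(int(ch, 36)) for ch in rearranged)
    return int(numeric) % 97 == 1


def find_card_numbers(text: str) -> list[str]:
    """Pattern match first, then keep only candidates that pass the checksum."""
    return [m.group() for m in CARD_PATTERN.finditer(text) if luhn_valid(m.group())]
```

Running the checksum after the pattern match filters out digit sequences that merely have the right length, such as order numbers, which is why the two steps are usually combined.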

USA

| Entity Type | Description | Detection Method |
| --- | --- | --- |
| US_BANK_NUMBER | A US bank account number is between 8 and 17 digits. | Pattern match and context |
| US_DRIVER_LICENSE | | Pattern match and context |
| US_ITIN | US Individual Taxpayer Identification Number (ITIN). Nine digits that start with a "9" and contain a "7" or "8" as the fourth digit. | Pattern match and context |
| US_PASSPORT | A US passport number with 9 digits. | Pattern match and context |
| US_SSN | A US Social Security Number (SSN) with 9 digits. | Pattern match and context |
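To illustrate what "pattern match and context" can look like, here is a minimal sketch for US_ITIN. The regular expression encodes the rule stated above (nine digits, first digit 9, fourth digit 7 or 8). Everything else is an assumption made for illustration: the optional separators, the context words, the window size, and the scores are hypothetical and do not describe Syntho's actual recognizer.

```python
import re

# Rule from the description: nine digits, starting with 9, fourth digit 7 or 8.
# Allowing a space or hyphen after the 3rd and 5th digits is an assumption.
ITIN_PATTERN = re.compile(r"\b9[0-9]{2}[- ]?[78][0-9][- ]?[0-9]{4}\b")

# Hypothetical context words and scores, for illustration only.
CONTEXT_WORDS = ("itin", "taxpayer", "tax id")
BASE_SCORE = 0.5
CONTEXT_BOOST = 0.35
WINDOW = 40  # characters inspected on each side of a match


def find_itins(text: str) -> list[tuple[str, float]]:
    """Return (match, score) pairs; nearby context words raise the score."""
    results = []
    lowered = text.lower()
    for m in ITIN_PATTERN.finditer(text):
        start = max(0, m.start() - WINDOW)
        nearby = lowered[start:m.end() + WINDOW]
        score = BASE_SCORE
        if any(word in nearby for word in CONTEXT_WORDS):
            score += CONTEXT_BOOST
        results.append((m.group(), score))
    return results
```

The context step matters for entities without a checksum: a bare nine-digit number is ambiguous, so surrounding words such as "ITIN" are what separate a likely identifier from an arbitrary number.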

UK

| Entity Type | Description | Detection Method |
| --- | --- | --- |
| UK_NHS | A UK NHS number is 10 digits. | Pattern match, context and checksum |
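The checksum for a 10-digit NHS number is the publicly documented modulus 11 check, in which the tenth digit is a check digit computed from the first nine. The sketch below shows that check only. The function name `nhs_number_valid` is hypothetical, and whether Syntho implements the check in exactly this way is an assumption.

```python
import re


def nhs_number_valid(value: str) -> bool:
    """Return True if `value` is a 10-digit number passing the modulus 11 check."""
    s = value.replace(" ", "").replace("-", "")
    if not re.fullmatch(r"[0-9]{10}", s):
        return False
    digits = [int(ch) for ch in s]
    # Weight the first nine digits by 10, 9, ..., 2 and sum the products.
    total = sum(d * w for d, w in zip(digits[:9], range(10, 1, -1)))
    check = 11 - (total % 11)
    if check == 11:
        check = 0
    if check == 10:
        # A check value of 10 never corresponds to a valid number.
        return False
    return check == digits[9]
```

For example, `nhs_number_valid("943 476 5919")` returns `True`, while changing the last digit to 8 makes the check fail.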

Adding a custom PII entity

Contact Syntho for instructions on adding a recognizer for a new type of PII entity.
