Default data patterns
Review the default data pattern regular expressions included in Data Discovery. These default data patterns can be used to filter table entries for further classification.
Data Discovery supports using a Named Entity Recognition (NER) model to discover data, such as names, organizations, nationalities, and political affiliations. Data patterns with the type Model use this feature.
The following table lists the default data patterns for Data Discovery.
| Name | Description | Regular Expression | Named Entity Recognition | Keywords | Examples |
|---|---|---|---|---|---|
| Address | A full or partial location identifier, including street names and unit or plot numbers, but excluding city, state, country, and zip code. | | ADDRESS | | |
| City | The name of a city or town from regions and countries around the world. | | CITY | | |
| Country | The name of a sovereign nation or territory. | | COUNTRY | | |
| Date & Time | Absolute or relative dates or periods, or times smaller than a day. | | DATE_TIME | | |
| Job position | A specific role or set of responsibilities within an organization, designated to be filled by an employee. | | JOB_POSITION | | |
| Location | The name of a politically or geographically defined location (cities, provinces, countries, international regions, bodies of water, mountains). | | LOCATION | | |
| Nationality, religious or political groups (NRPs) | A person's nationality, or religious or political group. | | NRP | | |
| Organization | The name of an organization. | | ORGANIZATION | | |
| Person | A full person name, which can include first names, middle names or initials, and last names. | | PERSON | | Fred Luddy, Abel Tuter, Abraham Lincoln |
| Salary | A numeric value representing an individual's earnings, often accompanied by currency symbols. | | SALARY | | |
| State | States, provinces, prefectures, and regions around the world. | | STATE | | |
| Age | A person's age, from 0 to 129 | `\b([0-9]\|[1-9][0-9]\|1[012][0-9])\b` | | age | |
| Date of birth | Date of birth using the DD/MM/YYYY format | `\b[0-3]?[0-9]/[0-3]?[0-9]/(?:[0-9]{2})?[0-9]{2}\b` | | dob, birthday, date of birth | |
| | Standard email address | ``\b[\w!#$%&'*+/=?`{\|}~^-]+(?:\.[\w!#$%&'*+/=?`{\|}~^-]+)*@(?:[a-zA-Z0-9-]+\.)+[a-zA-Z]{2,6}\b`` | | | |
| Vehicle identification number | A vehicle identification number (VIN) | `\b[A-HJ-NPR-Z0-9]{17}\b` | | | |
| IP Address | Standard IP address | | | | |
| Credit Card- Visa | Visa credit card number | `\b4[0-9]{12}(?:[0-9]{3})?\b` | | | |
| Credit Card- American Express | American Express credit card number | `\b3[47][0-9]{13}\b` | | | |
| Credit Card- Mastercard | Mastercard credit card number | `\b(?:5[1-5][0-9]{2}\|222[1-9]\|22[3-9][0-9]\|2[3-6][0-9]{2}\|27[01][0-9]\|2720)[0-9]{12}\b` | | | |
| Credit Card- Diners Club | Diners Club credit card number | `\b3(?:0[0-5]\|[68][0-9])[0-9]{11}\b` | | | |
| Credit Card- Discover | Discover credit card number | `\b6(?:011\|5[0-9]{2})[0-9]{12}\b` | | | |
| Credit Card- CCV | Credit card security number | `\b[0-9]{3,4}\b` | | cvv, verification code, security code | |
| Credit Card- Expire Date | Credit card expiration in MM/YYYY format | `\b(([1-9])\|(0[1-9]\|1[0-2]))\/?([0-9]{4}\|[0-9]{2})\b` | | expire, exp | |
| USA- Social security number | USA social security number | `\b(?!666\|000\|9\d{2})\d{3}-(?!00)\d{2}-(?!0{4})\d{4}\b` | | | |
| USA- Phone Number | USA phone number. Warning: Does not use the USA calling code. | `\b\(?([0-9]{3})\)?[-. ]?([0-9]{3})[-. ]?([0-9]{4})\b` | | | |
| USA- Passport Number | 9-digit USA passport number | `\b[a-zA-Z0-9]\d{8}\b` | | | |
| USA- Taxpayer ID | USA taxpayer ID number | `\b(9\d{2})([ \-]?)([7]\d\|8[0-8])([ \-]?)(\d{4})\b` | | | |
| USA- California State Driver License number | State of California, USA driver license number | `\b[a-zA-Z]\d{7}\b` | | | |
| USA - Bank Routing number | US Bank Routing (ABA) number | `\b((0[0-9])\|(1[0-2])\|(2[1-9])\|(3[0-2])\|(6[1-9])\|(7[0-2])\|80)([0-9]{7})\b` | | | |
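
To try a few of these regular expressions outside Data Discovery, the following Python sketch applies four of the patterns from the table to sample field values. It is an illustration only. The function name `match_patterns`, the `PATTERNS` dictionary, and the sample field names are hypothetical. The way keywords are used here (a pattern with keywords counts only if one of them appears in the field name) is an assumption of this sketch, not a description of how Data Discovery evaluates keywords.

```python
import re

# A few rows taken from the default pattern table: name -> (regex, keywords).
# Assumption of this sketch: a pattern that lists keywords is only considered
# when one of its keywords appears in the field name (case-insensitive).
PATTERNS = {
    "Age": (r"\b([0-9]|[1-9][0-9]|1[012][0-9])\b", ["age"]),
    "Credit Card- Visa": (r"\b4[0-9]{12}(?:[0-9]{3})?\b", []),
    "Credit Card- CCV": (
        r"\b[0-9]{3,4}\b",
        ["cvv", "verification code", "security code"],
    ),
    "USA- Social security number": (
        r"\b(?!666|000|9\d{2})\d{3}-(?!00)\d{2}-(?!0{4})\d{4}\b",
        [],
    ),
}


def match_patterns(field_name, value):
    """Return the names of the patterns whose regex is found in `value`.

    Patterns with keywords are skipped unless a keyword occurs in `field_name`.
    """
    hits = []
    lowered = field_name.lower()
    for name, (regex, keywords) in PATTERNS.items():
        if keywords and not any(keyword in lowered for keyword in keywords):
            continue
        if re.search(regex, value):
            hits.append(name)
    return hits


print(match_patterns("u_ssn", "078-05-1120"))
print(match_patterns("customer_age", "42"))
print(match_patterns("notes", "123"))
```

In this sketch, the value `123` in a field named `notes` produces no match, because the only patterns whose regex fits a short number (Age and Credit Card- CCV) are gated by keywords. Short numeric patterns such as `\b[0-9]{3,4}\b` would otherwise match almost any number, which is presumably why those rows carry keywords.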