Default regular expression data patterns
Review the default data pattern regular expressions included in Data Discovery. These default data patterns can be used to filter table entries for further classification.
Data Discovery supports using a Named Entity Recognition (NER) model to discover data, such as names, organizations, nationalities, and political affiliations. Data patterns with the type Model use this feature.
The following table details the default regular expression patterns for data discovery.
| Name | Description | Regular Expression | Keywords | Examples |
|---|---|---|---|---|
| Age | A person's age between 0-129 | \b([0-9]|[1-9][0-9]|1[012][0-9])\b | age | |
| Date of birth | Date of birth using the DD/MM/YYYY format | \b[0-3]?[0-9]/[0-3]?[0-9]/(?:[0-9]{2})?[0-9]{2}\b | dob, birthday, date of birth | |
| | Standard email address | \b[\w!#$%&'*+/=?`{|}~^-]+(?:\.[\w!#$%&'*+/=?`{|}~^-]+)*@(?:[a-zA-Z0-9-]+\.)+[a-zA-Z]{2,6}\b | | |
| Vehicle identification number | A vehicle identification number (VIN) | \b[A-HJ-NPR-Z0-9]{17}\b | | |
| IP Address | Standard IP address | | | |
| Credit Card- Visa | Visa credit card number | \b4[0-9]{12}(?:[0-9]{3})?\b | | |
| Credit Card- American Express | American Express credit card number | \b3[47][0-9]{13}\b | | |
| Credit Card- Mastercard | Mastercard credit card number | \b(?:5[1-5][0-9]{2}|222[1-9]|22[3-9][0-9]|2[3-6][0-9]{2}|27[01][0-9]|2720)[0-9]{12}\b | | |
| Credit Card- Diners Club | Diners Club credit card number | \b3(?:0[0-5]|[68][0-9])[0-9]{11}\b | | |
| Credit Card- Discover | Discover credit card number | \b6(?:011|5[0-9]{2})[0-9]{12}\b | | |
| Credit Card- CCV | Credit card security number | \b[0-9]{3,4}\b | cvv, verification code, security code | |
| Credit Card- Expire Date | Credit card expiration in MM/YYYY format | \b(([1-9])|(0[1-9]|1[0-2]))\/?([0-9]{4}|[0-9]{2})\b | expire, exp | |
| USA- Social security number | USA citizen social security number | \b(?!666|000|9\d{2})\d{3}-(?!00)\d{2}-(?!0{4})\d{4}\b | | |
| USA- Phone Number | USA phone number. Warning: Does not use the USA calling code. | \b\(?([0-9]{3})\)?[-. ]?([0-9]{3})[-. ]?([0-9]{4})\b | | |
| USA- Passport Number | 9 digit USA passport number | \b[a-zA-Z0-9]\d{8}\b | | |
| USA- Taxpayer ID | USA taxpayer ID number | \b(9\d{2})([ \-]?)([7]\d|8[0-8])([ \-]?)(\d{4})\b | | |
| USA- California State Driver License number | State of California, USA driver license number | \b[a-zA-Z]\d{7}\b | | |
| USA - Bank Routing number | US Bank Routing (ABA) number | \b((0[0-9])|(1[0-2])|(2[1-9])|(3[0-2])|(6[1-9])|(7[0-2])|80)([0-9]{7})\b | | |
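
To get a feel for how these expressions behave, a few of them can be tried against sample values outside the product. The following is a minimal sketch in Python, assuming that Python's `re` module interprets these patterns the same way as the regular expression engine used by Data Discovery, which may not hold in every case. The dictionary labels and the `matching_patterns` helper are hypothetical names used only for illustration; they are not part of Data Discovery.

```python
import re

# Regular expressions copied from the table above.
# The keys are illustrative labels, not Data Discovery identifiers.
PATTERNS = {
    "age": r"\b([0-9]|[1-9][0-9]|1[012][0-9])\b",
    "visa": r"\b4[0-9]{12}(?:[0-9]{3})?\b",
    "amex": r"\b3[47][0-9]{13}\b",
    "ssn": r"\b(?!666|000|9\d{2})\d{3}-(?!00)\d{2}-(?!0{4})\d{4}\b",
    "vin": r"\b[A-HJ-NPR-Z0-9]{17}\b",
}

# Compile once so repeated checks do not re-parse the expressions.
COMPILED = {label: re.compile(rx) for label, rx in PATTERNS.items()}


def matching_patterns(value):
    """Return the labels of every pattern found anywhere in value.

    Hypothetical helper: it only runs the regular expressions and does
    not model keywords or any other Data Discovery behavior.
    """
    return [label for label, rx in COMPILED.items() if rx.search(value)]


if __name__ == "__main__":
    samples = [
        "4111111111111111",   # 16-digit number starting with 4
        "378282246310005",    # 15-digit number starting with 37
        "123-45-6789",        # SSN-shaped value
        "000-12-3456",        # area number 000 is excluded by the SSN pattern
        "1HGCM82633A004352",  # 17-character VIN-shaped value
    ]
    for sample in samples:
        print(sample, "->", matching_patterns(sample))
```

In this sketch the SSN-shaped value `123-45-6789` is reported under both `age` and `ssn`, because the short Age expression also matches the isolated group `123`. Broad expressions such as Age and Credit Card- CCV can therefore match many unrelated values when used on their own.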