What the sensitive information types in Exchange 2013 look for

Applies to: Exchange Server 2013

Data loss prevention (DLP) includes sensitive information types that are ready for you to use in your DLP policies. This topic lists the sensitive information types in Exchange Server 2013, and shows what a DLP policy looks for when it detects each type.

A sensitive information type is defined by a pattern that can be identified by a regular expression or a function. Additionally, corroborative evidence such as keywords and checksums can be used to identify a sensitive information type. Confidence level and proximity are also used in the evaluation process.

ABA Routing Number

Format

9 digits which may be in a formatted or unformatted pattern

Pattern

Formatted:

  • Four digits beginning with 0, 1, 2, 3, 6, 7, or 8
  • A hyphen
  • Four digits
  • A hyphen
  • A digit

Unformatted:

  • 9 consecutive digits beginning with 0, 1, 2, 3, 6, 7, or 8

Checksum

No

Definition

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_aba_routing finds content that matches the pattern.
  • A keyword from Keyword_ABA_Routing is found.
<!-- ABA Routing Number -->
<Entity id="cb353f78-2b72-4c3c-8827-92ebe4f69fdf" patternsProximity="300" recommendedConfidence="75">
      <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_aba_routing" />
        <Match idRef="Keyword_ABA_Routing" />
      </Pattern>
 </Entity>

Keywords

Keyword_ABA_Routing

  • aba
  • aba #
  • aba routing #
  • aba routing number
  • aba#
  • abarouting#
  • aba number
  • abaroutingnumber
  • american bank association routing #
  • american bank association routing number
  • americanbankassociationrouting#
  • americanbankassociationroutingnumber
  • bank routing number
  • bankrouting#
  • bankroutingnumber
  • routing transit number
  • RTN

Australia bank account number

Format

6-10 digits with or without a bank state branch number

Pattern

Account number is 6-10 digits.

Australia bank state branch number:

  • Three digits
  • A hyphen
  • Three digits

Checksum

No

Definition

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_australia_bank_account_number finds content that matches the pattern..
  • A keyword from Keyword_australia_bank_account_number is found.
  • The regular expression Regex_australia_bank_account_number_bsb finds content that matches the pattern.

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_australia_bank_account_number finds content that matches the pattern..
  • A keyword from Keyword_australia_bank_account_number is found.
<!-- Australia Bank Account Number -->
<Entity id="74a54de9-2a30-4aa0-a8aa-3d9327fc07c7" patternsProximity="300" recommendedConfidence="75">
  <Pattern confidenceLevel="85">
        <IdMatch idRef="Regex_australia_bank_account_number" />
        <Match idRef="Keyword_australia_bank_account_number" />
        <Match idRef="Regex_australia_bank_account_number_bsb" />
  </Pattern>
  <Pattern confidenceLevel="75">
        <IdMatch idRef="Regex_australia_bank_account_number" />
        <Match idRef="Keyword_australia_bank_account_number" />
  </Pattern>
 </Entity>

Keywords

Keyword_australia_bank_account_number

  • swift bank code
  • correspondent bank
  • base currency
  • usa account
  • holder address
  • bank address
  • information account
  • fund transfers
  • bank charges
  • bank details
  • banking information
  • full names
  • iaea

Australia driver's license number

Format

Nine letters and digits

Pattern

Nine letters and digits:

  • Two digits or letters (not case sensitive)
  • Two digits
  • Five digits or letters (not case sensitive)

OR

  • 1-2 optional letters (not case sensitive)
  • 4-9 digits

OR

  • Nine digits or letters (not case sensitive)

Checksum

No

Definition

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_australia_drivers_license_number finds content that matches the pattern.
  • A keyword from Keyword_australia_drivers_license_number is found.
  • No keyword from Keyword_australia_drivers_license_number_exclusions is found.
<!-- Australia Drivers License Number -->
<Entity id="1cbbc8f5-9216-4392-9eb5-5ac2298d1356" patternsProximity="300" recommendedConfidence="75">
   <Pattern confidenceLevel="75">
        <IdMatch idRef="Regex_australia_drivers_license_number" />
        <Match idRef="Keyword_australia_drivers_license_number" />
        <Any minMatches="0" maxMatches="0">
          <Match idRef="Keyword_australia_drivers_license_number_exclusions" />
        </Any>
  </Pattern>
</Entity>

Keywords

Keyword_australia_drivers_license_number

  • international driving permits
  • australian automobile association
  • international driving permit
  • DriverLicence
  • DriverLicences
  • Driver Lic
  • Driver Licence
  • Driver Licences
  • DriversLic
  • DriversLicence
  • DriversLicences
  • Drivers Lic
  • Drivers Lics
  • Drivers Licence
  • Drivers Licences
  • Driver'Lic
  • Driver'Lics
  • Driver'Licence
  • Driver'Licences
  • Driver' Lic
  • Driver' Lics
  • Driver' Licence
  • Driver' Licences
  • Driver'sLic
  • Driver'sLics
  • Driver'sLicence
  • Driver'sLicences
  • Driver's Lic
  • Driver's Lics
  • Driver's Licence
  • Driver's Licences
  • DriverLic#
  • DriverLics#
  • DriverLicence#
  • DriverLicences#
  • Driver Lic#
  • Driver Lics#
  • Driver Licence#
  • Driver Licences#
  • DriversLic#
  • DriversLics#
  • DriversLicence#
  • DriversLicences#
  • Drivers Lic#
  • Drivers Lics#
  • Drivers Licence#
  • Drivers Licences#
  • Driver'Lic#
  • Driver'Lics#
  • Driver'Licence#
  • Driver'Licences#
  • Driver' Lic#
  • Driver' Lics#
  • Driver' Licence#
  • Driver' Licences#
  • Driver'sLic#
  • Driver'sLics#
  • Driver'sLicence#
  • Driver'sLicences#
  • Driver's Lic#
  • Driver's Lics#
  • Driver's Licence#
  • Driver's Licences#

Keyword_australia_drivers_license_number_exclusions

  • aaa
  • DriverLicense
  • DriverLicenses
  • Driver License
  • Driver Licenses
  • DriversLicense
  • DriversLicenses
  • Drivers License
  • Drivers Licenses
  • Driver'License
  • Driver'Licenses
  • Driver' License
  • Driver' Licenses
  • Driver'sLicense
  • Driver'sLicenses
  • Driver's License
  • Driver's Licenses
  • DriverLicense#
  • DriverLicenses#
  • Driver License#
  • Driver Licenses#
  • DriversLicense#
  • DriversLicenses#
  • Drivers License#
  • Drivers Licenses#
  • Driver'License#
  • Driver'Licenses#
  • Driver' License#
  • Driver' Licenses#
  • Driver'sLicense#
  • Driver'sLicenses#
  • Driver's License#
  • Driver's Licenses#

Australia Medical Account Number

Format

10-11 digits

Pattern

10-11 digits:

  • First digit is in the range 2-6
  • Ninth digit is a check digit
  • Tenth digit is the issue digit
  • Eleventh digit (optional) is the individual number

Checksum

Yes

Definition

A DLP policy is 95% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_australian_medical_account_number finds content that matches the pattern.
  • A keyword from Keyword_Australia_Medical_Account_Number is found.
  • The checksum passes.

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_australian_medical_account_number finds content that matches the pattern.
  • The checksum passes.
  <!-- Australia Medical Account Number -->
<Entity id="104a99a0-3d3b-4542-a40d-ab0b9e1efe63" recommendedConfidence="85" patternsProximity="300">
    <Pattern confidenceLevel="95">
     <IdMatch idRef="Func_australian_medical_account_number"/>
     <Any minMatches="1">
     <Match idRef="Keyword_Australia_Medical_Account_Number"/>
     </Any>
  </Pattern>
<Pattern confidenceLevel="85">
     <IdMatch idRef="Func_australian_medical_account_number"/>
     <Any minMatches="0" maxMatches="0">
  <Match idRef="Keyword_Australia_Medical_Account_Number"/>
     </Any>
  </Pattern>
</Entity>

Keywords

Keyword_Australia_Medical_Account_Number

  • bank account details
  • medicare payments
  • mortgage account
  • bank payments
  • information branch
  • credit card loan
  • department of human services
  • local service
  • medicare

Australia Passport Number

Format

A letter followed by seven digits

Pattern

A letter (not case sensitive) followed by seven digits

Checksum

No

Definition

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_australia_passport_number finds content that matches the pattern.
  • A keyword from Keyword_passport or Keyword_australia_passport_number is found.
<!-- Australia Passport Number -->
<Entity id="29869db6-602d-4853-ab93-3484f905df50" patternsProximity="300" recommendedConfidence="75">
  <Pattern confidenceLevel="75">
        <IdMatch idRef="Regex_australia_passport_number" />
        <Any minMatches="1">
          <Match idRef="Keyword_passport" />
          <Match idRef="Keyword_australia_passport_number" />
        </Any>
   </Pattern>
</Entity>

Keywords

Keyword_passport

  • Passport Number
  • Passport No
  • Passport #
  • Passport#
  • PassportID
  • Passportno
  • passportnumber
  • パスポート
  • パスポート番号
  • パスポートのNum
  • パスポート #
  • Numéro de passeport
  • Passeport n °
  • Passeport Non
  • Passeport #
  • Passeport#
  • PasseportNon
  • Passeportn °

Keyword_australia_passport_number

  • passport
  • passport details
  • immigration and citizenship
  • commonwealth of australia
  • department of immigration
  • residential address
  • department of immigration and citizenship
  • visa
  • national identity card
  • passport number
  • travel document
  • issuing authority

Australia Tax File Number

Format

8-9 digits

Pattern

8-9 digits typically presented with spaces as follows:

  • Three digits
  • An optional space
  • Three digits
  • An optional space
  • 2-3 digits where the last digit is a check digit

Checksum

Yes

Definition

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_australian_tax_file_number finds content that matches the pattern.
  • No keyword from Keyword_Australia_Tax_File_Number or Keyword_number_exclusions is found.
  • The checksum passes.
<!-- Australia Tax File Number -->
<Entity id="e29bc95f-ff70-4a37-aa01-04d17360a4c5" patternsProximity="300" recommendedConfidence="85">
    <Pattern confidenceLevel="95">
        <IdMatch idRef="Func_australian_tax_file_number" />
        <Any minMatches="1">
          <Match idRef="Keyword_Australia_Tax_File_Number" />
        </Any>
        <Any minMatches="0" maxMatches="0">
          <Match idRef="Keyword_number_exclusions" />
        </Any>
  </Pattern>
  <Pattern confidenceLevel="85">
        <IdMatch idRef="Func_australian_tax_file_number" />
        <Any minMatches="0" maxMatches="0">
          <Match idRef="Keyword_Australia_Tax_File_Number" />
          <Match idRef="Keyword_number_exclusions" />
        </Any>
  </Pattern>
</Entity>

Keywords

Keyword_Australia_Tax_File_Number

  • australian business number
  • marginal tax rate
  • medicare levy
  • portfolio number
  • service veterans
  • withholding tax
  • individual tax return
  • tax file number

Keyword_number_exclusions

  • 00000000
  • 11111111
  • 22222222
  • 33333333
  • 44444444
  • 55555555
  • 66666666
  • 77777777
  • 88888888
  • 99999999
  • 000000000
  • 111111111
  • 222222222
  • 333333333
  • 444444444
  • 555555555
  • 666666666
  • 777777777
  • 888888888
  • 999999999
  • 0000000000
  • 1111111111
  • 2222222222
  • 3333333333
  • 4444444444
  • 5555555555
  • 6666666666
  • 7777777777
  • 8888888888
  • 9999999999

Canada Bank Account Number

Format

Seven or twelve digits

Pattern

A Canada Bank Account Number is seven or twelve digits.

A Canada bank account transit number is:

  • Five digits
  • A hyphen
  • Three digits OR
  • A zero "0"
  • Eight digits

Checksum

No

Definition

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_canada_bank_account_number finds content that matches the pattern.
  • A keyword from Keyword_canada_bank_account_number is found.
  • The regular expression Regex_canada_bank_account_transit_number finds content that matches the pattern.

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_canada_bank_account_number finds content that matches the pattern.
  • A keyword from Keyword_canada_bank_account_number is found.
<!-- Canada Bank Account Number -->
<Entity id="552e814c-cb50-4d94-bbaa-bb1d1ffb34de" patternsProximity="300" recommendedConfidence="75">
  <Pattern confidenceLevel="85">
        <IdMatch idRef="Regex_canada_bank_account_number" />
        <Match idRef="Keyword_canada_bank_account_number" />
        <Match idRef="Regex_canada_bank_account_transit_number" />
   </Pattern>
   <Pattern confidenceLevel="75">
        <IdMatch idRef="Regex_canada_bank_account_number" />
        <Match idRef="Keyword_canada_bank_account_number" />
   </Pattern>
</Entity>

Keywords

Keyword_canada_bank_account_number

  • canada savings bonds
  • canada revenue agency
  • canadian financial institution
  • direct deposit form
  • canadian citizen
  • legal representative
  • notary public
  • commissioner for oaths
  • child care benefit
  • universal child care
  • canada child tax benefit
  • income tax benefit
  • harmonized sales tax
  • social insurance number
  • income tax refund
  • child tax benefit
  • territorial payments
  • institution number
  • deposit request
  • banking information
  • direct deposit

Canada driver's license number

Format

Varies by province

Pattern

Various patterns covering Alberta, British Columbia, Manitoba, New Brunswick, Newfoundland/Labrador, Nova Scotia, Ontario, Prince Edward Island, Quebec, and Saskatchewan

Checksum

No

Definition

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_[province_name]_drivers_license_number finds content that matches the pattern.
  • A keyword from Keyword_[province_name]_drivers_license_name is found.
  • A keyword from Keyword_canada_drivers_license is found.
<!-- Canada Driver's License Number -->
    <Entity id="37186abb-8e48-4800-ad3c-e3d1610b3db0" patternsProximity="300" recommendedConfidence="75">
      <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_alberta_drivers_license_number" />
        <Match idRef="Keyword_alberta_drivers_license_name" />
        <Match idRef="Keyword_canada_drivers_license" />
      </Pattern>
      <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_british_columbia_drivers_license_number" />
        <Match idRef="Keyword_british_columbia_drivers_license_name" />
        <Match idRef="Keyword_canada_drivers_license" />
      </Pattern>
      <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_manitoba_drivers_license_number" />
        <Match idRef="Keyword_manitoba_drivers_license_name" />
        <Match idRef="Keyword_canada_drivers_license" />
      </Pattern>
      <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_new_brunswick_drivers_license_number" />
        <Match idRef="Keyword_new_brunswick_drivers_license_name" />
        <Match idRef="Keyword_canada_drivers_license" />
      </Pattern>
      <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_newfoundland_labrador_drivers_license_number" />
        <Match idRef="Keyword_newfoundland_labrador_drivers_license_name" />
        <Match idRef="Keyword_canada_drivers_license" />
      </Pattern>
      <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_nova_scotia_drivers_license_number" />
        <Match idRef="Keyword_nova_scotia_drivers_license_name" />
        <Match idRef="Keyword_canada_drivers_license" />
      </Pattern>
      <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_ontario_drivers_license_number" />
        <Match idRef="Keyword_ontario_drivers_license_name" />
        <Match idRef="Keyword_canada_drivers_license" />
      </Pattern>
      <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_prince_edward_island_drivers_license_number" />
        <Match idRef="Keyword_prince_edward_island_drivers_license_name" />
        <Match idRef="Keyword_canada_drivers_license" />
      </Pattern>
      <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_quebec_drivers_license_number" />
        <Match idRef="Keyword_quebec_drivers_license_name" />
        <Match idRef="Keyword_canada_drivers_license" />
      </Pattern>
      <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_saskatchewan_drivers_license_number" />
        <Match idRef="Keyword_saskatchewan_drivers_license_name" />
        <Match idRef="Keyword_canada_drivers_license" />
      </Pattern>
    </Entity>

Keywords

Keyword_[province_name]_drivers_license_name

  • The province abbreviation, for example AB
  • The province name, for example Alberta

Keyword_canada_drivers_license

  • DL
  • DLS
  • CDL
  • CDLS
  • DriverLic
  • DriverLics
  • DriverLicense
  • DriverLicenses
  • DriverLicence
  • DriverLicences
  • Driver Lic
  • Driver Lics
  • Driver License
  • Driver Licenses
  • Driver Licence
  • Driver Licences
  • DriversLic
  • DriversLics
  • DriversLicence
  • DriversLicences
  • DriversLicense
  • DriversLicenses
  • Drivers Lic
  • Drivers Lics
  • Drivers License
  • Drivers Licenses
  • Drivers Licence
  • Drivers Licences
  • Driver'Lic
  • Driver'Lics
  • Driver'License
  • Driver'Licenses
  • Driver'Licence
  • Driver'Licences
  • Driver' Lic
  • Driver' Lics
  • Driver' License
  • Driver' Licenses
  • Driver' Licence
  • Driver' Licences
  • Driver'sLic
  • Driver'sLics
  • Driver'sLicense
  • Driver'sLicenses
  • Driver'sLicence
  • Driver'sLicences
  • Driver's Lic
  • Driver's Lics
  • Driver's License
  • Driver's Licenses
  • Driver's Licence
  • Driver's Licences
  • Permis de Conduire
  • id
  • ids
  • idcard number
  • idcard numbers
  • idcard #
  • idcard #s
  • idcard card
  • idcard cards
  • idcard
  • identification number
  • identification numbers
  • identification #
  • identification #s
  • identification card
  • identification cards
  • identification
  • DL#
  • DLS#
  • CDL#
  • CDLS#
  • DriverLic#
  • DriverLics#
  • DriverLicense#
  • DriverLicenses#
  • DriverLicence#
  • DriverLicences#
  • Driver Lic#
  • Driver Lics#
  • Driver License#
  • Driver Licenses#
  • Driver License#
  • Driver Licences#
  • DriversLic#
  • DriversLics#
  • DriversLicense#
  • DriversLicenses#
  • DriversLicence#
  • DriversLicences#
  • Drivers Lic#
  • Drivers Lics#
  • Drivers License#
  • Drivers Licenses#
  • Drivers Licence#
  • Drivers Licences#
  • Driver'Lic#
  • Driver'Lics#
  • Driver'License#
  • Driver'Licenses#
  • Driver'Licence#
  • Driver'Licences#
  • Driver' Lic#
  • Driver' Lics#
  • Driver' License#
  • Driver' Licenses#
  • Driver' Licence#
  • Driver' Licences#
  • Driver'sLic#
  • Driver'sLics#
  • Driver'sLicense#
  • Driver'sLicenses#
  • Driver'sLicence#
  • Driver'sLicences#
  • Driver's Lic#
  • Driver's Lics#
  • Driver's License#
  • Driver's Licenses#
  • Driver's Licence#
  • Driver's Licences#
  • Permis de Conduire#
  • id#
  • ids#
  • idcard card#
  • idcard cards#
  • idcard#
  • identification card#
  • identification cards#
  • identification#

Canada Health Service Number

Format

10 digits

Pattern

10 digits

Checksum

No

Definition

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_canada_health_service_number finds content that matches the pattern.
  • A keyword from Keyword_canada_health_service_number is found.
<!-- Canada Health Service Number -->
<Entity id="59c0bf39-7fab-482c-af25-00faa4384c94" patternsProximity="300" recommendedConfidence="75">
  <Pattern confidenceLevel="75">
        <IdMatch idRef="Regex_canada_health_service_number" />
        <Any minMatches="1">
          <Match idRef="Keyword_canada_health_service_number" />
        </Any>
  </Pattern>
</Entity>

Keywords

Keyword_canada_health_service_number

  • personal health number
  • patient information
  • health services
  • speciality services
  • automobile accident
  • patient hospital
  • psychiatrist
  • workers compensation
  • disability

Canada passport number

Format

Two uppercase letters followed by six digits

Pattern

Two uppercase letters followed by six digits

Checksum

No

Definition

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_canada_passport_number finds content that matches the pattern.
  • A keyword from Keyword_canada_passport_number or Keyword_passport is found.
<!-- Canada Passport Number -->
<Entity id="14d0db8b-498a-43ed-9fca-f6097ae687eb" patternsProximity="300" recommendedConfidence="75">
  <Pattern confidenceLevel="75">
        <IdMatch idRef="Regex_canada_passport_number" />
        <Any minMatches="1">
          <Match idRef="Keyword_canada_passport_number" />
          <Match idRef="Keyword_passport" />
        </Any>
  </Pattern>
</Entity>

Keywords

Keyword_canada_passport_number

  • canadian citizenship
  • canadian passport
  • passport application
  • passport photos
  • certified translator
  • canadian citizens
  • processing times
  • renewal application

Keyword_passport

  • Passport Number
  • Passport No
  • Passport #
  • Passport#
  • PassportID
  • Passportno
  • passportnumber
  • パスポート
  • パスポート番号
  • パスポートのNum
  • パスポート#
  • Numéro de passeport
  • Passeport n °
  • Passeport Non
  • Passeport #
  • Passeport#
  • PasseportNon
  • Passeportn °

Canada Personal Health Identification Number (PHIN)

Format

Nine digits

Pattern

Nine digits

Checksum

No

Definition

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_canada_phin finds content that matches the pattern.
  • At least two keywords from Keyword_canada_phin or Keyword_canada_provinces are found.
<!-- Canada PHIN -->
<Entity id="722e12ac-c89a-4ec8-a1b7-fea3469f89db" patternsProximity="300" recommendedConfidence="75">
  <Pattern confidenceLevel="75">
        <IdMatch idRef="Regex_canada_phin" />
        <Any minMatches="2">
          <Match idRef="Keyword_canada_phin" />
          <Match idRef="Keyword_canada_provinces" />
        </Any>
  </Pattern>
</Entity>

Keywords

Keyword_canada_phin

  • social insurance number
  • health information act
  • income tax information
  • manitoba health
  • health registration
  • prescription purchases
  • benefit eligibility
  • personal health
  • power of attorney
  • registration number
  • personal health number
  • practitioner referral
  • wellness professional
  • patient referral
  • health and wellness

Keyword_canada_provinces

  • Nunavut
  • Quebec
  • Northwest Territories
  • Ontario
  • British Columbia
  • Alberta
  • Saskatchewan
  • Manitoba
  • Yukon
  • Newfoundland and Labrador
  • New Brunswick
  • Nova Scotia
  • Prince Edward Island
  • Canada

Canada Social Insurance Number

Format

Nine digits with optional hyphens or spaces

Pattern

Formatted:

  • Three digits
  • A hyphen or space
  • Three digits
  • A hyphen or space
  • Three digits

Unformatted:

  • Nine digits

Checksum

Yes

Definition

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_canadian_sin finds content that matches the pattern.

  • At least two of any combination of the following:

    • A keyword from Keyword_sin is found.
    • A keyword from Keyword_sin_collaborative is found.
    • The function Func_eu_date finds a date in the right date format.
  • The checksum passes.

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_unformatted_canadian_sin finds content that matches the pattern.
  • A keyword from Keyword_sin is found.
  • The checksum passes.
<!-- Canada Social Insurance Number -->
<Entity id="a2f29c85-ecb8-4514-a610-364790c0773e" patternsProximity="300" recommendedConfidence="75">
  <Pattern confidenceLevel="85">
        <IdMatch idRef="Func_canadian_sin" />
        <Any minMatches="2">
          <Match idRef="Keyword_sin" />
          <Match idRef="Keyword_sin_collaborative" />
          <Match idRef="Func_eu_date" />
        </Any>
  </Pattern>
  <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_unformatted_canadian_sin" />
        <Match idRef="Keyword_sin" />
  </Pattern>
</Entity>

Keywords

Keyword_sin

  • sin
  • social insurance
  • numero d'assurance sociale
  • sins
  • ssn
  • ssns
  • social security
  • numero d'assurance social
  • national identification number
  • national id
  • sin#
  • soc ins
  • social ins

Keyword_sin_collaborative

  • driver's license
  • drivers license
  • driver's licence
  • drivers licence
  • DOB
  • Birthdate
  • Birthday
  • Date of Birth

Drug Enforcement Agency (DEA) Number

Format

Two letters followed by seven digits

Pattern

Pattern must include all of the following:

  • One letter (not case sensitive) from this set of possible letters: abcdefghjklmnprstux, which is a registrant code
  • One letter (not case sensitive), which is the first letter of the registrant's last name
  • Seven digits, the last of which is the check digit

Checksum

Yes

Definition

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_dea_number finds content that matches the pattern.
  • The checksum passes.
<!-- DEA Number -->
<Entity id="9a5445ad-406e-43eb-8bd7-cac17ab6d0e4" recommendedConfidence="85" patternsProximity="300">
  <Pattern confidenceLevel="85">
     <IdMatch idRef="Func_dea_number"/>
  </Pattern>
</Entity>

Keywords

None

EU Debit Card Number

Format

16 digits

Pattern

Very complex and robust pattern

Checksum

Yes

Definition

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_eu_debit_card finds content that matches the pattern.

  • At least one of the following is true:

    • A keyword from Keyword_eu_debit_card is found.
    • A keyword from Keyword_card_terms_dict is found.
    • A keyword from Keyword_card_security_terms_dict is found.
    • A keyword from Keyword_card_expiration_terms_dict is found.
    • The function Func_expiration_date finds a date in the right date format.
  • The checksum passes.

    <!-- EU Debit Card Number -->
    <Entity id="0e9b3178-9678-47dd-a509-37222ca96b42" patternsProximity="300" recommendedConfidence="85">
      <Pattern confidenceLevel="85">
        <IdMatch idRef="Func_eu_debit_card" />
        <Any minMatches="1">
          <Match idRef="Keyword_eu_debit_card" />
          <Match idRef="Keyword_card_terms_dict" />
          <Match idRef="Keyword_card_security_terms_dict" />
          <Match idRef="Keyword_card_expiration_terms_dict" />
          <Match idRef="Func_expiration_date" />
        </Any>
      </Pattern>
    </Entity>

Keywords

Keyword_eu_debit_card

  • account number
  • card number
  • card no.
  • security number
  • cc#

Keyword_card_terms_dict

  • acct nbr
  • acct num
  • acct no
  • american express
  • americanexpress
  • americano espresso
  • amex
  • atm card
  • atm cards
  • atm kaart
  • atmcard
  • atmcards
  • atmkaart
  • atmkaarten
  • bancontact
  • bank card
  • bankkaart
  • card holder
  • card holders
  • card num
  • card number
  • card numbers
  • card type
  • cardano numerico
  • cardholder
  • cardholders
  • cardnumber
  • cardnumbers
  • carta bianca
  • carta credito
  • carta di credito
  • cartao de credito
  • cartao de crédito
  • cartao de debito
  • cartao de débito
  • carte bancaire
  • carte blanche
  • carte bleue
  • carte de credit
  • carte de crédit
  • carte di credito
  • carteblanche
  • cartão de credito
  • cartão de crédito
  • cartão de debito
  • cartão de débito
  • cb
  • ccn
  • check card
  • check cards
  • checkcard
  • checkcards
  • chequekaart
  • cirrus
  • cirrus-edc-maestro
  • controlekaart
  • controlekaarten
  • credit card
  • credit cards
  • creditcard
  • creditcards
  • debetkaart
  • debetkaarten
  • debit card
  • debit cards
  • debitcard
  • debitcards
  • debito automatico
  • diners club
  • dinersclub
  • discover
  • discover card
  • discover cards
  • discovercard
  • discovercards
  • débito automático
  • edc
  • eigentümername
  • european debit card
  • hoofdkaart
  • hoofdkaarten
  • in viaggio
  • japanese card bureau
  • japanse kaartdienst
  • jcb
  • kaart
  • kaart num
  • kaartaantal
  • kaartaantallen
  • kaarthouder
  • kaarthouders
  • karte
  • karteninhaber
  • karteninhabers
  • kartennr
  • kartennummer
  • kreditkarte
  • kreditkarten-nummer
  • kreditkarteninhaber
  • kreditkarteninstitut
  • kreditkartennummer
  • kreditkartentyp
  • maestro
  • master card
  • master cards
  • mastercard
  • mastercards
  • mc
  • mister cash
  • n carta
  • carta
  • no de tarjeta
  • no do cartao
  • no do cartão
  • no. de tarjeta
  • no. do cartao
  • no. do cartão
  • nr carta
  • nr. carta
  • numeri di scheda
  • numero carta
  • numero de cartao
  • numero de carte
  • numero de cartão
  • numero de tarjeta
  • numero della carta
  • numero di carta
  • numero di scheda
  • numero do cartao
  • numero do cartão
  • numéro de carte
  • nº carta
  • nº de carte
  • nº de la carte
  • nº de tarjeta
  • nº do cartao
  • nº do cartão
  • nº. do cartão
  • número de cartao
  • número de cartão
  • número de tarjeta
  • número do cartao
  • scheda dell'assegno
  • scheda dell'atmosfera
  • scheda dell'atmosfera
  • scheda della banca
  • scheda di controllo
  • scheda di debito
  • scheda matrice
  • schede dell'atmosfera
  • schede di controllo
  • schede di debito
  • schede matrici
  • scoprono la scheda
  • scoprono le schede
  • solo
  • supporti di scheda
  • supporto di scheda
  • switch
  • tarjeta atm
  • tarjeta credito
  • tarjeta de atm
  • tarjeta de credito
  • tarjeta de debito
  • tarjeta debito
  • tarjeta no
  • tarjetahabiente
  • tipo della scheda
  • ufficio giapponese della
  • scheda
  • v pay
  • v-pay
  • visa
  • visa plus
  • visa electron
  • visto
  • visum
  • vpay

Keyword_card_security_terms_dict

  • card identification number
  • card verification
  • cardi la verifica
  • cid
  • cod seg
  • cod seguranca
  • cod segurança
  • cod sicurezza
  • cod. seg
  • cod. seguranca
  • cod. segurança
  • cod. sicurezza
  • codice di sicurezza
  • codice di verifica
  • codigo
  • codigo de seguranca
  • codigo de segurança
  • crittogramma
  • cryptogram
  • cryptogramme
  • cv2
  • cvc
  • cvc2
  • cvn
  • cvv
  • cvv2
  • cód seguranca
  • cód segurança
  • cód. seguranca
  • cód. segurança
  • código
  • código de seguranca
  • código de segurança
  • de kaart controle
  • geeft nr uit
  • issue no
  • issue number
  • kaartidentificatienummer
  • kreditkartenprufnummer
  • kreditkartenprüfnummer
  • kwestieaantal
  • no. dell'edizione
  • no. di sicurezza
  • numero de securite
  • numero de verificacao
  • numero dell'edizione
  • numero di identificazione della
  • scheda
  • numero di sicurezza
  • numero van veiligheid
  • numéro de sécurité
  • nº autorizzazione
  • número de verificação
  • perno il blocco
  • pin block
  • prufziffer
  • prüfziffer
  • security code
  • security no
  • security number
  • sicherheits kode
  • sicherheitscode
  • sicherheitsnummer
  • speldblok
  • veiligheid nr
  • veiligheidsaantal
  • veiligheidscode
  • veiligheidsnummer
  • verfalldatum

Keyword_card_expiration_terms_dict

  • ablauf
  • data de expiracao
  • data de expiração
  • data del exp
  • data di exp
  • data di scadenza
  • data em que expira
  • data scad
  • data scadenza
  • date de validité
  • datum afloop
  • datum van exp
  • de afloop
  • espira
  • espira
  • exp date
  • exp datum
  • expiration
  • expire
  • expires
  • expiry
  • fecha de expiracion
  • fecha de venc
  • gultig bis
  • gultigkeitsdatum
  • gültig bis
  • gültigkeitsdatum
  • la scadenza
  • scadenza
  • valable
  • validade
  • valido hasta
  • valor
  • venc
  • vencimento
  • vencimiento
  • verloopt
  • vervaldag
  • vervaldatum
  • vto
  • válido hasta

Finland National ID

This sensitive information type entity is included in the EU National Identification Number sensitive information type and is available as a stand alone sensitive information type entity.

Format

Six digits plus a character indicating a century plus three digits plus a check digit

Pattern

Pattern must include all of the following:

  • Six digits in the format format DDMMYY which are a date of birth
  • Century marker (either '-', '+' or 'a')
  • Three-digit personal identification number
  • A digit or letter (case insensitive) which is a check digit

Checksum

Yes

Definition

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_finnish_national_id finds content that matches the pattern.
  • A keyword from Keyword_finnish_national_id is found.
  • The checksum passes.
<!-- Finnish National ID-->
<Entity id="338FD995-4CB5-4F87-AD35-79BD1DD926C1" patternsProximity="300" recommendedConfidence="85">
  <Pattern confidenceLevel="85">
          <IdMatch idRef="Func_finnish_national_id" />
          <Match idRef="Keyword_finnish_national_id" />
  </Pattern>
</Entity>

Keywords

Keyword_finnish_national_id

  • ainutlaatuinen henkilökohtainen tunnus
  • henkilökohtainen tunnus
  • henkilötunnus
  • henkilötunnusnumero#
  • henkilötunnusnumero
  • hetu
  • id no
  • id number
  • identification number
  • identiteetti numero
  • identity number
  • idnumber
  • kansallinen henkilötunnus
  • kansallisen henkilökortin
  • national id card
  • national id no.
  • personal id
  • personal identity code
  • personalidnumber#
  • personbeteckning
  • personnummer
  • social security number
  • sosiaaliturvatunnus
  • tax id
  • tax identification no
  • tax identification number
  • tax no#
  • tax no
  • tax number
  • tax registration number
  • taxid#
  • taxidno#
  • taxidnumber#
  • taxno#
  • taxnumber#
  • taxnumber
  • tin id
  • tin no
  • tin#
  • tunnistenumero
  • tunnus numero
  • tunnusluku
  • tunnusnumero
  • verokortti
  • veronumero
  • verotunniste
  • verotunnus

France Driver's License Number

Format

12 digits

Pattern

12 digits with validation to discount similar patterns such as French telephone numbers

Checksum

No

Definition

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_french_drivers_license finds content that matches the pattern.

  • At least one of the following is true:

    • A keyword from Keyword_french_drivers_license is found.
    • The function Func_eu_date finds a date in the right date format.
<!-- France Driver's License Number -->
<Entity id="18e55a36-a01b-4b0f-943d-dc10282a1824" patternsProximity="300" recommendedConfidence="75">
  <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_french_drivers_license" />
        <Any minMatches="1">
          <Match idRef="Keyword_french_drivers_license" />
          <Match idRef="Func_eu_date" />
        </Any>
  </Pattern>
</Entity>

Keywords

Keyword_french_drivers_license

  • drivers licence
  • drivers license
  • driving licence
  • driving license
  • permis de conduire
  • licence number
  • license number
  • licence numbers
  • license numbers

France National ID Card (CNI)

Format

12 digits

Pattern

12 digits

Checksum

No

Definition

A DLP policy is 65% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_france_cni finds content that matches the pattern.
<!-- France CNI -->
<Entity id="f741ac74-1bc0-4665-b69b-f0c7f927c0c4" patternsProximity="300" recommendedConfidence="65">
  <Pattern confidenceLevel="65">
        <IdMatch idRef="Regex_france_cni" />
  </Pattern>
</Entity>

Keywords

None

France Passport Number

Format

Nine digits and letters

Pattern

Nine digits and letters:

  • Two digits
  • Two letters (not case sensitive)
  • Five digits

Checksum

No

Definition

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_fr_passport finds content that matches the pattern.
  • A keyword from Keyword_passport is found.
<!-- France Passport Number -->
<Entity id="3008b884-8c8c-4cd8-a289-99f34fc7ff5d" patternsProximity="300" recommendedConfidence="75">
  <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_fr_passport" />
        <Match idRef="Keyword_passport" />
  </Pattern>
</Entity>

Keywords

Keyword_passport

  • Passport Number
  • Passport No
  • Passport #
  • Passport#
  • PassportID
  • Passportno
  • passportnumber
  • パスポート
  • パスポート番号
  • パスポートのNum
  • パスポート #
  • Numéro de passeport
  • Passeport n °
  • Passeport Non
  • Passeport #
  • Passeport#
  • PasseportNon
  • Passeportn °

France Social Security Number (INSEE)

Format

15 digits

Pattern

Must match one of two patterns:

  • 13 digits followed by a space followed by two digits or
  • 15 consecutive digits

Checksum

Yes

Definition

A DLP policy is 95% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_french_insee or Func_fr_insee finds content that matches the pattern.
  • A keyword from Keyword_fr_insee is found.
  • The checksum passes.

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_french_insee or Func_fr_insee finds content that matches the pattern.
  • No keyword from Keyword_fr_insee is found.
  • The checksum passes.
<!-- France INSEE -->
<Entity id="71f62b97-efe0-4aa1-aa49-e14de253619d" patternsProximity="300" recommendedConfidence="85">
  <Pattern confidenceLevel="95">
        <IdMatch idRef="Func_french_insee" />
        <Match idRef="Func_fr_insee" />
        <Any minMatches="1">
          <Match idRef="Keyword_fr_insee" />
        </Any>
  </Pattern>
  <Pattern confidenceLevel="85">
        <IdMatch idRef="Func_french_insee" />
        <Match idRef="Func_fr_insee" />
        <Any minMatches="0" maxMatches="0">
          <Match idRef="Keyword_fr_insee" />
        </Any>
  </Pattern>
</Entity>

Keywords

Keyword_fr_insee

  • insee
  • securité sociale
  • securite sociale
  • national id
  • national identification
  • numéro d'identité
  • no d'identité
  • no. d'identité
  • numero d'identite
  • no d'identite
  • no. d'identite
  • social security number
  • social security code
  • social insurance number
  • le numéro d'identification nationale
  • d'identité nationale
  • numéro de sécurité sociale
  • le code de la sécurité sociale
  • numéro d'assurance sociale
  • numéro de sécu
  • code sécu

German Driver's License Number

Format

Combination of 11 digits and letters

Pattern

11 digits and letters (not case sensitive):

  • A digit or letter
  • Two digits
  • Six digits or letters
  • A digit
  • A digit or letter

Checksum

Yes

Definition

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_german_drivers_license finds content that matches the pattern.

  • At least one of the following is true:

    • A keyword from Keyword_german_drivers_license_number is found.
    • A keyword from Keyword_german_drivers_license_collaborative is found.
    • A keyword from Keyword_german_drivers_license is found.
  • The checksum passes.

<!-- Germany Driver's License Number -->
<Entity id="91da9335-1edb-45b7-a95f-5fe41a16c63c" patternsProximity="300" recommendedConfidence="75">
  <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_german_drivers_license" />
        <Any minMatches="1">
          <Match idRef="Keyword_german_drivers_license_number" />
          <Match idRef="Keyword_german_drivers_license_collaborative" />
          <Match idRef="Keyword_german_drivers_license" />
        </Any>
  </Pattern>
</Entity>

Keywords

Keyword_german_drivers_license_number

  • Führerschein
  • Fuhrerschein
  • Fuehrerschein
  • Führerscheinnummer
  • Fuhrerscheinnummer
  • Fuehrerscheinnummer
  • Führerschein-
  • Fuhrerschein-
  • Fuehrerschein-
  • FührerscheinnummerNr
  • FuhrerscheinnummerNr
  • FuehrerscheinnummerNr
  • FührerscheinnummerKlasse
  • FuhrerscheinnummerKlasse
  • FuehrerscheinnummerKlasse
  • Führerschein- Nr
  • Fuhrerschein- Nr
  • Fuehrerschein- Nr
  • Führerschein- Klasse
  • Fuhrerschein- Klasse
  • Fuehrerschein- Klasse
  • FührerscheinnummerNr
  • FuhrerscheinnummerNr
  • FuehrerscheinnummerNr
  • FührerscheinnummerKlasse
  • FuhrerscheinnummerKlasse
  • FuehrerscheinnummerKlasse
  • Führerschein- Nr
  • Fuhrerschein- Nr
  • Fuehrerschein- Nr
  • Führerschein- Klasse
  • Fuhrerschein- Klasse
  • Fuehrerschein- Klasse
  • DL
  • DLS
  • Driv Lic
  • Driv Licen
  • Driv License
  • Driv Licenses
  • Driv Licence
  • Driv Licences
  • Driv Lic
  • Driver Licen
  • Driver License
  • Driver Licenses
  • Driver Licence
  • Driver Licences
  • Drivers Lic
  • Drivers Licen
  • Drivers License
  • Drivers Licenses
  • Drivers Licence
  • Drivers Licences
  • Driver's Lic
  • Driver's Licen
  • Driver's License
  • Driver's Licenses
  • Driver's Licence
  • Driver's Licences
  • Driving Lic
  • Driving Licen
  • Driving License
  • Driving Licenses
  • Driving Licence
  • Driving Licences

Keyword_german_drivers_license_collaborative

  • Nr-Führerschein
  • Nr-Fuhrerschein
  • Nr-Fuehrerschein
  • No-Führerschein
  • No-Fuhrerschein
  • No-Fuehrerschein
  • N-Führerschein
  • N-Fuhrerschein
  • N-Fuehrerschein
  • Nr-Führerschein
  • Nr-Fuhrerschein
  • Nr-Fuehrerschein
  • No-Führerschein
  • No-Fuhrerschein
  • No-Fuehrerschein
  • N-Führerschein
  • N-Fuhrerschein
  • N-Fuehrerschein

Keyword_german_drivers_license

  • ausstellungsdatum
  • ausstellungsort
  • ausstellende behöde
  • ausstellende behorde
  • ausstellende behoerde

German Passport Number

Format

10 digits or letters

Pattern

Pattern must include all of the following:

  • First character is a digit or a letter from this set (C, F, G, H, J, K)
  • Three digits
  • Five digits or letters from this set (C, -H, J-N, P, R, T, V-Z)
  • A digit

Checksum

Yes

Definition

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_german_passport finds content that matches the pattern.
  • A keyword from any of the five keyword lists is found.
  • The checksum passes.

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_german_passport_data finds content that matches the pattern.
  • A keyword from any of the five keyword lists is found.
  • The checksum passes.
<!-- Germany Passport Number -->
<Entity id="2e3da144-d42b-47ed-b123-fbf78604e52c" patternsProximity="300" recommendedConfidence="75">
  <Pattern confidenceLevel="85">
        <IdMatch idRef="Func_german_passport" />
        <Any minMatches="1">
          <Match idRef="Keyword_german_passport" />
          <Match idRef="Keyword_german_passport_collaborative" />
          <Match idRef="Keyword_german_passport_number" />
          <Match idRef="Keyword_german_passport1" />
          <Match idRef="Keyword_german_passport2" />
        </Any>
  </Pattern>
  <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_german_passport_data" />
        <Any minMatches="1">
          <Match idRef="Keyword_german_passport" />
          <Match idRef="Keyword_german_passport_collaborative" />
          <Match idRef="Keyword_german_passport_number" />
          <Match idRef="Keyword_german_passport1" />
          <Match idRef="Keyword_german_passport2" />
        </Any>
  </Pattern>
</Entity>

Keywords

Keyword_german_passport

  • reisepass
  • reisepasse
  • reisepassnummer
  • passport
  • passports

Keyword_german_passport_collaborative

  • geburtsdatum
  • ausstellungsdatum
  • ausstellungsort

Keyword_german_passport_number

No-Reisepass Nr-Reisepass

Keyword_german_passport1

Reisepass-Nr

Keyword_german_passport2

bnationalit.t

International banking account number (IBAN)

Format

Country code (two letters) plus check digits (two digits) plus bban number (up to 30 characters)

Pattern

Pattern must include all of the following:

  • Two-letter country code
  • Two check digits (followed by an optional space)
  • 1-7 groups of four letters or digits (can be separated by spaces)
  • 1-3 letters or digits

The format for each country is slightly different. The IBAN sensitive information type covers these 60 countries:

ad, ae, al, at, az, ba, be, bg, bh, ch, cr, cy, cz, de, dk, do, ee, es, fi, fo, fr, gb, ge, gi, gl, gr, hr, hu, ie, il, is, it, kw, kz, lb, li, lt, lu, lv, mc, md, me, mk, mr, mt, mu, nl, no, pl, pt, ro, rs, sa, se, si, sk, sm, tn, tr, vg

Checksum

Yes

Definition

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_iban finds content that matches the pattern.
  • The checksum passes.
<Entity id="e7dc4711-11b7-4cb0-b88b-2c394a771f0e" patternsProximity="300" recommendedConfidence="85">
  <Pattern confidenceLevel="85">
        <IdMatch idRef="Func_iban" />
  </Pattern>
</Entity>

Keywords

None

IP address

Format

IPv4:

  • Complex pattern which accounts for formatted (periods) and unformatted (no periods) versions of the IPv4 addresses.

IPv6:

  • Complex pattern which accounts for formatted IPv6 numbers (which include colons).

Pattern

n/a

Checksum

No

Definition

For IPv6, a DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_ipv6_address finds content that matches the pattern.
  • No keyword from Keyword_ipaddress is found.

For IPv4, a DLP policy is 95% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_ipv4_address finds content that matches the pattern.
  • A keyword from Keyword_ipaddress is found.

For IPv6, a DLP policy is 95% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_ipv6_address finds content that matches the pattern.
  • No keyword from Keyword_ipaddress is found.
    <!-- IP Address -->
    <Entity id="1daa4ad5-e2dd-4ca4-a788-54722c09efb2" patternsProximity="300" recommendedConfidence="85">
      <Pattern confidenceLevel="85">
        <IdMatch idRef="Regex_ipv6_address" />
        <Any minMatches="0" maxMatches="0">
          <Match idRef="Keyword_ipaddress" />
        </Any>
      </Pattern>
      <Pattern confidenceLevel="95">
        <IdMatch idRef="Regex_ipv4_address" />
        <Any minMatches="1">
          <Match idRef="Keyword_ipaddress" />
        </Any>
      </Pattern>
      <Pattern confidenceLevel="95">
        <IdMatch idRef="Regex_ipv6_address" />
        <Any minMatches="1">
          <Match idRef="Keyword_ipaddress" />
        </Any>
      </Pattern>
    </Entity>

Keywords

Keyword_ipaddress

  • IP (this keyword is case sensitive)
  • ip address
  • ip addresses
  • internet protocol
  • IP-כתובת ה

Israel bank account number

Format

13 digits

Pattern

Formatted:

  • Two digits
  • A dash
  • Three digits
  • A dash
  • Eight digits

Unformatted:

  • 13 consecutive digits

Checksum

No

Definition

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_israel_bank_account_number finds content that matches the pattern.
  • A keyword from Keyword_israel_bank_account_number is found.
<!-- Israel Bank Account Number -->
<Entity id="7d08b2ff-a0b9-437f-957c-aeddbf9b2b25" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="75">
        <IdMatch idRef="Regex_israel_bank_account_number" />
        <Any minMatches="1">
          <Match idRef="Keyword_israel_bank_account_number" />
        </Any>
    </Pattern>
</Entity>

Keywords

Keyword_israel_bank_account_number

  • Bank Account Number
  • Bank Account
  • Account Number
  • מספר חשבון בנק

Israel National ID

Format

Nine digits

Pattern

Nine consecutive digits

Checksum

Yes

Definition

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_israeli_national_id_number finds content that matches the pattern.
  • A keyword from Keyword_Israel_National_ID is found.
  • The checksum passes.
<!-- Israel National ID Number -->
<Entity id="e05881f5-1db1-418c-89aa-a3ac5c5277ee" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_israeli_national_id_number" />
        <Any minMatches="1">
          <Match idRef="Keyword_Israel_National_ID" />
        </Any>
    </Pattern>
</Entity>

Keywords

Keyword_Israel_National_ID

  • מספר זהות
  • National ID Number

Italy Ddriver's License Number

Format

A combination of 10 letters and digits

Pattern

  • A combination of 10 letters and digits:

    • One letter (not case sensitive)
    • The letter "A" or "V" (not case sensitive)
    • Seven letters (not case sensitive), digits, or the underscore character
    • One letter (not case sensitive)

Checksum

No

Definition

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_italy_drivers_license_number finds content that matches the pattern.
  • A keyword from Keyword_italy_drivers_license_number is found.
<!-- Italy Driver's license Number -->
<Entity id="97d6244f-9157-41bd-8e0c-9d669a5c4d71" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="75">
        <IdMatch idRef="Regex_italy_drivers_license_number" />
        <Any minMatches="1">
          <Match idRef="Keyword_italy_drivers_license_number" />
        </Any>
    </Pattern>
</Entity>

Keywords

Keyword_italy_drivers_license_number

  • numero di patente di guida
  • patente di guida

Japan Bank Account Number

Format

Seven or eight digits

Pattern

Bank account number:

  • Seven or eight digits

Bank account branch code:

  • Four digits
  • A space or dash (optional)
  • Three digits

Checksum

No

Definition

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_jp_bank_account finds content that matches the pattern.

  • A keyword from Keyword_jp_bank_accountis found.

  • One of the following is true:

    • The function Func_jp_bank_account_branch_code finds content that matches the pattern.
    • A keyword from Keyword_jp_bank_branch_code is found.

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_jp_bank_account finds content that matches the pattern.
  • A keyword from Keyword_jp_bank_account is found.
<!-- Japan Bank Account Number -->
<Entity id="d354f95b-96ee-4b80-80bc-4377312b55bc" patternsProximity="300" recommendedConfidence="75">
  <Version minEngineVersion="15.01.0131.000">
    <Pattern confidenceLevel="85">
          <IdMatch idRef="Func_jp_bank_account" />
          <Match idRef="Keyword_jp_bank_account" />
          <Any minMatches="1">
            <Match idRef="Func_jp_bank_account_branch_code" />
            <Match idRef="Keyword_jp_bank_branch_code" />
          </Any>
      </Pattern>
  </Version>
     <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_jp_bank_account" />
        <Match idRef="Keyword_jp_bank_account" />
    </Pattern>
</Entity>

Keywords

Keyword_jp_bank_account

  • Checking Account Number
  • Checking Account
  • Checking Account #
  • Checking Acct Number
  • Checking Acct #
  • Checking Acct No.
  • Checking Account No.
  • Bank Account Number
  • Bank Account
  • Bank Account #
  • Bank Acct Number
  • Bank Acct #
  • Bank Acct No.
  • Bank Account No.
  • Savings Account Number
  • Savings Account
  • Savings Account #
  • Savings Acct Number
  • Savings Acct #
  • Savings Acct No.
  • Savings Account No.
  • Debit Account Number
  • Debit Account
  • Debit Account #
  • Debit Acct Number
  • Debit Acct #
  • Debit Acct No.
  • Debit Account No.
  • 口座番号を当座預金口座の確認
  • #アカウントの確認、勘定番号の確認
  • #勘定の確認
  • 勘定番号の確認
  • 口座番号の確認
  • 銀行口座番号
  • 銀行口座
  • 銀行口座#
  • 銀行の勘定番号
  • 銀行のacct#
  • 銀行の勘定いいえ
  • 銀行口座番号
  • 普通預金口座番号
  • 預金口座
  • 貯蓄口座#
  • 貯蓄勘定の数
  • 貯蓄勘定#
  • 貯蓄勘定番号
  • 普通預金口座番号
  • 引き落とし口座番号
  • 口座番号
  • 口座番号#
  • デビットのacct番号
  • デビット勘定#
  • デビットACCTの番号
  • デビット口座番号

Keyword_jp_bank_branch_code

Otemachi

Japan Driver's License Number

Format

12 digits

Pattern

12 consecutive digits

Checksum

No

Definition

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_jp_drivers_license_number finds content that matches the pattern.
  • A keyword from Keyword_jp_drivers_license_number is found.
<!-- Japan Driver's License Number -->
<Entity id="c6011143-d087-451c-8313-7f6d4aed2270" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_jp_drivers_license_number" />
        <Match idRef ="Keyword_jp_drivers_license_number" />
    </Pattern>
</Entity>

Keywords

Keyword_jp_drivers_license_number

  • dl#
  • DL#
  • dls#
  • DLS#
  • driver license
  • driver licenses
  • drivers license
  • driver's license
  • drivers licenses
  • driver's licenses
  • driving licence
  • lic#
  • LIC#
  • lics#
  • state id
  • state identification
  • state identification number
  • 低所得国#
  • 免許証
  • 状態ID
  • 状態の識別
  • 状態の識別番号
  • 運転免許
  • 運転免許証
  • 運転免許証番号

Japan passport number

Format

Two letters followed by seven digits

Pattern

Two letters (not case sensitive) followed by seven digits

Checksum

No

Definition

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_jp_passport finds content that matches the pattern.
  • A keyword from Keyword_jp_passport is found.
<!-- Japan Passport Number -->
<Entity id="75177310-1a09-4613-bf6d-833aae3743f8" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_jp_passport" />
        <Match idRef="Keyword_jp_passport" />
    </Pattern>
</Entity>

Keywords

Keyword_jp_passport

  • パスポート
  • パスポート番号
  • パスポートのNum
  • パスポート#

Japan Resident Registration Number

Format

11 digits

Pattern

11 consecutive digits

Checksum

No

Definition

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_jp_resident_registration_number finds content that matches the pattern.
  • A keyword from Keyword_jp_resident_registration_number is found.
<!-- Japan Resident Registration Number -->
<Entity id="01c1209b-6389-4faf-a5f8-3f7e13899652" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_jp_resident_registration_number" />
        <Match idRef ="Keyword_jp_resident_registration_number" />
    </Pattern>
</Entity>

Keywords

Keyword_jp_resident_registration_number

  • Resident Registration Number
  • Resident Register Number
  • Residents Basic Registry Number
  • Resident Registration No.
  • Resident Register No.
  • Residents Basic Registry No.
  • Basic Resident Register No.
  • 住民登録番号、登録番号をレジデント
  • 住民基本登録番号、登録番号
  • 住民基本レジストリ番号を常駐
  • 登録番号を常駐住民基本台帳登録番号

Japan Social Insurance Number (SIN)

Format

7-12 digits

Pattern

7-12 digits:

  • Four digits
  • A hyphen (optional)
  • Six digits

OR

  • 7-12 consecutive digits

Checksum

No

Definition

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_jp_sin finds content that matches the pattern.
  • A keyword from Keyword_jp_sin is found.

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_jp_sin_pre_1997 finds content that matches the pattern.
  • A keyword from Keyword_jp_sin is found.
<!-- Japan Social Insurance Number -->
<Entity id="c840e719-0896-45bb-84fd-1ed5c95e45ff" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="85">
        <IdMatch idRef="Func_jp_sin" />
        <Match idRef="Keyword_jp_sin" />
    </Pattern>
    <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_jp_sin_pre_1997" />
        <Match idRef="Keyword_jp_sin" />
    </Pattern>
</Entity>

Keywords

Keyword_jp_sin

  • Social Insurance No.
  • Social Insurance Num
  • Social Insurance Number
  • 社会保険のテンキー
  • 社会保険番号

New Zealand Ministry of Health Number

Format

Three letters, a space (optional), and four digits

Pattern

Three letters (not case sensitive) a space (optional) four digits

Checksum

Yes

Definition

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_new_zealand_ministry_of_health_number finds content that matches the pattern.
  • A keyword from Keyword_nz_terms is found.
  • The checksum passes.
<!-- New Zealand Health Number -->
<Entity id="2b71c1c8-d14e-4430-82dc-fd1ed6bf05c7" patternsProximity="300" recommendedConfidence="85">
    <Pattern confidenceLevel="85">
        <IdMatch idRef="Func_new_zealand_ministry_of_health_number" />
        <Any minMatches="1">
          <Match idRef="Keyword_nz_terms" />
        </Any>
    </Pattern>
</Entity>

Keywords

Keyword_nz_terms

  • NHI
  • New Zealand
  • Health
  • treatment

Poland Identity Card

Format

Three letters and six digits

Pattern

Three letters (not case sensitive) followed by six digits

Checksum

Yes

Definition

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_polish_national_id finds content that matches the pattern.
  • A keyword from Keyword_polish_national_id_passport_number is found.
  • The checksum passes.
<!-- Poland Identity Card-->
<Entity id="25E64989-ED5D-40CA-A939-6C14183BB7BF" patternsProximity="300" recommendedConfidence="85">
      <Pattern confidenceLevel="85">
          <IdMatch idRef="Func_polish_national_id" />
          <Match idRef="Keyword_polish_national_id_passport_number" />
      </Pattern>
</Entity>

Keywords

Keyword_polish_national_id_passport_number

  • Dowód osobisty
  • Numer dowodu osobistego
  • Nazwa i numer dowodu osobistego
  • Nazwa i nr dowodu osobistego
  • Nazwa i nr dowodu tożsamości
  • Dowód Tożsamości
  • dow. os.

Poland National ID (PESEL)

Format

11 digits

Pattern

11 consecutive digits

Checksum

Yes

Definition

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_pesel_identification_number finds content that matches the pattern.
  • A keyword from Keyword_pesel_identification_number is found.
  • The checksum passes.
<!-- Poland National ID (PESEL) -->
<Entity id="E3AAF206-4297-412F-9E06-BA8487E22456" patternsProximity="300" recommendedConfidence="85">
      <Pattern confidenceLevel="85">
          <IdMatch idRef="Func_pesel_identification_number" />
          <Match idRef="Keyword_pesel_identification_number" />
      </Pattern>
</Entity>

Keywords

Keyword_pesel_identification_number

  • Nr PESEL
  • PESEL

Poland Passport

Format

Two letters and seven digits

Pattern

Two letters (not case sensitive) followed by seven digits

Checksum

Yes

Definition

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_polish_passport_number finds content that matches the pattern.
  • A keyword from Keyword_polish_national_id_passport_number is found.
  • The checksum passes.
<!-- Poland Passport Number -->
<Entity id="03937FB5-D2B6-4487-B61F-0F8BFF7C3517" patternsProximity="300" recommendedConfidence="85">
      <Pattern confidenceLevel="85">
          <IdMatch idRef="Func_polish_passport_number" />
          <Match idRef="Keyword_polish_national_id_passport_number" />
      </Pattern>
</Entity>
</Version>

Keywords

Keyword_polish_national_id_passport_number

  • Nazwa i nr dowodu tożsamości
  • Dowód Tożsamości
  • dow. os.

Saudi Arabia National ID

Format

10 digits

Pattern

10 consecutive digits

Checksum

No

Definition

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_saudi_arabia_national_id finds content that matches the pattern.
  • A keyword from Keyword_saudi_arabia_national_id is found.
<!-- Saudi Arabia National ID -->
<Entity id="8c5a0ba8-404a-41a3-8871-746aa21ee6c0" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="75">
        <IdMatch idRef="Regex_saudi_arabia_national_id" />
        <Any minMatches="1">
          <Match idRef="Keyword_saudi_arabia_national_id" />
        </Any>
    </Pattern>
</Entity>

Keywords

Keyword_saudi_arabia_national_id

  • Identification Card
  • I card number
  • ID number
  • الوطنية الهوية بطاقة رقم

Spain Social Security Number (SSN)

Format

11-12 digits

Pattern

11-12 digits:

  • Two digits
  • A forward slash (optional)
  • 7-8 digits
  • A forward slash (optional)
  • Two digits

Checksum

Yes

Definition

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_spanish_social_security_number finds content that matches the pattern.
  • The checksum passes.
<!-- Spain SSN -->
<Entity id="5df987c0-8eae-4bce-ace7-b316347f3070" patternsProximity="300" recommendedConfidence="85">
    <Pattern confidenceLevel="85">
        <IdMatch idRef="Func_spanish_social_security_number" />
    </Pattern>
</Entity>

Keywords

None

Sweden National ID

Format

10 or 12 digits and an optional delimiter

Pattern

10 or 12 digits and an optional delimiter:

  • 2-4 digits (optional)
  • Six digits in date format YYMMDD
  • Delimiter of "-" or "+" (optional), plus
  • Four digits

Checksum

Yes

Definition

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_swedish_national_identifier finds content that matches the pattern.
  • The checksum passes.
<!-- Sweden National ID -->
<Entity id="f69aaf40-79be-4fac-8f05-fd1910d272c8" patternsProximity="300" recommendedConfidence="85">
    <Pattern confidenceLevel="85">
        <IdMatch idRef="Func_swedish_national_identifier" />
    </Pattern>
</Entity>

Keywords

None

Sweden Passport Number

Format

Eight digits

Pattern

Eight consecutive digits

Checksum

No

Definition

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_sweden_passport_number finds content that matches the pattern.

  • One of the following is true:

    • A keyword from Keyword_passport is found.
    • A keyword from Keyword_sweden_passport is found.
<!-- Sweden Passport Number -->
<Entity id="ba4e7456-55a9-4d89-9140-c33673553526" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="75">
        <IdMatch idRef="Regex_sweden_passport_number" />
        <Any minMatches="1">
          <Match idRef="Keyword_passport" />
          <Match idRef="Keyword_sweden_passport" />
        </Any>
    </Pattern>
</Entity>

Keywords

Keyword_sweden_passport

  • visa requirements
  • Alien Registration Card
  • Schengen visas
  • Schengen visa
  • Visa Processing
  • Visa Type
  • Single Entry
  • Multiple Entry
  • G3 Processing Fees

Keyword_passport

  • Passport Number
  • Passport No
  • Passport #
  • Passport#
  • PassportID
  • Passportno
  • passportnumber
  • パスポート
  • パスポート番号
  • パスポートのNum
  • パスポート#
  • Numéro de passeport
  • Passeport n °
  • Passeport Non
  • Passeport #
  • Passeport#
  • PasseportNon
  • Passeportn °

SWIFT Code

Format

Four letters followed by 5-31 letters or digits

Pattern

Four letters followed by 5-31 letters or digits:

  • Four-letter bank code (not case sensitive)
  • An optional space
  • 4-28 letters or digits (the Basic Bank Account Number (BBAN))
  • An optional space
  • 1-3 letters or digits (remainder of the BBAN)

Checksum

No

Definition

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_swift finds content that matches the pattern.
  • A keyword from Keyword_swift is found.
<Entity id="cb2ab58c-9cb8-4c81-baf8-a4e106791df4" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="75">
        <IdMatch idRef="Regex_swift" />
        <Match idRef="Keyword_swift" />
    </Pattern>
</Entity>

Keywords

Keyword_swift

  • international organization for standardization 9362
  • iso 9362
  • iso9362
  • swift#
  • swiftcode
  • swiftnumber
  • swiftroutingnumber
  • swift code
  • swift number #
  • swift routing number
  • bic number
  • bic code
  • bic #
  • bic#
  • bank identifier code
  • 標準化9362
  • 迅速#
  • SWIFTコード
  • SWIFT番号
  • 迅速なルーティング番号
  • BIC番号
  • BICコード
  • 銀行識別コードのための国際組織
  • Organisation internationale de normalisation 9362
  • rapide #
  • code SWIFT
  • le numéro de swift
  • swift numéro d'acheminement
  • le numéro BIC
  • # BIC
  • code identificateur de banque

Taiwan National ID

Format

One letter (in English) followed by nine digits

Pattern

One letter (in English) followed by nine digits:

  • One letter (in English, not case sensitive)
  • The digit "1" or "2"
  • Eight digits

Checksum

Yes

Definition

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_taiwanese_national_id finds content that matches the pattern.
  • A keyword from Keyword_taiwanese_national_id is found.
  • The checksum passes.
<!-- Taiwanese National ID -->
<Entity id="4C7BFC34-8DD1-421D-8FB7-6C6182C2AF03" patternsProximity="300" recommendedConfidence="85">
      <Pattern confidenceLevel="85">
          <IdMatch idRef="Func_taiwanese_national_id" />
          <Match idRef="Keyword_taiwanese_national_id" />
      </Pattern>
</Entity>

Keywords

Keyword_taiwanese_national_id

  • 身份證字號
  • 身份證
  • 身份證號碼
  • 身份證號
  • 身分證字號
  • 身分證
  • 身分證號碼
  • 身份證號
  • 身分證統一編號
  • 國民身分證統一編號
  • 簽名
  • 蓋章
  • 簽名或蓋章
  • 簽章

U.K. Driver's License Number

Format

Combination of 18 letters and digits in the specified format

Pattern

18 letters and digits:

  • Five letters (not case sensitive) or the digit "9" in place of a letter
  • One digit
  • Five digits in the date format MMDDY for date of birth (7th character is incremented by 50 if driver is female, i.e. 51 to 62 instead of 01 to 12)
  • Two letters (not case sensitive) or the digit "9" in place of a letter
  • Five digits

Checksum

Yes

Definition

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_uk_drivers_license finds content that matches the pattern.
  • A keyword from Keyword_uk_drivers_license is found.
  • The checksum passes.
<!-- U.K. Driver's License Number -->
<Entity id="f93de4be-d94c-40df-a8be-461738047551" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_uk_drivers_license" />
        <Match idRef="Keyword_uk_drivers_license" />
    </Pattern>
</Entity>

Keywords

Keyword_uk_drivers_license

  • DVLA
  • light vans
  • quadbikes
  • motor cars
  • 125cc
  • sidecar
  • tricycles
  • motorcycles
  • photocard licence
  • learner drivers
  • licence holder
  • licence holders
  • driving licences
  • driving licence
  • dual control car

U.K. Electoral Roll Number

Format

Two letters followed by 1-4 digits

Pattern

Two letters (not case sensitive) followed by 1-4 numbers

Checksum

No

Definition

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_uk_electoral finds content that matches the pattern.
  • A keyword from Keyword_uk_electoral is found.
<!-- U.K. Electoral Number -->
<Entity id="a3eea206-dc0c-4f06-9e22-aa1be3059963" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="75">
        <IdMatch idRef="Regex_uk_electoral" />
        <Any minMatches="1">
          <Match idRef="Keyword_uk_electoral" />
        </Any>
    </Pattern>
</Entity>

Keywords

Keyword_uk_electoral

  • council nomination
  • nomination form
  • electoral register
  • electoral roll

U.K. National Health Service Number

Format

10-17 digits separated by spaces

Pattern

10-17 digits:

  • Either 3 or 10 digits
  • A space
  • Three digits
  • A space
  • Four digits

Checksum

Yes

Definition

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_uk_nhs_number finds content that matches the pattern.

  • One of the following is true:

    • A keyword from Keyword_uk_nhs_number is found.
    • A keyword from Keyword_uk_nhs_number1 is found.
    • A keyword from Keyword_uk_nhs_number_dob is found.
  • The checksum passes.

<!-- U.K. NHS Number -->
<Entity id="3192014e-2a16-44e9-aa69-4b20375c9a78" patternsProximity="300" recommendedConfidence="85">
    <Pattern confidenceLevel="85">
        <IdMatch idRef="Func_uk_nhs_number" />
        <Any minMatches="1">
          <Match idRef="Keyword_uk_nhs_number" />
          <Match idRef="Keyword_uk_nhs_number1" />
          <Match idRef="Keyword_uk_nhs_number_dob" />
        </Any>
    </Pattern>
</Entity>

Keywords

Keyword_uk_nhs_number

  • national health service
  • nhs
  • health services authority
  • health authority

Keyword_uk_nhs_number1

  • patient id
  • patient identification
  • patient no
  • patient number

Keyword_uk_nhs_number_dob

  • GP
  • DOB
  • D.O.B
  • Date of Birth
  • Birth Date

U.K. National Insurance Number (NINO)

Format

7 characters or 9 characters separated by spaces or dashes

Pattern

Two possible patterns:

  • Two letters (valid NINOs use only certain characters in this prefix, which this pattern validates; not case sensitive)
  • Six digits
  • Either 'A', 'B', 'C', or 'D' (like the prefix, only certain characters are allowed in the suffix; not case sensitive)

OR

  • Two letters
  • A space or dash
  • Two digits
  • A space or dash
  • Two digits
  • A space or dash
  • Two digits
  • A space or dash
  • Either 'A', 'B', 'C', or 'D'

Checksum

No

Definition

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_uk_nino finds content that matches the pattern.
  • A keyword from Keyword_uk_nino is found.

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_uk_nino finds content that matches the pattern.
  • No keyword from Keyword_uk_nino is found.
<!-- U.K. NINO -->
<Entity id="16c07343-c26f-49d2-a987-3daf717e94cc" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="85">
        <IdMatch idRef="Func_uk_nino" />
        <Any minMatches="1">
          <Match idRef="Keyword_uk_nino" />
        </Any>
    </Pattern>
     <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_uk_nino" />
        <Any minMatches="0" maxMatches="0">
          <Match idRef="Keyword_uk_nino" />
        </Any>
    </Pattern>
</Entity>

Keywords

Keyword_uk_nino

  • national insurance number
  • national insurance contributions
  • protection act
  • insurance
  • social security number
  • insurance application
  • medical application
  • social insurance
  • medical attention
  • social security
  • great britain
  • insurance

U.S. / U.K. Passport Number

Format

Nine digits

Pattern

Nine consecutive digits

Checksum

No

Definition

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_usa_uk_passport finds content that matches the pattern.
  • A keyword from Keyword_passport is found.
<Entity id="178ec42a-18b4-47cc-85c7-d62c92fd67f8" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_usa_uk_passport" />
        <Match idRef="Keyword_passport" />
    </Pattern>
</Entity>

Keywords

Keyword_passport

  • Passport Number
  • Passport No
  • Passport #
  • Passport#
  • PassportID
  • Passportno
  • passportnumber
  • パスポート
  • パスポート番号
  • パスポートのNum
  • パスポート#
  • Numéro de passeport
  • Passeport n °
  • Passeport Non
  • Passeport #
  • Passeport#
  • PasseportNon
  • Passeportn °

U.S. Bank Account Number

Format

8-17 digits

Pattern

8-17 consecutive digits

Checksum

No

Definition

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_usa_bank_account_number finds content that matches the pattern.
  • A keyword from Keyword_usa_Bank_Account is found.
<!-- U.S. Bank Account Number -->
<Entity id="a2ce32a8-f935-4bb6-8e96-2a5157672e2c" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="75">
        <IdMatch idRef="Regex_usa_bank_account_number" />
        <Match idRef="Keyword_usa_Bank_Account" />
    </Pattern>
</Entity>

Keywords

Keyword_usa_Bank_Account

  • Checking Account Number
  • Checking Account
  • Checking Account #
  • Checking Acct Number
  • Checking Acct #
  • Checking Acct No.
  • Checking Account No.
  • Bank Account Number
  • Bank Account #
  • Bank Acct Number
  • Bank Acct #
  • Bank Acct No.
  • Bank Account No.
  • Savings Account Number
  • Savings Account.
  • Savings Account #
  • Savings Acct Number
  • Savings Acct #
  • Savings Acct No.
  • Savings Account No.
  • Debit Account Number
  • Debit Account
  • Debit Account #
  • Debit Acct Number
  • Debit Acct #
  • Debit Acct No.
  • Debit Account No.

U.S. Driver's License Number

Format

Depends on the state

Pattern

Depends on the state -- for example, New York:

  • Nine digits formatted like ddd ddd ddd will match.
  • Nine digits like ddddddddd will not match.

Checksum

No

Definition

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_new_york_drivers_license_number finds content that matches the pattern.
  • A keyword from Keyword_[state_name]_drivers_license_name is found.
  • A keyword from Keyword_us_drivers_license is found.

A DLP policy is 65% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_new_york_drivers_license_number finds content that matches the pattern.
  • A keyword from Keyword_[state_name]_drivers_license_name is found.
  • A keyword from Keyword_us_drivers_license_abbreviations is found.
  • No keyword from Keyword_us_drivers_license is found.
<Entity id="dfeb356f-61cd-459e-bf0f-7c6d28b458c6 patternsProximity="300">
    <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_new_york_drivers_license_number" />
        <Match idRef="Keyword_new_york_drivers_license_name" />
        <Match idRef="Keyword_us_drivers_license" />
    </Pattern>
    <Pattern confidenceLevel="65">
        <IdMatch idRef="Func_new_york_drivers_license_number" />
        <Match idRef="Keyword_new_york_drivers_license_name" />
        <Match idRef="Keyword_us_drivers_license_abbreviations" />
        <Any minMatches="0" maxMatches="0">
          <Match idRef="Keyword_us_drivers_license" />
        </Any>
    </Pattern>
</Entity>

Keywords

Keyword_us_drivers_license_abbreviations

  • DL
  • DLS
  • CDL
  • CDLS
  • ID
  • IDs
  • DL#
  • DLS#
  • CDL#
  • CDLS#
  • ID#
  • IDs#
  • ID number
  • ID numbers
  • LIC
  • LIC#

Keyword_us_drivers_license

  • DriverLic
  • DriverLics
  • DriverLicense
  • DriverLicenses
  • Driver Lic
  • Driver Lics
  • Driver License
  • Driver Licenses
  • DriversLic
  • DriversLics
  • DriversLicense
  • DriversLicenses
  • Drivers Lic
  • Drivers Lics
  • Drivers License
  • Drivers Licenses
  • Driver'Lic
  • Driver'Lics
  • Driver'License
  • Driver'Licenses
  • Driver' Lic
  • Driver' Lics
  • Driver' License
  • Driver' Licenses
  • Driver'sLic
  • Driver'sLics
  • Driver'sLicense
  • Driver'sLicenses
  • Driver's Lic
  • Driver's Lics
  • Driver's License
  • Driver's Licenses
  • identification number
  • identification numbers
  • identification #
  • id card
  • id cards
  • identification card
  • identification cards
  • DriverLic#
  • DriverLics#
  • DriverLicense#
  • DriverLicenses#
  • Driver Lic#
  • Driver Lics#
  • Driver License#
  • Driver Licenses#
  • DriversLic#
  • DriversLics#
  • DriversLicense#
  • DriversLicenses#
  • Drivers Lic#
  • Drivers Lics#
  • Drivers License#
  • Drivers Licenses#
  • Driver'Lic#
  • Driver'Lics#
  • Driver'License#
  • Driver'Licenses#
  • Driver' Lic#
  • Driver' Lics#
  • Driver' License#
  • Driver' Licenses#
  • Driver'sLic#
  • Driver'sLics#
  • Driver'sLicense#
  • Driver'sLicenses#
  • Driver's Lic#
  • Driver's Lics#
  • Driver's License#
  • Driver's Licenses#
  • id card#
  • id cards#
  • identification card#
  • identification cards#

Keyword_[state_name]_drivers_license_name

  • State abbreviation (for example, "NY")
  • State name (for example, "New York")

U.S. Individual Taxpayer Identification Number (ITIN)

Format

Nine digits that start with a "9" and contain a "7" or "8" as the fourth digit, optionally formatted with spaces or dashes

Pattern

Formatted:

  • The digit "9"
  • Two digits
  • A space or dash
  • A "7" or "8"
  • A digit
  • A space, or dash
  • Four digits

Unformatted:

  • The digit "9"
  • Two digits
  • A "7" or "8"
  • Five digits

Checksum

No

Definition

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_formatted_itin finds content that matches the pattern.

  • At least one of the following is true:

    • A keyword from Keyword_itin is found.
    • The function Func_us_address finds an address in the right date format.
    • The function Func_us_date finds a date in the right date format.
    • A keyword from Keyword_itin_collaborative is found.

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_unformatted_itin finds content that matches the pattern.

  • At least one of the following is true:

    • A keyword from Keyword_itin_collaborative is found.
    • The function Func_us_address finds an address in the right date format.
    • The function Func_us_date finds a date in the right date format.
<!-- U.S. Individual Taxpayer Identification Number (ITIN) -->
<Entity id="e55e2a32-f92d-4985-a35d-a0b269eb687b" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="85">
        <IdMatch idRef="Func_formatted_itin" />
        <Any minMatches="1">
          <Match idRef="Keyword_itin" />
          <Match idRef="Func_us_address" />
          <Match idRef="Func_us_date" />
          <Match idRef="Keyword_itin_collaborative" />
        </Any>
    </Pattern>
    <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_unformatted_itin" />
        <Match idRef="Keyword_itin" />
        <Any minMatches="1">
          <Match idRef="Keyword_itin_collaborative" />
          <Match idRef="Func_us_address" />
          <Match idRef="Func_us_date" />
        </Any>
    </Pattern>
</Entity>

Keywords

Keyword_itin

  • taxpayer
  • tax id
  • tax identification
  • itin
  • ssn
  • tin
  • social security
  • tax payer
  • itins
  • taxid
  • individual taxpayer

Keyword_itin_collaborative

  • License
  • DL
  • DOB
  • Birthdate
  • Birthday
  • Date of Birth

U.S. Social Security Number (SSN)

Format

9 digits, which may be in a formatted or unformatted pattern

Note

If issued before mid-2011, an SSN has strong formatting where certain parts of the number must fall within certain ranges to be valid (but there's no checksum).

Pattern

Four functions look for SSNs in four different patterns:

  • Func_ssn finds SSNs with pre-2011 strong formatting that are formatted with dashes or spaces (ddd-dd-dddd OR ddd dd dddd)
  • Func_unformatted_ssn finds SSNs with pre-2011 strong formatting that are unformatted as nine consecutive digits (ddddddddd)
  • Func_randomized_formatted_ssn finds post-2011 SSNs that are formatted with dashes or spaces (ddd-dd-dddd OR ddd dd dddd)
  • Func_randomized_unformatted_ssn finds post-2011 SSNs that are unformatted as nine consecutive digits (ddddddddd)

Checksum

No

Definition

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_ssn finds content that matches the pattern.
  • A keyword from Keyword_ssn is found.

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_unformatted_ssn finds content that matches the pattern.
  • A keyword from Keyword_ssn is found.

A DLP policy is 65% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_randomized_formatted_ssn finds content that matches the pattern.
  • A keyword from Keyword_ssn is found.

A DLP policy is 55% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_randomized_unformatted_ssn finds content that matches the pattern.
  • A keyword from Keyword_ssn is found.
<!-- U.S. Social Security Number (SSN) -->
  <Entity id="a44669fe-0d48-453d-a9b1-2cc83f2cba77" patternsProximity="300" recommendedConfidence="75">
      <Pattern confidenceLevel="85">
        <IdMatch idRef="Func_ssn" />
        <Match idRef="Keyword_ssn" />
      </Pattern>
      <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_unformatted_ssn" />
        <Match idRef="Keyword_ssn" />
      </Pattern>
      <Pattern confidenceLevel="65">
        <IdMatch idRef="Func_randomized_formatted_ssn" />
        <Match idRef="Keyword_ssn" />
      </Pattern>
      <Pattern confidenceLevel="55">
        <IdMatch idRef="Func_randomized_unformatted_ssn" />
        <Match idRef="Keyword_ssn" />
      </Pattern>
  </Entity>

Keywords

Keyword_ssn

  • Social Security
  • Social Security#
  • Soc Sec
  • SSN
  • SSNS
  • SSN#
  • SS#
  • SSID