The following examples refer to custom metadata that must be created before you create the Smart Classification rule; a Smart Classification rule cannot be saved unless you specify which metadata field to set.
Identifying files less than 5 MB containing US social security numbers
| Rule name | Tag files <5 MB with US social security numbers |
| Automatic execution | Enable |
| Filters | File size is less than [5 MB] |
| Classifier | Default |
| Classifier patterns | Match pattern by name [U. S. Social Security Number (SSN)] |
| Classifier condition | Number of matches is greater than [0] |
| Match action | Set metadata [SSN.found] to yes |
| Non-match action | Set metadata [SSN.found] to no |
Identifying files with extensions .txt and .pdf containing US social security numbers
| Rule name | Tag txt and pdf files with US social security numbers |
| Automatic execution | Enable |
| Filters | File extension is equal to [txt] OR File extension is equal to [pdf] |
| Classifier | Default |
| Classifier patterns | Match pattern by name [U. S. Social Security Number (SSN)] |
| Classifier condition | Number of matches is greater than [0] |
| Match action | Set metadata [SSN.found] to yes |
| Non-match action | Set metadata [SSN.found] to no |
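To make the flow of such a rule concrete, the following is a minimal Python sketch of how the filter, classifier condition, and match/non-match actions relate to each other. It is only an illustration, not the product's implementation: the function names are hypothetical, the regular expression is a simplified stand-in for the built-in SSN pattern, and the case-insensitive extension check and the empty result for filtered-out files are assumptions.

```python
import re
from pathlib import PurePosixPath

# Assumption: a simplified stand-in for the built-in SSN pattern,
# not the pattern the product actually uses.
SSN_LIKE = re.compile(r"\b\d{3}-\d{2}-\d{4}\b")
ALLOWED_EXTENSIONS = {"txt", "pdf"}


def passes_filter(file_name: str) -> bool:
    """Filter: file extension is equal to txt OR pdf (case-insensitive here by assumption)."""
    extension = PurePosixPath(file_name).suffix.lstrip(".").lower()
    return extension in ALLOWED_EXTENSIONS


def classify(file_name: str, text: str) -> dict:
    """Return the metadata the rule would set, or {} if the filter excludes the file."""
    if not passes_filter(file_name):
        return {}  # assumption: the rule does not touch files outside the filter
    match_count = len(SSN_LIKE.findall(text))
    # Classifier condition: number of matches is greater than 0
    return {"SSN.found": "yes" if match_count > 0 else "no"}
```

In this sketch, `classify("notes.txt", "SSN 123-45-6789")` sets `SSN.found` to `yes`, while a `.docx` file with the same text is skipped by the filter.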
Identifying all files containing US social security numbers
| Rule name | Tag all files with US social security numbers |
| Automatic execution | Enable |
| Filters | Anything |
| Classifier | Default |
| Classifier patterns | Match pattern by name [U. S. Social Security Number (SSN)] |
| Classifier condition | Number of matches is greater than [0] |
| Match action | Set metadata [SSN.found] to yes |
| Non-match action | Set metadata [SSN.found] to no |
Identifying files in the Team Folder HumanResources containing US social security numbers
| Rule name | Tag all Human Resources files containing US social security numbers |
| Automatic execution | Enable |
| Filters | File path starts with [TeamFolderAdmin/HumanResources] |
| Classifier | Default |
| Classifier patterns | Match pattern by name [U. S. Social Security Number (SSN)] |
| Classifier condition | Number of matches [is greater than 0] |
| Match action | Set metadata [SSN.found] to yes |
| Non-match action | Set metadata [SSN.found] to no |
Identifying files containing any pattern in the custom pattern group France ID numbers
| Rule name | France ID numbers |
| Automatic execution | Enable |
| Filters | Anything |
| Classifier | Default |
| Classifier patterns | Match pattern by group [France ID numbers] |
| Classifier condition | Number of matches [is greater than 0] |
| Match action | Set [metadata ID.found] to yes |
| Non-match action | Set [metadata ID.found] to no |
Identifying files with Singapore National Registration Identity Card (NRIC) numbers
| Rule name | Tag files with Singapore NRIC |
| Automatic execution | Enable |
| Filters | Anything |
| Classifier | Singapore NRIC |
| Classifier condition | Number of matches [is greater than 0] |
| Match action | Set [metadata ID.found] to yes |
| Non-match action | Set [metadata ID.found] to no |
Identifying files with patterns matching American Express credit cards
| Rule name | Tag files with American Express card numbers |
| Automatic execution | Enable |
| Filters | Anything |
| Classifier | Default |
| Classifier patterns | Match RegEx [3[47]{1}[0-9]{13}] |
| Classifier condition | Number of matches [is greater than 0] |
| Match action | Set [metadata ID.found] to yes |
| Non-match action | Set [metadata ID.found] to no |
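Before saving a RegEx pattern in a rule, it can help to try it against sample text. The sketch below uses Python's `re` module to show what the expression above does and does not match; the helper name is hypothetical, and the product's regular expression engine may differ from Python's in details.

```python
import re

# The pattern from the rule: a 3, then a 4 or 7, then 13 more digits (15 digits total).
AMEX_PATTERN = re.compile(r"3[47]{1}[0-9]{13}")


def count_amex_like(text: str) -> int:
    """Count non-overlapping runs of digits that fit the American Express shape."""
    return len(AMEX_PATTERN.findall(text))


# A widely published American Express test number matches once.
print(count_amex_like("Card on file: 378282246310005"))
# The same number written with spaces does not match, because the
# pattern requires 15 consecutive digits.
print(count_amex_like("Card on file: 3782 822463 10005"))
```

Because the pattern has no boundaries around it, a 15-digit run embedded in a longer string of digits also counts as a match, so expect some false positives in files that contain long numeric identifiers.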
Identifying files with the exact phrase "Confidential - for internal use only"
| Rule name | Tag files marked as confidential |
| Automatic execution | Enable |
| Filters | Anything |
| Classifier | Default |
| Classifier patterns | Match RegEx [Confidential - for internal use only] |
| Classifier condition | Number of matches [is greater than 0] |
| Match action | Set [IncludesText.Confidential] to yes |
| Non-match action | Set [IncludesText.Confidential] to no |
Mark files with different tags depending on the number of matches
In this rule, a file with 0 to 2 five-digit numbers is marked as having a low possibility of containing personal ID information, and a file with more than 2 five-digit numbers is marked as having a high possibility. This lets you handle the two groups differently. For example, you might manually review files marked low, but automatically block files marked high.
| Rule name | Tag files based on number of 5-digit numbers |
| Automatic execution | Enable |
| Filters | Anything |
| Classifier | Default |
| Classifier patterns | Match RegEx [[0-9]{5}] |
| Classifier condition | Number of matches [is greater than 2] |
| Match action | Set [ID.found] to high |
| Non-match action | Set [ID.found] to low |
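The threshold logic of this rule can be sketched in a few lines of Python. This is an illustration only: the function name is hypothetical, and it assumes matches are counted the way Python's `re.findall` counts them (non-overlapping, left to right), which may not be exactly how the product counts.

```python
import re

FIVE_DIGITS = re.compile(r"[0-9]{5}")


def id_likelihood(text: str) -> str:
    """Return the value the rule would set for ID.found: 'high' or 'low'."""
    match_count = len(FIVE_DIGITS.findall(text))
    # Classifier condition: number of matches is greater than 2
    return "high" if match_count > 2 else "low"
```

Note that the pattern matches any five consecutive digits, so under this counting a single ten-digit number contributes two matches. If that is not what you want, a stricter expression may be needed.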
Identifying files with a phrase that is the same or similar to "Confidential - for internal use only"
| Rule name | Tag files with confidentiality phrases |
| Automatic execution | Enable |
| Filters | Anything |
| Classifier | Solr Standard Query |
| Classifier patterns | Match Standard Query ["Confidential - for internal use only"~4] |
| Classifier condition | Number of matches [is greater than 0] |
| Match action | Set [IncludesText.Confidential] to yes |
| Non-match action | Set [IncludesText.Confidential] to no |
Identifying files with a word that matches or is one letter different from "Confidential"
| Rule name | Tag files with words spelled similarly to confidential |
| Automatic execution | Enable |
| Filters | Anything |
| Classifier | Solr Standard Query |
| Classifier patterns | Match Standard Query [Confidential~1] |
| Classifier condition | Number of matches [is greater than 0] |
| Match action | Set [Spelling.similar] to yes |
| Non-match action | Set [Spelling.similar] to no |
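To get a feel for what "one letter different" means, the following Python sketch finds words within one edit (one inserted, deleted, or replaced letter) of "confidential". It is a rough approximation rather than Solr's algorithm: the function names are hypothetical, the comparison is lowercased by assumption, and Solr's own fuzzy matching may count some differences differently (for example, a swap of two adjacent letters).

```python
import re


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum single-letter insertions, deletions, or substitutions."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,          # delete from a
                               current[j - 1] + 1,       # insert into a
                               previous[j - 1] + cost))  # substitute or keep
        previous = current
    return previous[-1]


def similar_words(text: str, target: str = "confidential", max_edits: int = 1) -> list:
    """Return the words in text that are within max_edits of target."""
    words = re.findall(r"[A-Za-z]+", text)
    return [w for w in words if edit_distance(w.lower(), target) <= max_edits]
```

For example, `similar_words("Confidental draft, handled confidentially")` returns `["Confidental"]`: the misspelling is one letter away, while "confidentially" is two letters away and is not returned.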
Identifying files with the word "classified" and not the word "declassified"
| Rule name | Tag classified files |
| Automatic execution | Enable |
| Filters | Anything |
| Classifier | Solr Standard Query |
| Classifier patterns | Match Standard Query ["CLASSIFIED" NOT "DECLASSIFIED"] |
| Classifier condition | Number of matches [is greater than 0] |
| Match action | Set [Classified.found] to yes |
| Non-match action | Set [Classified.found] to no |
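The intent of this query can be approximated in Python as a whole-word check. The sketch below is an assumption about the behavior rather than a description of Solr's internals: the function name is hypothetical, and it assumes the text is split into lowercase words so that "declassified" is treated as a different word from "classified" rather than as text containing it.

```python
import re


def is_classified(text: str) -> bool:
    """True if the text has the word 'classified' and lacks the word 'declassified'."""
    words = set(re.findall(r"[a-z]+", text.lower()))
    return "classified" in words and "declassified" not in words
```

Under this reading, a file that contains both words is not a match, because the NOT clause excludes any file in which "declassified" appears.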
Identifying files marked for blocking by ICAP-DLP
With the ICAP-DLP classifier, the content is checked by ICAP-DLP, which tags the file only if it is sensitive. A file tagged by ICAP-DLP as sensitive is therefore a match, and the following rule sets File.allowed to false, indicating that the file is not allowed to be downloaded, uploaded, or shared.
| Rule name | Identifying files flagged by ICAP-DLP |
| Automatic execution | Enable |
| Filters | Anything |
| Classifier | ICAP-DLP |
| Classifier condition | Number of matches [is greater than 0] |
| Match action | Set [File.allowed] to false |
| Non-match action | Set [File.allowed] to true |
Identifying files with names or addresses (AI Classifier example)
| Rule name | Tag files with names or addresses |
| Automatic execution | Enable |
| Filters | Anything |
| Classifier | AI Classifier |
| Classifier patterns | Match instances of [people names] |
| Classifier condition | Number of matches [is greater than 0] |
| Match action | Set [Personal Info.found] to yes |
| Non-match action | Set [Personal Info.found] to no |
Identifying files with company names (AI Classifier example)
| Rule name | Identify files with company names |
| Automatic execution | Enable |
| Filters | Anything |
| Classifier | AI Classifier |
| Classifier patterns | Match instances of [company names] |
| Classifier condition | Number of matches [is greater than 0] |
| Match action | Set [CompanyName.detected] to yes |
| Non-match action | Set [CompanyName.detected] to no |
Identifying files with contact information (AI Classifier example)
| Rule name | Identify files with contact information |
| Automatic execution | Enable |
| Filters | Anything |
| Classifier | AI Classifier |
| Classifier patterns | Match instances of [phone numbers] |
| Classifier condition | Number of matches [is greater than 0] |
| Match action | Set [ContactInfo.detected] to yes |
| Non-match action | Set [ContactInfo.detected] to no |