Smart Classification Examples

The following examples use custom metadata fields, which must be created before you create the Smart Classification rule. A Smart Classification rule cannot be saved unless you specify which metadata field it sets.

Identifying files less than 5 MB containing US social security numbers 

Rule name

Tag files <5 MB with US social security numbers

Automatic execution

Enable

Filters

File size is less than [5 MB]

Classifier

Default

Classifier patterns

Match pattern by name [U. S. Social Security Number (SSN)]

Classifier condition

Number of matches is greater than [0]

Match action

Set metadata [SSN.found] to yes

Non-match action

Set metadata [SSN.found] to no
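
The rule above can be read as a small decision procedure: apply the filter, count pattern matches, then set one of two metadata values. The Python sketch below only illustrates that flow; it is not FileCloud code. The function name `classify_ssn`, the simplified SSN regex, the 1 MB = 1024 * 1024 bytes conversion, and the choice to return no metadata for filtered-out files are all assumptions made for illustration. The actual built-in pattern is not shown in this document.

```python
import re

# Hypothetical, simplified stand-in for the built-in SSN pattern (assumption).
SSN_PATTERN = re.compile(r"\b[0-9]{3}-[0-9]{2}-[0-9]{4}\b")

# "File size is less than [5 MB]"; assumes 1 MB = 1024 * 1024 bytes.
MAX_SIZE_BYTES = 5 * 1024 * 1024


def classify_ssn(text: str, size_bytes: int) -> dict:
    """Return the metadata this rule would set, or {} if the filter excludes the file."""
    if size_bytes >= MAX_SIZE_BYTES:
        return {}  # filter not satisfied: the rule is assumed not to touch the file
    match_count = len(SSN_PATTERN.findall(text))
    # Classifier condition: number of matches is greater than 0.
    return {"SSN.found": "yes" if match_count > 0 else "no"}
```

For example, `classify_ssn("SSN: 123-45-6789", 2048)` yields the match action, while the same text in a 6 MB file is skipped by the filter.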


Identifying files with extensions .txt and .pdf containing US social security numbers 

Rule name

Tag txt and pdf files with US social security numbers

Automatic execution

Enable

Filters

File extension is equal to [txt] OR File extension is equal to [pdf]
Note: Click AND to change it to OR.

Classifier

Default

Classifier patterns

Match pattern by name [U. S. Social Security Number (SSN)]

Classifier condition

Number of matches is greater than [0]

Match action

Set metadata [SSN.found] to yes

Non-match action

Set metadata [SSN.found] to no
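
The filter in this rule has to use OR because a file has only one extension, so an AND of two different extensions could never be true. A minimal Python sketch of the same check follows; the helper name `passes_extension_filter` and the case-insensitive comparison are assumptions, not documented FileCloud behavior.

```python
from pathlib import PurePosixPath

# Extensions from the filter: [txt] OR [pdf].
ALLOWED_EXTENSIONS = {"txt", "pdf"}


def passes_extension_filter(file_name: str) -> bool:
    """True if the file extension equals txt OR pdf (compared case-insensitively here)."""
    extension = PurePosixPath(file_name).suffix.lstrip(".").lower()
    return extension in ALLOWED_EXTENSIONS
```

Files without an extension, such as `README`, do not pass this filter in the sketch.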


Identifying all files containing US social security numbers 

Rule name

Tag all files with US social security numbers

Automatic execution

Enable

Filters

Anything

Classifier

Default

Classifier patterns

Match pattern by name [U. S. Social Security Number (SSN)]

Classifier condition

Number of matches is greater than [0]

Match action

Set metadata [SSN.found] to yes

Non-match action

Set metadata [SSN.found] to no


Identifying files in the Team Folder HumanResources containing US social security numbers

Rule name

Tag all Human Resources files containing US social security numbers

Automatic execution

Enable

Filters

File path starts with [TeamFolderAdmin/HumanResources]

Note: See Identifying a FileCloud Specific Path for help writing FileCloud folder paths.

Classifier

Default

Classifier patterns

Match pattern by name [U. S. Social Security Number (SSN)]

Classifier condition

Number of matches [is greater than 0]

Match action

Set metadata [SSN.found] to yes

Non-match action

Set metadata [SSN.found] to no

Identifying files containing any pattern in the custom pattern group France ID numbers

Rule name

France ID numbers

Automatic execution

Enable

Filters

Anything

Classifier

Default

Classifier patterns

Match pattern by group [France ID numbers]

Classifier condition

Number of matches [is greater than 0]

Match action

Set metadata [ID.found] to yes

Non-match action

Set metadata [ID.found] to no

Identifying files with Singapore National Registry Identity Card (NRIC)

Rule name

Tag files with Singapore NRIC

Automatic execution

Enable

Filters

Anything

Classifier

Singapore NRIC

Classifier condition

Number of matches [is greater than 0]

Match action

Set metadata [ID.found] to yes

Non-match action

Set metadata [ID.found] to no


Identifying files with patterns matching American Express credit cards

Rule name

Tag files with American Express card numbers

Automatic execution

Enable

Filters

Anything

Classifier

Default

Classifier patterns

Match RegEx [3[47]{1}[0-9]{13}]
OR
Match RegEx [3[47]{1}[0-9]{2}-[0-9]{6}-[0-9]{5}]
OR
Match RegEx [3[47]{1}[0-9]{2} [0-9]{4} [0-9]{4} [0-9]{3}]

Note: Smart Classification automatically inserts OR when you add multiple Classifier patterns.

Classifier condition

Number of matches [is greater than 0]

Match action

Set metadata [ID.found] to yes

Non-match action

Set metadata [ID.found] to no
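
Before saving a rule like this, it can help to try the three expressions against sample text. The sketch below does that with Python's `re` module, which may differ in details from the engine Smart Classification uses. The helper name `count_amex_matches` is hypothetical, and the sample is a widely published American Express test number, not a real card.

```python
import re

# The three classifier patterns from the rule, combined with OR.
AMEX_PATTERNS = [
    r"3[47]{1}[0-9]{13}",                            # 15 digits, no separators
    r"3[47]{1}[0-9]{2}-[0-9]{6}-[0-9]{5}",           # 4-6-5 grouping with hyphens
    r"3[47]{1}[0-9]{2} [0-9]{4} [0-9]{4} [0-9]{3}",  # 4-4-4-3 grouping with spaces
]


def count_amex_matches(text: str) -> int:
    """Total number of matches across all three patterns."""
    return sum(len(re.findall(pattern, text)) for pattern in AMEX_PATTERNS)


for sample in ("378282246310005", "3782-822463-10005", "3782 8224 6310 005"):
    print(sample, count_amex_matches(sample) > 0)
```

Because the patterns are not anchored, Python's `re` also finds a match inside a longer run of digits, such as `99378282246310005`. Whether Smart Classification behaves the same way is not stated here, so treat that as something to verify.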


Identifying files with the exact phrase "Confidential - for internal use only"

Rule name

Tag files marked as confidential

Automatic execution

Enable

Filters

Anything

Classifier

Default

Classifier patterns

Match RegEx [Confidential - for internal use only]

Classifier condition

Number of matches [is greater than 0]

Match action

Set [IncludesText.Confidential] to yes

Non-match action

Set [IncludesText.Confidential] to no
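
The phrase in this rule contains no regular-expression metacharacters, so it matches only that exact text. The Python sketch below shows what that means in practice; the helper name `tag_confidential` is hypothetical, and the case-sensitive behavior shown is that of Python's `re` module, which is an assumption with respect to FileCloud.

```python
import re

# The classifier pattern from the rule, used as a literal phrase.
PHRASE = re.compile(r"Confidential - for internal use only")


def tag_confidential(text: str) -> dict:
    """Return the metadata value the rule would set for this text."""
    found = PHRASE.search(text) is not None
    return {"IncludesText.Confidential": "yes" if found else "no"}
```

In this sketch, a lowercase "confidential" or a typographic dash in place of the hyphen prevents the match, which is one reason to consider the similar-phrase rule later in this section.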


Mark files with different tags depending on the number of matches

In this rule, a file with 0-2 five-digit numbers is marked as having a low possibility of containing personal ID information, and a file with more than 2 five-digit numbers is marked as having a high possibility. This lets you handle the two groups differently. For example, you might manually review files with a low possibility, but automatically block files with a high possibility.

Rule name

Tag files based on number of 5-digit numbers

Automatic execution

Enable

Filters

Anything

Classifier

Default

Classifier patterns

Match RegEx [[0-9]{5}]

Classifier condition

Number of matches [is greater than 2]

Match action

Set [ID.found] to high

Non-match action

Set [ID.found] to low
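
The threshold logic of this rule can be sketched in a few lines of Python. This is an illustration under assumptions, not FileCloud's implementation: the helper name `id_likelihood` is hypothetical, and match counting follows Python's `re.findall`, which returns non-overlapping matches.

```python
import re

# The classifier pattern from the rule.
FIVE_DIGITS = re.compile(r"[0-9]{5}")


def id_likelihood(text: str) -> str:
    """Return the value the rule would write to ID.found."""
    match_count = len(FIVE_DIGITS.findall(text))
    # Classifier condition: number of matches is greater than 2.
    return "high" if match_count > 2 else "low"
```

Note that the pattern has no word boundaries, so in this sketch a single 15-digit number counts as three matches and is tagged high on its own. If that is not what you want, a bounded pattern such as `\b[0-9]{5}\b` is one option to test.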


Identifying files with a phrase that is the same or similar to "Confidential - for internal use only"

Rule name

Tag files with confidentiality phrases

Automatic execution

Enable

Filters

Anything

Classifier

Solr Standard Query

Classifier patterns

Match Standard Query ["Confidential - for internal use only"~4]
(include "" around phrase)

Note: ~4 indicates that all words in the phrase must appear, but they may be within 4 words of each other rather than adjacent. For example, "Confidential - use for internal only" would be a match.

Classifier condition

Number of matches [is greater than 0]

Match action

Set [IncludesText.Confidential] to yes

Non-match action

Set [IncludesText.Confidential] to no


Identifying files with a word that matches or is one letter different from "Confidential"

Rule name

Tag files with words spelled similarly to confidential

Automatic execution

Enable

Filters

Anything

Classifier

Solr Standard Query

Classifier patterns

Match Standard Query [Confidential~1]
(do not include "" around word)

Note: ~1 indicates that the spelling may differ by 1 letter. For example, "Confidental" and "Confidentials" would match, but "Confidentail" would not.

Classifier condition

Number of matches [is greater than 0]

Match action

Set [Spelling.similar] to yes

Non-match action

Set [Spelling.similar] to no


Identifying files with the word "classified" and not the word "declassified"

Rule name

Tag classified files

Automatic execution

Enable

Filters

Anything

Classifier

Solr Standard Query

Classifier patterns

Match Standard Query ["CLASSIFIED" NOT "DECLASSIFIED"]

Classifier condition

Number of matches [is greater than 0]

Match action

Set [Classified.found] to yes

Non-match action

Set [Classified.found] to no
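
The query in this rule works on whole terms, which is why it can tell "classified" apart from "declassified". The Python sketch below approximates that behavior to make the logic concrete. It does not call Solr: the helper name `matches_classified_not_declassified` is hypothetical, and the lowercase, split-on-non-alphanumerics tokenization is an assumption about how the indexed text is analyzed.

```python
import re


def matches_classified_not_declassified(text: str) -> bool:
    """Approximate "CLASSIFIED" NOT "DECLASSIFIED" using whole-word terms."""
    # Assumed analysis: lowercase the text and split on non-alphanumeric characters.
    terms = set(re.findall(r"[a-z0-9]+", text.lower()))
    return "classified" in terms and "declassified" not in terms
```

A plain substring or regex search for "classified" would also hit "declassified" and "unclassified"; in this sketch neither counts as the term "classified", and a file containing both words is excluded by the NOT clause.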


Identifying files marked for blocking by ICAP-DLP

With the ICAP-DLP classifier, the pattern check is performed by ICAP-DLP, which tags a file only if it is sensitive. A file tagged by ICAP-DLP as sensitive therefore counts as a match, and the following rule sets File.allowed to false, indicating that the file may not be downloaded, uploaded, or shared.

Rule name

Identifying files flagged by ICAP-DLP

Automatic execution

Enable

Filters

Anything

Classifier

ICAP-DLP

Classifier condition

Number of matches [is greater than 0]

Match action

Set [File.allowed] to false

Non-match action

Set [File.allowed] to true


Identifying files with names or addresses (AI Classifier example)

Rule name

Tag files with names or addresses

Automatic execution

Enable

Filters

Anything

Classifier

AI Classifier

Classifier patterns

Match instances of [people names]
OR
Match instances of [addresses]

Classifier condition

Number of matches [is greater than 0]

Match action

Set [Personal Info.found] to yes

Non-match action

Set [Personal Info.found] to no


Identifying files with company names (AI Classifier example)

Rule name

Identify files with company names

Automatic execution

Enable

Filters

Anything

Classifier

AI Classifier

Classifier patterns

Match instances of [company names]

Classifier condition

Number of matches [is greater than 0]

Match action

Set [CompanyName.detected] to yes

Non-match action

Set [CompanyName.detected] to no


Identifying files with contact information (AI Classifier example)

Rule name

Identify files with contact information

Automatic execution

Enable

Filters

Anything

Classifier

AI Classifier

Classifier patterns

Match instances of [phone numbers]
OR
Match instances of [email addresses]

Classifier condition

Number of matches [is greater than 0]

Match action

Set [ContactInfo.detected] to yes

Non-match action

Set [ContactInfo.detected] to no