Page tree
Skip to end of metadata
Go to start of metadata

Generally, CCE can be used for a number of classification purposes such as:

  • Identification of files containing certain patterns of textual content e.g. credit card numbers, employee numbers, and social security numbers
  • Identification of files containing exact phrases
  • Classification of files into different tiers of security, privacy, etc.
  • Setting default values for custom metadata or Color Tagging metadata
  • Classification of files based on word proximity
  • Classification of files based on word similarity
  • Classification of files based on a boolean combination of SOLR queries

Identifying text patterns.

E.g. identifying files with Amex credit card numbers:

{
    "classifier": "Default",
    "precondition": "true",
    "condition": "count(_classifications) > 0",
    "matchaction": {
        "PII": {
            "Credit Card": "Amex"
        }
    },
    "parameters": {
        "SEARCH_PATTERN_SET": [
            "3[47]{1}[0-9]{13}",
            "3[47]{1}[0-9]{2}-[0-9]{6}-[0-9]{5}",
            "3[47]{1}[0-9]{2} [0-9]{4} [0-9]{4} [0-9]{3}"
        ]
    }
}

Identifying exact phrases.

E.g. identifying files containing confidentiality phrases

{
    "classifier": "Default",
    "precondition": "true",
    "condition": "count(_classifications) > 0",
    "matchaction": {
        "PII": {
            "Confidentiality": "CONFIDENTIAL"
        }
    },
    "parameters": {
        "SEARCH_PATTERN_SET": [
            "CONFIDENTIAL - FOR CODELATHE PERSONNEL ONLY"
        ]
    }
}

Classification into tiers:

(1) Classifying files into different tiers of security using pattern occurrence

/* Tier 1 */
{
    "classifier": "Default",
    "precondition": "true",
    "condition": "count(_classifications) < 5 && count(_classifications) > 0",
    "matchaction": {
        "PII": {
            "Security": "MONITOR"
        }
    },
    "defaultaction": [],
    "parameters": {
        "SEARCH_PATTERN_SET": [
            "3[47]{1}[0-9]{13}",
            "3[47]{1}[0-9]{2}-[0-9]{6}-[0-9]{5}"
        ]
    }
}

/* Tier 2 */
{
    "classifier": "Default",
    "precondition": "true",
    "condition": "count(_classifications) >= 5",
    "matchaction": {
        "PII": {
            "Security": "RESTRICT"
        }
    },
    "parameters": {
        "SEARCH_PATTERN_SET": [
            "3[47]{1}[0-9]{13}",
            "3[47]{1}[0-9]{2}-[0-9]{6}-[0-9]{5}"
        ]
    }
}


(2) Classifying files into different tiers of security using pattern match

/* Tier 1 */
{
    "classifier": "PatternMatch",
    "precondition": "true",
    "condition": "count(_classifications) == 1",
    "matchaction": {
        "PII": {
            "Security 2": "MONITOR"
        }
    },
    "defaultaction": [],
    "parameters": {
        "SEARCH_PATTERN_SET": [
            "3[47]{1}[0-9]{13}",
            "3[47]{1}[0-9]{2}-[0-9]{6}-[0-9]{5}"
        ]
    }
}

/* Tier 2 */
{
    "classifier": "PatternMatch",
    "precondition": "true",
    "condition": "count(_classifications) == 2",
    "matchaction": {
        "PII": {
            "Security 2": "RESTRICT"
        }
    },
    "parameters": {
        "SEARCH_PATTERN_SET": [
            "3[47]{1}[0-9]{13}",
            "3[47]{1}[0-9]{2}-[0-9]{6}-[0-9]{5}"
        ]
    }
}


Setting default metadata.

CCE can set custom metadata parameters values for files. Beginning in FileCloud 21.1, CCE can also set color tag metadata values for files.

Ensure file has been classified

{
    "classifier": "Default",
    "precondition": "true",
    "condition": "count(_classifications) == 0",
    "matchaction": {
        "PII": {
            "Status": "Classified"
        }
    },
    "parameters": {
        "SEARCH_PATTERN_SET": [
            "3[47]{1}[0-9]{13}",
            "3[47]{1}[0-9]{2}-[0-9]{6}-[0-9]{5}"
        ]
    }
}


Classifying files based on word proximity (phrase similarity)

{
    "classifier": "StandardQuery",
    "precondition": "true",
    "condition": "count(_classifications) > 0",
    "matchaction": {
        "PII": {
            "Confidentiality %": 99.9
        }
    },
    "defaultaction": [],
    "parameters": {
        "STANDARD_QUERY_EXPRESSION": "\"CONFIDENTIAL - FOR CODELATHE PERSONNEL ONLY\"~2"
    }
}


Classifying files based on word similarity

{
    "classifier": "StandardQuery",
    "precondition": "true",
    "condition": "count(_classifications) > 0",
    "matchaction": {
        "PII": {
            "Confidentiality 2 %": 99.9
        }
    },
    "defaultaction": [],
    "parameters": {
        "STANDARD_QUERY_EXPRESSION": "CONFIDENTIAL~1"
    }
}


Classifying files based on a boolean combination of SOLR queries

{
    "classifier": "StandardQuery",
    "precondition": "true",
    "condition": "count(_classifications) > 0",
    "matchaction": {
        "PII": {
            "Confidentiality 3 %": 99.9
        }
    },
    "defaultaction": [],
    "parameters": {
        "STANDARD_QUERY_EXPRESSION": "\"CONFIDENTIAL\" NOT \"NOT CONFIDENTIAL\""
    }
}



  • No labels