Configure Content Search for Managed Storage

Administrators must configure FileCloud to use Solr before it can be used for advanced search.

  • Configure FileCloud with the URL of the Solr server and the port number.
  • Create an index to allow Solr to search the files in managed storage.

  1. Configure Solr in the Content Search tab.

    Follow these steps to configure FileCloud with Solr, installed in the previous section.

    1. Login into admin UI. Navigate to Settings -> Content Search.


      The content search details form has default values for each field.

    2. Update these values, depending on your environment.

      ParameterRemarks
      Content Search StatusThis is a read-only field that displays the status of FileCloud, Solr configuration.
      URLThe http url of the Solr server. Default value is http://127.0.0.1. If Solr is installed on a different server, use its IP address.
      Note: Do not use the port number as part of the URL. It will be entered in the next field.
      PortPort number of the Solr server. Default value is 8983. If Solr is running on a different port, update this value.
      App ContextThis is the application context under which the Solr server is hosted. Default value is solr.
      Config PrefixUnique prefix for the current sites configuration. Default value is fccore.
      Note: For multisite FileCloud editions, each site should have a unique prefix, if they use the same Solr server. 
    3. Click 'Configure' to configure FileCloud with Solr, using the entered information.
      If this fails, a message may appear that prompts you to copy a configuration directory from a source directory to a target directory.
      If the message appears, follow the instructions to copy the template folder. In the following example, the folder C:\xampp\htdocs\thirdparty\overrides\solarium\Solarium\fcskel should be copied to C:\work\solr\solr-5.3.1\server\solr\ and renamed fccore.
      Directories and target folder name for your environment might be different.


    4. If you are prompted to copy the configuration directory, after you copy it, click the 'Configure' button again.
      Upon successful configuration, a new field will appear showing the index status of all 'Managed Storage' files.


      If it is a fresh installation, there will not be any user files. All the new files will be indexed as they get uploaded.
      If it is an existing installation, there will be files already managed. Though the new files will be indexed, the existing files has to be indexed separately.
    5. This can be done by clicking on the 'Index' button.


      Upon successful indexing, the following status should be shown.

    Multi site Solr configuration is similar to the single site configuration. After a new site is created, repeat the above steps for single except in step 2, specify a unique prefix for the new site.
    This will configure Solr to create separate index database for each new site.

    Note:

    Use Unique Prefix for each site.

  2.   Index Files in Managed Storage

    Indexing Network Folders

    Unlike Managed Storage, network folder files exist outside of FileCloud and therefore changes occurring in the network folders might not be propagated into FileCloud.

    Monitoring such changes are important in the following scenarios:

    • Faster searching
    • Content Search
    • Automatic Realtime Syncing of Network Folders
    • Pattern Searches for GDPR

    For these scenarios, you must to index network folders and keep them indexed as files and folders change.

    • To index network folders, the FileCloud Helper service is required

    Indexing Network Folders



    After you set up content searching in the Content Search tab, it displays a new section is displayed called Manage PII Patterns. In the Manage PII Patterns section, you can:

    • Add new patterns

    • Edit existing patterns

    • Search for patterns
    • Remove patterns you don't need

    (lightbulb) The list of patterns you configure here is shown on the Admin portal search screen in the Advanced options dialog box as options to choose from.

     

    A pattern contains the following information:

    Regex is short for a regular expression. This is a special alphanumeric string used to describe a search pattern.

    If you need to create a new pattern or edit an existing one, use the following table to understand the Regex format used by FileCloud.


    NameInput TypeLength of InputRegEx Format
    DescriptionIdentifies the type of protected information
    • enclosed in square brackets [ ]
    • a number range
    • uppercase letter range
    • lowercase letter range
    • enclosed in curly brackets '{ '}
    • a number
    [Type]{Length}

    Numerical

    Example

    U.S. Social Security Number0-9 (can contain any number 0-9)

    3 (for the first number)

    2 (for the second number)

    4 (for the third number)

    [0-9]{3}-[0-9]{2}-[0-9]{4}

    012-34-5678

    Letter example

    Poland Passport

    a-zA-Z

    0-9

    2 letters (followed by)

    7 numbers

    [a-zA-Z]{2}[0-9]{7}

    ME0123456


    To enable and configure PII Search:

    1. Open a browser and log in to the Admin Portal.

    2. From the left navigation panel, click Settings.

    3. On the Manage Settings screen, click the Content Search tab.

    4. On the Content Search tab, after Solr Configuration, check the Enable PII Search checkbox.

    5. To add a new pattern to the list, in the Manage PII Patterns section, in the top right corner, click Add.
    6. To edit an existing pattern, in the Manage PII Patterns section, select the pattern and then click the edit icon ().
    7. To remove an existing pattern, in the Manage PII Patterns section, select the pattern and then click the delete icon ().
    8. Click Save.



    To search the site's content for PII patterns, see PII Search on the page Search in the Admin Portal.