Skip to main content

Dropbox

The Dropbox source configuration screen allows you to enable the crawling and classification of content stored in Dropbox cloud storage.

IMPORTANT! Make sure you created App for Dropbox crawling before start adding the source. See Configure Dropbox for Crawling for more information.

source_dropbox_thumb_0_0

Complete the following fields:

OptionDescription
Access TokenProvide the token you generated when authorizing your App. See To authorize your app for more information.
Access Shared Team Folder as:If you have team folders in your Dropbox namespace and you want to crawl one or more of these folders with Netwrix Data Classification, provide the email address of an account that has access to the desired team folders. For personal folders, this option is not required. For more information, refer to Dropbox Namespace guide.
OCR Processing modeSelect documents' images processing mode: - Disabled – documents' images will not be processed. - Default – defaults to the source settings if configuring a path or the global setting if configured on a source. - Normal – images are processed with normal quality settings. - Enhanced – upscale images further to allow more.
Advanced SettingsClick the "wrench" icon in the Settings area (gdrive_advanced_settings) at the bottom of the screen to expand the following advanced settings: - Re-Index Period — specifies how often the source should be checked for changes. The number specifies the period in days. - Priority — specifies the priority of content source processing in the service queues. - Document Type — can be used to specify a value which can be used to restrict queries when utilizing the core search index.
Source GroupNetwrix recommends creating a dedicated source group for Dropbox.
Pause source on creationSelect if you want to make other configuration changes before collection of the source occurs.

Dropbox

This section contains information on how to exclude Dropbox files or folders from being crawled.

Configure Exclusions

You can define the list of file locations that will be ignored when indexing files in Dropbox, i.e. excluded from processing. All Dropbox documents start with https://www.dropbox.com/home/ followed by the full path to the file. So, to exclude specific document, enter the full document URL. You can exclude all pages within a folder - for that, use a wildcard indicator (e.g. “https://www.dropbox.com/home/Test Folder/*”).

To configure exclusions, do the following:

  1. In the management console, click Content Configuration → SourcesDropbox, then in the left pane click Collection Exclusions.

  2. Click Add.

    boxexclusions_thumb_0_0

  3. Click Filter and in the Filter field specify the objects (files or folders) to exclude:

    To exclude a certain file, enter https://www.dropbox.com/home/<full_path_to_file>. For example: https://www.dropbox.com/home/Reports__/Test Document.docx

    Wildcards can be used anywhere in the exclusion pattern definition as follows:

    • The asterisk character (*) matching any sequence of characters
    • The question mark character (?) matching any single character

    For example:

NOTE: Exclusions are case-insensitive.

  1. Optionally, enter a test path to verify the settings and click Test.

    If needed, you can use metadata conditions to restrict when an exclusion filter should be applied. For that, click Condition tab and click Add. Then select how the exclusion conditions will work: it can check if metadata field of the document has any value, is not specified, or matches a specific metadata value.

    CriteriaCondition
    ComparisonCompare a value in the document metadata field with the value set by condition. With this criteria selected, you will then need to specify: - Field name — document metadata field to check - Comparison — operator to use (for example, "does not contain") - Value — value to compare against For example, to exclude documents tagged with year 2018, set the condition as follows: - Field NameDocYear - Comparisonequals - Value2018
    Has any valueExclude the document if its metadata field has any value. With this criteria selected, specify Field Name.
    Has no valuesExclude the document if metadata field value is not specified. With this criteria selected, specify Field Name.

    When finished, click Add.

  2. Finally, click Save and close the window.