Skip to main content

Supported Content Types

The table below lists types of content and their default extensions supported out of the box.

NOTE: To review the full list of available content types, navigate to Config → Text Processing → Content Type Extraction Methods.

Default extensionContent type
.aiffAIFF
.bmpBitmap
.chmCompiled HTML
.docWord
.docxWord Xml
.dwgCAD
.emlExchange Mail
.flvFLV
.htmlHTML
.javaJava Source
.jpgJPEG
.mppProject
.msgMessage
.pdfPDF
.webppng
.pptPowerpoint
.pptxPowerpoint Xml
.pubPublisher
.rarArchive
.rtfRich Text
.tiffTiff
.tmpUnknown
.txtText
.vsdVisio
.vtlDictionary / VTL
.wavWAV
.wpWord Perfect
.xlsExcel
.xlsxExcel Xml
.xmlXML
.zipArchive
.7zArchive

Supported Data Sources

The table below lists systems that can be crawled with Netwrix Data Classification:

Data SourceSupported Versions
File System- CIFS/SMB (Preferred) - NFS
SharePoint, SharePoint Online, OneDrive for Business- 2010 and above
Database- Microsoft SQL Server 2008 and above - MySQL 5.0.2 and above - Oracle 10g and above - PostgreSQL 7.4 and above
Box- Enterprise - Business / Business Plus - Starter
Dropbox- Business
Exchange- Exchange Server 2010 and above - Exchange Online NOTE: Automatic detection, crawling and classification of multiple Exchange mailboxes from the same Exchange server (and, respectively, Exchange Server content source configuration in the NDC web console) is only supported for Exchange Server 2013 or later due to limitations in the Microsoft APIs. For earlier versions, consider using Exchange Mailbox content source.
Google Drive- N/A
Outlook Mail Archive- Outlook 2010 and above