File format support included with the dtSearch Engine.
dtSearch can automatically recognize, index, and search the following formats. While file types are detected for the most part by the binary contents of the file, the filename extension often provides useful information as well, especially to resolve ambiguities. Therefore, where possible the original filename extension should be preserved when passing document contents to the dtSearch Engine.
For more information on file format support, see "What file formats does dtSearch Support? (dtsearch.com)".
[1] Databases. Each record in a database is treated as a separate document. Previous versions of dtSearch used ODBC to index Microsoft Access databases. Versions 7.54 and later have internal parsers for Access databases, so ODBC is no longer needed. For information on indexing SQL databases, see "Indexing Databases".
[2] Outlook and Exchange. dtSearch Desktop/Network can index Outlook and Exchange message stores using MAPI. dtSearch versions 7.77 and later can also index Outlook PST and OST files directly, without using Outlook or MAPI.
[3] Web Sites. dtSearch products include a spider that can index and search dynamically-generated content or static content on web sites.
[4] RAR Support. RAR support currently applies to the Windows and Linux versions of dtSearch only.
[5] Attachments. In all supported email formats, attachments, including nested attachments (for example, a .doc instead a ZIP attached to an email) are indexed as part of the main document by default.
[6] PDF Support. Encrypted PDF files cannot be indexed, unless the PDF file can be opened without a password and the PDF file permissions allow for text extraction.
[7] dtSearch versions 7.92 and earlier support PDF 1.x. dtSearch version 7.93 adds support for the new PDF 2.0 standard.
[8] Office 365. Supported Microsoft Office formats are also supported when saved from Office 365.