Links
dtSearch Text Retrieval Engine Programmer's Reference 7.70
Supported File Formats
File Parsers | Send Feedback

File format support included with the dtSearch Engine

Remarks

dtSearch can automatically recognize, index, and search the following formats:

    Adobe Acrobat (*.pdf)

    Adobe Framemaker MIF (*.mif)

    Ami Pro (*.sam)

    Ansi Text (*.txt)

    ASCII Text

    ASF media files (metadata only) (*.asf)

    CSV (Comma-separated values) (*.csv)

    DBF (*.dbf)

    EBCDIC

    EML files (emails saved by Outlook Express) (*.eml)

    Enhanced Metafile Format (*.emf)

    Eudora MBX message files (*.mbx)

    Flash (*.swf)

    GZIP (*.gz)

    HTML (*.htm, *.html)

    JPEG (*.jpg)

    Lotus 1-2-3 (*.123, *.wk?)

    MBOX email archives (including Thunderbird) (*.mbx)

    MHT archives (HTML archives saved by Internet Explorer) (*.mht)

    MIME messages

    MSG files (emails saved by Outlook) (*.msg)

    Microsoft Access MDB files (see note 1) (*.mdb, *.accdb, including Access 2007 and Access 2010)

    Microsoft Document Imaging (*.mdi)

    Microsoft Excel (*.xls)

    Microsoft Excel 2003 XML (*.xml)

    Microsoft Excel 2007 and 2010 (*.xlsx)

    Microsoft Outlook data files (*.PST) (added in version 7.67)

    Microsoft Outlook/Exchange Messages, Notes, Contacts, Appointments, and Tasks

    Microsoft Outlook Express 5 and 6 (*.dbx) message stores

    Microsoft PowerPoint (*.ppt)

    Microsoft PowerPoint 2007 and 2010 (*.pptx)

    Microsoft Rich Text Format (*.rtf)

    Microsoft Searchable Tiff (*.tiff)

    Microsoft Word for DOS (*.doc)

    Microsoft Word for Windows (*.doc)

    Microsoft Word 2003 XML (*.xml)

    Microsoft Word 2007 and 2010 (*.docx)

    Microsoft Works (*.wks)

    MP3 (metadata only) (*.mp3)

    Multimate Advantage II (*.dox)

    Multimate version 4 (*.doc)

    OpenOffice versions 1, 2, and 3 documents, spreadsheets, and presentations (*.sxc, *.sxd, *.sxi, *.sxw, *.sxg, *.stc, *.sti, *.stw, *.stm, *.odt, *.ott, *.odg, *.otg, *.odp, *.otp, *.ods, *.ots, *.odf) (includes OASIS Open Document Format for Office Applications)

    Quattro Pro (*.wb1, *.wb2, *.wb3, *.qpw)

    QuickTime (*.mov, *.m4a, *.m4v)

    RAR (*.rar) (see note 2)

    TAR (*.tar)

    TIFF (*.tif)

    TNEF (winmail.dat files)

    Treepad HJT files (*.hjt)

    Unicode (UCS16, Mac or Windows byte order, or UTF-8)

    Visio XML files (*.vdx)

    Windows Metafile Format (*.wmf)

    WMA media files (metadata only) (*.wma)

    WMV video files (metadata only) (*.wmv)

    WordPerfect 4.2 (*.wpd, *.wpf)

    WordPerfect (5.0 and later) (*.wpd, *.wpf)

    WordStar version 1, 2, 3 (*.ws)

    WordStar versions 4, 5, 6 (*.ws)

    WordStar 2000

    Write (*.wri)

    XBase (including FoxPro, dBase, and other XBase-compatible formats) (*.dbf)

    XML (*.xml)

    XML Paper Specification (*.xps)

    XSL

    XyWrite

    ZIP (*.zip)
Notes

[1] Databases. Each record in a database is treated as a separate document. Previous versions of dtSearch used ODBC to index Microsoft Access databases. Versions 7.54 and later have internal parsers for Access databases, so ODBC is no longer needed. For information on indexing SQL databases, see "Indexing Databases". 

 

[2] RAR. RAR support currently applies to the Windows version of the dtSearch Engine only.

Group
Links
You are here: Overviews > File Parsers > Supported File Formats
Copyright (c) 1995-2012 dtSearch Corp. All rights reserved.