File
File: dtsearch.h
Syntax
C++
enum BinaryFilesSettings {
    dtsoFilterBinary = 1,
    dtsoIndexBinary = 2,
    dtsoIndexSkipBinary = 3,
    dtsoFilterBinaryUnicode = 4,
    dtsoIndexBinaryNoContent = 5
};
Members
| Members | Description |
|---|---|
| dtsoFilterBinary = 1 | Filter text from binary files using the character array in binaryFilterTextChars to determine which characters are text. This option is not recommended. Use dtsoFilterBinaryUnicode instead for more effective text extraction from binary data. |
| dtsoIndexBinary = 2 | Index all contents of binary files as single-byte text. This option is not recommended. Use dtsoFilterBinaryUnicode instead for more effective text extraction from binary data. |
| dtsoIndexSkipBinary = 3 | Do not index binary files. |
| dtsoFilterBinaryUnicode = 4 | Filter text from binary files using a text extraction algorithm that scans for sequences of single-byte, UTF-8, or Unicode text in the input. This option is recommended for working with forensic data, particularly when searching for non-English text. |
| dtsoIndexBinaryNoContent = 5 | Index binary files disregarding all content within the file. Only the filename and any fields supplied externally to the file, such as in DocFields in the DataSource API, will be indexed. |
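The member descriptions above can be summarized as a few predicates. The following is a minimal sketch, not part of the dtSearch API: the enum is copied locally so the example compiles without dtsearch.h, and the helper names (addsBinaryFileToIndex, indexesBinaryContent, isDiscouraged) are hypothetical, introduced only to illustrate how the five values differ.

```cpp
// Local copy of the enum from dtsearch.h so this sketch compiles on its own.
enum BinaryFilesSettings {
    dtsoFilterBinary = 1,
    dtsoIndexBinary = 2,
    dtsoIndexSkipBinary = 3,
    dtsoFilterBinaryUnicode = 4,
    dtsoIndexBinaryNoContent = 5
};

// Hypothetical helper (assumption, not a dtSearch function): true when a
// binary file is added to the index at all, even if only by filename.
constexpr bool addsBinaryFileToIndex(BinaryFilesSettings s) {
    return s != dtsoIndexSkipBinary;
}

// Hypothetical helper: true when text from inside the binary file is
// indexed, either filtered or as raw single-byte text.
constexpr bool indexesBinaryContent(BinaryFilesSettings s) {
    return s == dtsoFilterBinary
        || s == dtsoIndexBinary
        || s == dtsoFilterBinaryUnicode;
}

// Hypothetical helper: true for the two values the table marks as
// "not recommended" in favor of dtsoFilterBinaryUnicode.
constexpr bool isDiscouraged(BinaryFilesSettings s) {
    return s == dtsoFilterBinary || s == dtsoIndexBinary;
}
```

Under these assumptions, dtsoFilterBinaryUnicode is the only value that extracts file content without being discouraged, and dtsoIndexBinaryNoContent is the only one that adds the file to the index while ignoring its content.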
Description
Values for dtsOptions.binaryFiles (C++), Options.BinaryFiles (.NET), and Options.setBinaryFiles (Java). See Filtering Options.