You are here: C++ API > Enumerations > BinaryFilesSettings Enumeration
Close
dtSearch Text Retrieval Engine Programmer's Reference
BinaryFilesSettings Enumeration

File: dtsearch.h

Syntax
C++
enum BinaryFilesSettings { dtsoFilterBinary = 1, dtsoIndexBinary = 2, dtsoIndexSkipBinary = 3, dtsoFilterBinaryUnicode = 4, dtsoIndexBinaryNoContent = 5 };
Members
Description
dtsoFilterBinary = 1
Filter text from binary files using the character array in binaryFilterTextChars to determine which characters are text. This option is not recommended. Use dtsoFilterBinaryUnicode instead for more effective text extraction from binary data.
dtsoIndexBinary = 2
Index all contents of binary files as single-byte text. This option is not recommended. Use dtsoFilterBinaryUnicode instead for more effective text extraction from binary data.
dtsoIndexSkipBinary = 3
Do not index binary files
dtsoFilterBinaryUnicode = 4
Filter text from binary files using a text extraction algorithm that scans for sequences of single-byte, UTF-8, or Unicode text in the input. This option is recommended for working with forensic data, particularly when searching for non-English text.
dtsoIndexBinaryNoContent = 5
Index binary files disregarding all content within the file. Only the filename and any fields supplied externally to the file, such as in DocFields in the DataSource API, will be indexed.

Values for dtsOptions.binaryFiles (C++), Options.BinaryFiles (.NET), and Options.setBinaryFiles (Java). See Filtering Options.

Copyright (c) 1995-2021 dtSearch Corp. All rights reserved.