dtSearch Text Retrieval Engine .NET interface

UnicodeFilterFlags Enumeration

Values for Options.UnicodeFilterFlags

This enumeration has a FlagsAttribute attribute that allows a bitwise combination of its member values.

public enum UnicodeFilterFlags

Members

Member Name Description Value
dtsoUfFilterAllDocs Ignore file format information and apply Unicode Filtering to all documents. 128
dtsoUfFilterFailedDocs When a document cannot be indexed due to file corruption or encryption, apply the filtering algorithm to extract text from the file. 64
dtsoUfAutoWordBreakOverlapWords When a word break is automatically inserted due to dtsoUfAutoWordBreakByLength, overlap the two words generated by the word break. 32
dtsoUfAutoWordBreakOnDigit Automatically insert a word break when a digit follows letters. 16
dtsoUfAutoWordBreakByCase Automatically insert a word break when a capital letter appears following lower-case letters. 8
dtsoUfAutoWordBreakByLength Automatically insert a word break in long sequences of letters. A word break will be inserted when the word length reaches Options.MaxWordLength. 4
dtsoUfOverlapBlocks Overlapping blocks prevents text that crosses a block boundary from being missed in the filtering process. With overlapping enabled, each block extends 256 characters past the start of the previous block. 2
dtsoUfExtractAsHtml Extracting blocks as HTML has no effect on the text that is extracted, but it adds additional information in HTML comments to each extracted block. The HTML comments identify the starting byte offset and encoding of each piece of text extracted from a file. 1

Requirements

Namespace: dtSearch.Engine

Assembly: dtSearchNetApi (in dtSearchNetApi.dll)

See Also

dtSearch.Engine Namespace