Close
dtSearch Engine API for .NET Framework 2.x-4.x 2024.01
Options::UnicodeFilterRanges Property

Indicates Unicode ranges that are of interest when filtering.

public String UnicodeFilterRanges;

UnicodeFilterRanges indicates the Unicode subranges that the filtering algorithm should look for. For example, if UnicodeFilterRanges is set to 1 and 8, then the filtering algorithm will look for characters from U+0100-U+01FF and U+0800-U+08FF 

This is used to help the filtering algorithm to distinguish text from non-text data. It is only used as a hint in the algorithm, so if the text extraction algorithm detects text in another language with a sufficient level of confidence, it will return that text even if the language was not selected. 

In .NET and COM, UnicodeFilterRanges is a comma-separated list of integers, each from 0 to 255, indicating the Unicode subranges that the filtering algorithm should look for. Example: "1,8". Options