Content |
[DataMemberAttribute] public UnsupportedFilteringSettings UnsupportedFiltering { get; set; }
Filtering unsupported/unknown file formats uses a proprietary binary-to-text filtering algorithm that attempts to extract as much UTF8, UTF-16LE (Latin languages only), and code page 1252 encoded text from the document's binary. In many cases, useful text for indexing or searching can be extracted from unknown/corrupted/unsupported file formats using binary-to-text filtering (see Unsupported).