IUnsupported ExtractorExtract Content Method
Extracts document text using a proprietary binary-to-text extractor.
Definition
Namespace: OpenDiscoverSDK.Interfaces.Extractors
Assembly: OpenDiscoverSDK.Interfaces (in OpenDiscoverSDK.Interfaces.dll) Version: 2026.2.6.0 (2026.02.06)
Returns a DocumentContent object that contains the extracted text.
Assembly: OpenDiscoverSDK.Interfaces (in OpenDiscoverSDK.Interfaces.dll) Version: 2026.2.6.0 (2026.02.06)
C#
DocumentContent ExtractContent()Return Value
DocumentContentReturns a DocumentContent object that contains the extracted text.
Remarks
This method extracts useful text (if any) from binary via a proprietary binary-to-text algorithm. The algorithm extracts
text for UTF-16 (latin unicode range), UTF-8, and code page 1252 encodings.