TextSourceType Enumeration

Extracted document text method.

Definition

Namespace: OpenDiscoverSDK.Interfaces.Content
Assembly: OpenDiscoverSDK.Interfaces (in OpenDiscoverSDK.Interfaces.dll) Version: 2026.2.6.0 (2026.02.06)
C#
[DataContractAttribute]
public enum TextSourceType

Members

None1 No text was extracted.
Extraction2 Text extracted through parsing document format.
ExtractionFallback3 Text extracted though primary document format parsing failed; but a possibly less reliable fallback parsing method was used.
BinaryToText4 Text was extracted through a proprietary binary-to-text filtering algorithm that attempts to extract useful text from the document's raw file bytes.
OCR5 RESERVED - Not set by SDK. All document text was extracted through optical character recognition (OCR) on image or scanned document image.
PartialOCR6 RESERVED - Not set by SDK. Part of the document's text was extracted through optical character recognition (OCR) on page level generated image or scanned document image.
ExtractionUserLimited7

Text extracted through parsing document format but amount of text was limited by user. For cases like database tables, which can potentially have 10's of millions of rows, the user can choose to limit the amount of database table rows outputted.

See Also