TextSourceType Enumeration

Specifies the method by which a document's text was extracted.

Namespace: OpenDiscoverSDK.Interfaces.Content
Assembly: OpenDiscoverSDK.Interfaces (in OpenDiscoverSDK.Interfaces.dll) Version: 2025.4.4.0 (2025.4.4)
Syntax
C#
[DataContractAttribute]
public enum TextSourceType
Members
Member name | Value | Description
None | 1 | No text was extracted.
Extraction | 2 | Text was extracted by parsing the document format.
ExtractionFallback | 3 | Primary document format parsing failed, so text was extracted with a possibly less reliable fallback parsing method.
BinaryToText | 4 | Text was extracted through a proprietary binary-to-text filtering algorithm that attempts to extract useful text from the document's raw file bytes.
OCR | 5 | RESERVED - Not set by SDK. All document text was extracted through optical character recognition (OCR) on an image or scanned document image.
PartialOCR | 6 | RESERVED - Not set by SDK. Part of the document's text was extracted through optical character recognition (OCR) on page-level generated images or scanned document images.
ExtractionUserLimited | 7 | Text was extracted by parsing the document format, but the amount of text was limited by the user. For cases such as database tables, which can have tens of millions of rows, the user can choose to limit the number of table rows that are output.
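
The following is a minimal sketch of how calling code might branch on these members. It assumes nothing about the rest of the SDK: the enum is redeclared locally, with names and values copied from the table above, so that the example compiles on its own. The `Triage` helper and its category strings are hypothetical and illustrate one possible policy, not SDK behavior. Treating `BinaryToText` output as needing review is likewise an assumption of this sketch.

```csharp
using System;

// Local stand-in for OpenDiscoverSDK.Interfaces.Content.TextSourceType so this
// sketch compiles without the SDK. Names and values are copied from the
// members table; in real code, reference the SDK's own enum instead.
public enum TextSourceType
{
    None = 1,
    Extraction = 2,
    ExtractionFallback = 3,
    BinaryToText = 4,
    OCR = 5,
    PartialOCR = 6,
    ExtractionUserLimited = 7
}

public static class TextSourceTypeExample
{
    // Hypothetical helper (not part of the SDK): maps each member to an
    // illustrative handling category for a downstream indexing pipeline.
    public static string Triage(TextSourceType source)
    {
        switch (source)
        {
            case TextSourceType.None:
                return "skip";       // nothing to index
            case TextSourceType.Extraction:
                return "index";      // normal format parsing succeeded
            case TextSourceType.ExtractionFallback:
            case TextSourceType.BinaryToText:
                return "review";     // assumption: flag possibly less reliable text
            case TextSourceType.ExtractionUserLimited:
                return "truncated";  // text is valid but intentionally incomplete
            case TextSourceType.OCR:
            case TextSourceType.PartialOCR:
                return "reserved";   // documented as not set by the SDK
            default:
                return "unknown";    // guards against members added in later versions
        }
    }

    public static void Main()
    {
        // Print every member with its numeric value and triage category.
        foreach (TextSourceType source in Enum.GetValues(typeof(TextSourceType)))
        {
            Console.WriteLine($"{source} ({(int)source}): {Triage(source)}");
        }
    }
}
```

The `default` branch is kept so that a value introduced by a later SDK version falls into an explicit "unknown" bucket rather than being silently treated as reliable text.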

See Also