TextSourceType Enumeration |
Extracted document text method.
Namespace: OpenDiscoverSDK.Interfaces.ContentAssembly: OpenDiscoverSDK.Interfaces (in OpenDiscoverSDK.Interfaces.dll) Version: 2025.4.4.0 (2025.4.4)
Syntax[DataContractAttribute]
public enum TextSourceType
Members| Member name | Value | Description |
|---|
| None | 1 |
No text was extracted.
|
| Extraction | 2 |
Text extracted through parsing document format.
|
| ExtractionFallback | 3 |
Text extracted though primary document format parsing failed; but a possibly less reliable fallback parsing method was used.
|
| BinaryToText | 4 |
Text was extracted through a proprietary binary-to-text filtering algorithm that attempts to extract useful text from the document's raw file bytes.
|
| OCR | 5 |
RESERVED - Not set by SDK. All document text was extracted through optical character recognition (OCR) on image or scanned document image.
|
| PartialOCR | 6 |
RESERVED - Not set by SDK. Part of the document's text was extracted through optical character recognition (OCR) on page level generated image or scanned document image.
|
| ExtractionUserLimited | 7 |
Text extracted through parsing document format but amount of text was limited by user. For cases like database tables, which can potentially have
10's of millions of rows, the user can choose to limit the amount of database table rows outputted.
|
See Also