Click or drag to resize

PdfPageInfo Class

Information on a PDF page that failed content extraction.
Inheritance Hierarchy
SystemObject
  OpenDiscoverSDK.Interfaces.ContentPdfPageInfo

Namespace: OpenDiscoverSDK.Interfaces.Content
Assembly: OpenDiscoverSDK.Interfaces (in OpenDiscoverSDK.Interfaces.dll) Version: 2025.4.4.0 (2025.4.4)
Syntax
C#
[DataContractAttribute]
public class PdfPageInfo

The PdfPageInfo type exposes the following members.

Constructors
Properties
 NameDescription
Public propertyContentLength Length of page's content, in bytes.
Public propertyExceptionMessage The PDF page exception message (if any).
Public propertyFailedDueToException If true, the PDF page failed due to an exception while processing; false means the PDF page failed due to the PageExtractedTextCriteria criteria.
Public propertyHasImages True, if the PDF page has images; false otherwise.
Public propertyImageCount Number of images on the PDF page.
Public propertyNumTextCharsExtracted The number of text characters extracted for the failed PDF page.
Public propertyPageNumber PDF page number.
Top
Methods
 NameDescription
Public methodEqualsDetermines whether the specified object is equal to the current object.
(Inherited from Object)
Public methodGetHashCodeServes as the default hash function.
(Inherited from Object)
Public methodGetTypeGets the Type of the current instance.
(Inherited from Object)
Public methodToStringReturns a string that represents the current object.
(Inherited from Object)
Top
Remarks

The information contained in PdfPageInfo can help a developer to decide whether or not to OCR a failed page. For example, if page failed due to PageExtractedTextCriteria criteria (i.e., FailedDueToException is false) and there are one of more images on the page, then this page may be a candidate for OCR.

See Also