PdfPageInfo Class

Information on a PDF page that failed content extraction.

Inheritance Hierarchy

SystemObject
OpenDiscoverSDK.Interfaces.ContentPdfPageInfo

Namespace: OpenDiscoverSDK.Interfaces.Content
Assembly: OpenDiscoverSDK.Interfaces (in OpenDiscoverSDK.Interfaces.dll) Version: 2025.4.6.0 (2025.4.6)

Syntax

Copy

[DataContractAttribute]
public class PdfPageInfo

The PdfPageInfo type exposes the following members.

Constructors

	Name	Description
	PdfPageInfo(Int32, Boolean, String)	Constructor.
	PdfPageInfo(Int32, Int32, Int32, Boolean, Int32)	Constructor.

Top

Properties

	Name	Description
	ContentLength	Length of page's content, in bytes.
	ExceptionMessage	The PDF page exception message (if any).
	FailedDueToException	If true, the PDF page failed due to an exception while processing; false means the PDF page failed due to the PageExtractedTextCriteria criteria.
	HasImages	True, if the PDF page has images; false otherwise.
	ImageCount	Number of images on the PDF page.
	NumTextCharsExtracted	The number of text characters extracted for the failed PDF page.
	PageNumber	PDF page number.

Top

Methods

	Name	Description
	Equals	Determines whether the specified object is equal to the current object. (Inherited from Object)
	GetHashCode	Serves as the default hash function. (Inherited from Object)
	GetType	Gets the Type of the current instance. (Inherited from Object)
	ToString	Returns a string that represents the current object. (Inherited from Object)

Top

Remarks

The information contained in PdfPageInfo can help a developer to decide whether or not to OCR a failed page. For example, if page failed due to PageExtractedTextCriteria criteria (i.e., FailedDueToException is false) and there are one of more images on the page, then this page may be a candidate for OCR.

Reference

OpenDiscoverSDK.Interfaces.Content Namespace