PdfDocumentContentFailedPdfPages Property

Holds information on PDF pages that failed during content extraction.

Namespace: OpenDiscoverSDK.Interfaces.Content
Assembly: OpenDiscoverSDK.Interfaces (in OpenDiscoverSDK.Interfaces.dll) Version: 2025.4.6.0 (2025.4.6)

Syntax

Copy

[DataMemberAttribute]
public List<PdfPageInfo> FailedPdfPages { get; set; }

Property Value

ListPdfPageInfo

Remarks

PDF pages are marked as failed if either a parsing exception occurs while processing the page or the PDF page extracted text length was below the configurable threshold PageExtractedTextCriteria.

The information contained in PdfPageInfo can help a developer to decide whether or not to OCR a failed page. For example, if page failed due to PageExtractedTextCriteria criteria and there are one of more images on the page, then this page may be a candidate for OCR.

Reference

PdfDocumentContent Class

OpenDiscoverSDK.Interfaces.Content Namespace