PdfDocumentContent.FailedPdfPages Property
Holds information on PDF pages that failed during content extraction.
Definition
Namespace: OpenDiscoverSDK.Interfaces.Content
Assembly: OpenDiscoverSDK.Interfaces (in OpenDiscoverSDK.Interfaces.dll) Version: 2026.2.6.0 (2026.02.06)
C#
[DataMemberAttribute]
public List<PdfPageInfo> FailedPdfPages { get; set; }
Property Value
List<PdfPageInfo>
Remarks
A PDF page is marked as failed if either a parsing exception occurs while processing the page or the length of the text extracted from the page falls below the configurable PageExtractedTextCriteria threshold.
The information contained in PdfPageInfo can help a developer decide whether to OCR a failed page. For example, if a page failed the PageExtractedTextCriteria check and there are one or more images on the page, then the page may be a candidate for OCR.
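The OCR decision described above could be sketched roughly as follows. This is only an illustration: PageInfoSketch is a hypothetical stand-in type, and its members (PageNumber, ParsingExceptionOccurred, BelowTextCriteria, ImageCount) are assumptions, not the actual members of PdfPageInfo. Consult the PdfPageInfo reference for the real property names before adapting it.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

// Hypothetical stand-in for the SDK's PdfPageInfo.
// All member names here are assumptions made for illustration only.
public sealed class PageInfoSketch
{
    public int PageNumber { get; set; }
    public bool ParsingExceptionOccurred { get; set; } // assumed: page failed on a parsing exception
    public bool BelowTextCriteria { get; set; }        // assumed: page failed the extracted-text threshold
    public int ImageCount { get; set; }                // assumed: number of images found on the page
}

public static class OcrCandidateSketch
{
    // Returns the page numbers of failed pages that fell below the text
    // threshold and contain at least one image.
    public static List<int> SelectOcrCandidates(IEnumerable<PageInfoSketch> failedPages)
    {
        return failedPages
            .Where(p => p.BelowTextCriteria && p.ImageCount > 0)
            .Select(p => p.PageNumber)
            .ToList();
    }

    public static void Main()
    {
        // Sample data standing in for the contents of FailedPdfPages.
        var failedPages = new List<PageInfoSketch>
        {
            new PageInfoSketch { PageNumber = 2, BelowTextCriteria = true, ImageCount = 3 },
            new PageInfoSketch { PageNumber = 5, ParsingExceptionOccurred = true, ImageCount = 1 },
            new PageInfoSketch { PageNumber = 7, BelowTextCriteria = true, ImageCount = 0 },
        };

        // prints: 2
        Console.WriteLine(string.Join(", ", SelectOcrCandidates(failedPages)));
    }
}
```

In this sketch, pages that failed on a parsing exception are left out of the candidate list. Whether such pages should also be sent to OCR is an application-level choice that the source does not specify.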