Pdf |
[DataMemberAttribute] public List<PdfPageInfo> FailedPdfPages { get; set; }
PDF pages are marked as failed if either a parsing exception occurs while processing the page or the PDF page extracted text length was below the configurable threshold PageExtractedTextCriteria.
The information contained in PdfPageInfo can help a developer to decide whether or not to OCR a failed page. For example, if page failed due to PageExtractedTextCriteria criteria and there are one of more images on the page, then this page may be a candidate for OCR.