Document |
The DocumentDataArchiveReader type exposes the following members.
| Name | Description | |
|---|---|---|
| ClassificationCount | Gets a dictionary that contains IdClassification as key and the count of documents that have that file format classification as values. | |
| ContentResultCount | Gets a dictionary that contains ContentResult as key and a ContentResultInfo as value. | |
| CreationDate | Archive creation date (UTC). | |
| DirectoryHierarchy | All document data in input directory hierarchy. The hierarchy also contains document parent/child relationships. | |
| DocumentArchiveFolderPath | The root folder of the document data archive. | |
| DocumentByControlNumber | Returns all documents by DocControlNumber provided that the Documents had DocControlNumber set. | |
| DocumentByDocGuid | Gets a dictionary with DocGuid key and associated document value. | |
| EntityItemDocuments | All documents with at least 1 entity item found in extracted text and/or metadata. | |
| ExcludedDocuments | All documents with Result set to ExcludedType. | |
| FlatRecords | Gets all archive document entries as a flattened (non-hierarchival) list. | |
| FormatIdCount | Gets a dictionary that contains Id as key and the count of documents that have that file format identification as values. | |
| HasReadErrors | True if there were errors reading the document data archive (.dda). | |
| HierarchicalRecords | Gets all document data archive entries with parent/child hierarchy. | |
| IssueDocuments | All documents that do not have Result values set to either Ok, EmptyFile, ExcludedType, or RequeueAsSeparateTask | |
| LongRunningDocuments | All documents that have Result values set to LongRunningProcessingError. | |
| NistDocuments | All documents whose SHA1BinaryHash match a SHA1 hash in the NIST hash database (see PerformNistCheck and NistRdsDatabasePath). | |
| PdfDocumentsWithFailedPages | All PDF documents with at least 1 failed PDF page. | |
| ReaderMode | The DocumentDataArchiveReaderMode of this instance. | |
| ReadErrors | If HasReadErrors is true, this property will hold read error information. | |
| RequeueDocuments | All documents with Result set to either RequeueAsSeparateTask or UserRequeueAsSeparateTask. | |
| Settings | Task settings that were used to create this document data archive output. | |
| SHA1BinaryHashMatchGroups | Gets a list of HashMatchGroup that contain documents that have the same SHA1BinaryHash value. | |
| SHA1ContentHashMatchGroups | Gets a list of HashMatchGroup that contain documents that have the same SHA1ContentHash value. | |
| TotalFlatRecordSize | Total size in bytes of all documents in FlatRecords. | |
| TotalNumOfDocumentRecords | Total number of document records in document data archive. | |
| TotalSHA1BinaryHashMatches | Gets total number of documents that have same SHA1BinaryHash. | |
| TotalSHA1ContentHashMatches | Gets total number of documents that have same SHA1ContentHash. | |
| UnknownDocuments | All documents with FormatId set to either Unknown or UnknownCompoundFile. | |
| Version | Archive format version. |