Output |
[DataContractAttribute] public enum OutputMode
| Member name | Value | Description |
|---|---|---|
| Archive | 1 | All output is written in a Open Discover data archive format. This output mode is usually the fastest for processing as it reduces the write disk I/O, and disk I/O is often the processing bottleneck for non-SSD data storage. A further benefit is much faster backups, deletes, and moving of processing output folders. The Open Discover Archive format also reduces total storage size. Separate Open Discover archive format files are output for document metadata (DocumentDataArchive.dda), attachments/embedded items (AttachmentArchive_p#_##.ada), and text (TextArchive_p#_##.tda), and each archive can contain the data of tens of thousands of files. Very 'large' attachments ('large' is a subjective term defined by property LargeDocumentCritera) are output separately to own their folder and not stored in the attachment archive. |
| IndividualFiles | 2 | In this mode, all document level information is still written to a document data archive (DocumentDataArchive.dda) format; however, attachments/embedded items and extracted text are written as individual files to the file system. |