Processing |
[DataContractAttribute] public enum ProcessingMode
| Member name | Value | Description |
|---|---|---|
| TextAndMetadata | 1 | Identify and extract text, attributes, and metadata from all input files and child documents. |
| MetadataContainerItemsFirstLevel | 2 | Identify and extract only metadata from all input files and also that of container child items. If a container document type (e.g., archive) contains a child item which is also a container type, the container child container's items are ignored. |
| MetadataNoContainerItems | 3 | Identify and extract only metadata from all input files (for container types, the child items are counted and stored as metadata field). Input items that are containers (e.g., archive and mail stores) only have their metadata extracted and their child items are ignored; however, a count of contained child items is stored in the parent container metadata. |
| IdentificationWithContainerItemCount | 4 | Identifies the file format of all input items, gets item count for archives/mail stores (see NumOfContainerItems), and gets image width/height for supported raster image formats (see ImageWidthInPixels and ImageHeightInPixels). Optionally, this mode can also calculate binary document hashing (e.g., MD5BinaryHash) and archive item true expansion size/compression ratio tests; however, this mode cannot calculate document content hashes (e.g., MD5ContentHash). This mode is useful at the start of a document processing workflow ("pre-processing") where the document properties returned by this mode such as FormatId, NumOfContainerItems, ImageWidthInPixels, ImageHeightInPixels, and optionally, TestedSize can be used to determine how to break large document collections into manageable processing/OCR tasks. |
| IdentificationOnly | 5 | Identify the file format of all input items. Only file system metadata is captured in this mode. |