The EmailDocumentContent type exposes the following members.
| Name | Description | |
|---|---|---|
| AttachmentNames | Normalized list of attachment names separated by "; ". | |
| Attributes |
Document attributes. See DocumentAttributes for an enumeration of supported attributes.
(Inherited from DocumentContent) | |
| BccRecipients | "Bcc" recipient email information. | |
| Body | The email text body used for the extracted email text. If the body was originally HTML, RTF, Flowed, or Enriched, this property value has been converted to plain text. | |
| BodyType | The email body format used for the extracted email text. The preferred email bodies, in order of preference: HTML, RTF/RTFWithEncapsulatedHtml, Text, Flowed, Enriched. | |
| CcRecipients | "Cc" recipient email information. | |
| ChildDocuments |
Child documents (attachments/embedded items). See remarks for the special cases of archives (.7z, .zip, etc.), media images, and mail stores (.pst, .ost, .mbox, etc.).
(Inherited from DocumentContent) | |
| CreationDate | Email's creation time. Check property DateTime.Kind to determine if UTC, Local, or Unspecified time. | |
| CustomMetadata |
Contains custom (user-defined) document metadata as a dictionary of metadata field names as keys and metadata field data as corresponding values.
(Inherited from DocumentContent) | |
| EDRMMessageIdentificationHash | The EDRM Message Identification Hash (MIH) is the MD5 hash value of the ASCII string consisting of the Message-ID header field of RFC-compliant email messages. | |
| EnrichedBody | If HasEnrichedBody is true, then this property contains the Enriched formatted body. | |
| EntityExtractionResult |
Document entity item extraction result.
(Inherited from DocumentContent) | |
| EntryId | Entry ID for Outlook PST/OST extracted message objects in hexadecimal string format. | |
| ErrorMessage |
Gets or sets an error message associated with Result. This property is only set when Result is not set to Ok.
(Inherited from DocumentContent) | |
| ErrorStackTrace |
Error (exception) stack trace associated with ErrorMessage. This property is only set when Result is not Ok
and if an internal exception was caught.
(Inherited from DocumentContent) | |
| ExtractedText |
Extracted text. See remarks for limitations.
(Inherited from DocumentContent) | |
| FileEntropy |
Shannon entropy of the document's bytes.
(Inherited from DocumentContent) | |
| FlowedBody | If HasFlowedBody is true, then this property contains the Flowed formatted body. | |
| FormatId |
Document format identification result from prior file identification (this object was an input to the content extractor factory and
is stored here for convenience).
(Inherited from DocumentContent) | |
| From | Specifies the author(s) of the message; that is, the EmailAddress(es) of the person(s) or system(s) responsible for the writing of the message. | |
| HasEnrichedBody | True if this email has an Enriched formatted body. | |
| HasFlowedBody | True if this email has a Flowed formatted body. | |
| HasHtmlBody | True if this email has an HTML body. | |
| HasRtfBody | True if this email has an RTF body. | |
| HasTextBody | True if this email has a plain-text body. | |
| HtmlBody | If HasHtmlBody is true, then this property contains the HTML body. | |
| HyperLinks |
Document hyperlinks.
(Inherited from DocumentContent) | |
| InReplyToId | MIME 'in-reply-to' header value (if it exists). Contains the value of the original message's MessageId property. | |
| IsEmailType |
If true, this document is an email document. This DocumentContent object should be cast to an EmailDocumentContent to
get additional email document specific properties.
(Inherited from DocumentContent) | |
| IsEncrypted |
Document is encrypted if this property is true.
(Inherited from DocumentContent) | |
| IsHtmlType |
If true, this document is an HTML document. This DocumentContent object should be cast to an HtmlDocumentContent to
get additional HTML document specific properties.
(Inherited from DocumentContent) | |
| IsMimePartialMessage | True if this is a MIME email partial message (ContentType MIME header with MIME-type = "message/partial"); false otherwise. | |
| IsPdfType |
If true, this document is a PDF document. This DocumentContent object should be cast to a PdfDocumentContent to
get additional PDF document specific properties.
(Inherited from DocumentContent) | |
| LanguageIdResults |
Extracted text language identification results.
(Inherited from DocumentContent) | |
| LastModificationDate | Email's last modified time. Check property DateTime.Kind value to determine if UTC, Local, or Unspecified time. | |
| MD5BinaryHash |
MD5 binary document hash (hash of all document bytes).
(Inherited from DocumentContent) | |
| MD5ContentHash |
MD5 content hash is a proprietary hash on only the content of a document file format.
(Inherited from DocumentContent) | |
| MessageId | MIME 'message-id' header value (if it exists). This is also set for Outlook .msg files, if it exists. | |
| Metadata |
Contains standard (non-user-defined) document metadata as a dictionary of metadata field names as keys and metadata field data as corresponding values.
(Inherited from DocumentContent) | |
| MimePartialMessageId | If IsMimePartialMessage is true, then this property holds the unique partial-message 'id' that all parts (separate MIME files) of the partial message contain. | |
| MimePartialMessagePartNumber | If IsMimePartialMessage is true, then this property holds the index of this partial-message part (valid range: 1 to MimePartialMessageTotalParts). | |
| MimePartialMessageTotalParts | If IsMimePartialMessage is true, then this property holds the total number of MIME partial-message parts. | |
| Password |
The password that decrypted the document, found by cycling through the supplied password list.
(Inherited from DocumentContent) | |
| ReceivedDate | Email received date/time. Check property DateTime.Kind to determine if UTC, Local, or Unspecified time. | |
| Result |
Gets or sets the result of the content extraction. Check this value to see if content extraction was successful.
(Inherited from DocumentContent) | |
| RtfBody | If HasRtfBody is true, then this property contains the Rich Text Format (RTF) body. | |
| RtfHasEncapsulatedHtml | If true, the RTF body (RtfBody) has encapsulated HTML (from Outlook conversion to RTF); false otherwise. | |
| Sender | Sender information. The sender is the EmailAddress of the agent responsible for the actual transmission of the message. The sender and the From are often the same but can be different. | |
| SentDate | Email sent date/time. Check property DateTime.Kind to determine if UTC, Local, or Unspecified time. | |
| SHA1AttachmentHash | SHA-1 hash of the concatenated SHA-1 hashes of each attachment's binary data (includes hashes of inline images). | |
| SHA1AttachmentSortedHash | SHA-1 hash of the sorted and then concatenated SHA-1 hashes of each attachment's binary data (includes hashes of inline images). | |
| SHA1BinaryHash |
SHA-1 binary document hash (hash of all document bytes).
(Inherited from DocumentContent) | |
| SHA1BodyHash | SHA-1 hash of the email Body text (converted to lower case and with all white space removed). | |
| SHA1ContentHash |
SHA-1 content hash is a proprietary hash on only the content part of document file format.
(Inherited from DocumentContent) | |
| SHA1HeaderHash | SHA-1 hash of the concatenated message SentDate (e.g., Outlook 'ClientSubmitTime' or MIME 'Date' field), subject, and Sender name and email address (converted to lower case and with all white space removed before hashing). | |
| SHA1RecipientNamesHash | SHA-1 hash of all recipient names concatenated together (all lower case). | |
| SHA1RecipientsHash | SHA-1 hash of all recipient names and email addresses concatenated together (all lower case). | |
| SHA256BinaryHash |
SHA-256 binary document hash (hash of all document bytes).
(Inherited from DocumentContent) | |
| SHA256ContentHash |
SHA-256 content hash is a proprietary hash on only the content part of document file format.
(Inherited from DocumentContent) | |
| Subject | Email subject text. | |
| TextBody | If HasTextBody is true, then this property contains the plain-text body. | |
| TextSourceType |
Gets or sets the method by which the document text (if any) was acquired.
(Inherited from DocumentContent) | |
| ToRecipients | "To" recipient email information. |