Supported Email file formats (IdClassification.Email - Email document (e.g., Outlook message (IPM.Note), EML, MIME mail, Lotus Notes .dxl, etc).)
If a file format does not have a supported content extractor that extracts text then, optionally (default), a binary-to-text content extractor will be used to extract UTF-8, UTF-16, Windows-1252, and ASCII from the binary. In many cases, indexable text can be extract from unknown document formats.
File Format Id Enum Value | Text | Metadata | EmbeddedItem | ContentHash | Description |
|---|---|---|---|---|---|
X | X | X | X | Microsoft Outlook .msg normal e-mail message (IPM.Note) | |
X | X | X | X | Outlook rights-managed email message object. A rights-managed email protects the email object's content from inappropriate access, use, and distribution (.rpmsg). | |
X | X | X | X | Microsoft Outlook .msg encrypted e-mail message (IPM.Note.Secure) | |
X | X | X | X | Microsoft Outlook .msg encrypted, digitally signed e-mail message (IPM.Note.Secure) | |
X | X | X | X | Microsoft Outlook .msg encrypted e-mail message that can also be signed | |
X | X | X | X | Microsoft Outlook .msg clear signed e-mail message | |
X | X | X | X | Microsoft Outlook .msg resend a failed message, message class | |
X | X | X | X | Microsoft Transport Neutral Encapsulation Format (TNEF) (also know as winmail.dat) (.dat). The TNEF format enables the encoding of rich properties in electronic mail messages over a serial data stream.The result can be transported as a stream, as a file attachment in an arbitrary transport, or as a MIME entity body on an Internet transport. | |
X | X | X | X | Email saved in MIME (RFC 822) format (.eml;.mht;.html;.htm). | |
X | X | X | X | Partial email saved in MIME (RFC 822) format. Partial emails have 'Content-Type' header value of 'message/partial' and allows large messages to be sent in pieces and re-assembled by the client. | |
X | X | X | X | Microsoft Outlook email saved in MIME (RFC 822) format (.eml). | |
X | X | X | X | MIME (RFC 822) news group format type (.eml;.mht;.html;.htm). | |
X | X | X | X | S/MIME (Secure/Multipurpose Internet Mail Extensions) clear-signed message. Clear-signed messages have MIME media type "multipart/signed" (.eml). | |
X | X | X | X | S/MIME (Secure/Multipurpose Internet Mail Extensions) opaque-signed message. Opaque-signed messages have exactly one MIME entity and this MIME entity usually has the media type "application/pkcs7-mime" (.eml). | |
X | X | X | X | S/MIME (Secure/Multipurpose Internet Mail Extensions) encrypted (enveloped-data) message (.eml). | |
X | X | X | X | Apple Mail MIME message (.emlx). | |
X | X | X | X | Apple Mail MIME clear-signed message. Clear-signed messages have MIME media type "multipart/signed" (.emlx). | |
X | X | X | X | Apple Mail MIME opaque-signed message. Opaque-signed messages have exactly one MIME entity and this MIME entity usually has the media type "application/pkcs7-mime" (.emlx). | |
X | X | X | X | Apple Mail MIME encrypted (enveloped-data) message (.emlx). | |
X | X | X | X | Pretty Good Privacy (PGP) clear-signed message (.pgp;.gpg;.asc;.txt). | |
X | X | X | Email message encoded in 7-bit ASCII text format (.txt). | ||
X | X | X | Email message encoded in UTF-8 text format (.txt). | ||
X | X | X | Email message encoded in UTF-16LE text format (.txt). | ||
X | X | X | Email message encoded in UTF-16BE text format (.txt). | ||
X | X | X | Email message encoded in UTF-32LE text format (.txt). | ||
X | X | X | Email message encoded in UTF-32BE text format (.txt). | ||
X | X | X | Email message encoded in ISO-8859-1 (Danish, Dutch, English, French, German, Italian, Norwegian, Portuguese, Swedish) text format (.txt). | ||
X | X | X | Email message encoded in ISO-8859-2 (Czech, Hungarian, Polish, Romanian) text format (.txt). | ||
X | X | X | Email message encoded in ISO-8859-5 (Cyrillic) text format (.txt). | ||
X | X | X | Email message encoded in ISO-8859-6 (Arabic) text format (.txt). | ||
X | X | X | Email message encoded in ISO-8859-7 (Greek) text format (.txt). | ||
X | X | X | Email message encoded in ISO-8859-8-I (Hebrew) text format (.txt). | ||
X | X | X | Email message encoded in ISO-8859-9 (Turkish) text format (.txt). | ||
X | X | X | Email message encoded in Windows-1250 Czech, Hungarian, Polish, Romanian (.txt). | ||
X | X | X | Email message encoded in Windows-1251 Russian (.txt). | ||
X | X | X | Email message encoded in Windows-1252 (Danish, Dutch, English, French, German, Italian, Norwegian, Portuguese, Swedish) (.txt). | ||
X | X | X | Email message encoded in Windows-1253 Greek (.txt). | ||
X | X | X | Email message encoded in Windows-1254 Turkish (.txt). | ||
X | X | X | Email message encoded in Windows-1255 Hebrew (.txt). | ||
X | X | X | Email message encoded in Windows-1256 Arabic (.txt). | ||
X | X | X | Email message encoded in KOI8-R, designed to cover Russian, which uses a Cyrillic alphabet (.txt). | ||
X | X | X | Email message encoded in IBM 424, Hebrew (.txt). | ||
X | X | X | Email message encoded in IBM 420 Arabic (.txt). | ||
X | X | X | Email message encoded in IBM 866 Russian (.txt). | ||
X | X | X | Email message encoded in Shift_JIS Japanese (.txt). | ||
X | X | X | Email message encoded in ISO-2022-JP Japanese (.txt). | ||
X | X | X | Email message encoded in ISO-2022-CN Simplified Chinese (.txt). | ||
X | X | X | Email message encoded in ISO-2022-KR Korean (.txt). | ||
X | X | X | Email message encoded in GB18030 Chinese (.txt). | ||
X | X | X | Email message encoded in Big5 Traditional Chinese (.txt). | ||
X | X | X | Email message encoded in EUC-JP Japanese (.txt). | ||
X | X | X | Email message encoded in EUC-KR Korean (.txt). | ||
X | X | X | Email message encoded in EBCDIC 500 (.txt). | ||
X | X | X | X | Domino XML (DXL) memo document (email) export file format (.dxl). | |
X | X | X | X | Domino XML (DXL) reply document (email) export file format (.dxl). | |
Advansys Message Viewer email format for accessing and sharing saved GroupWise messages (.fml). |