Click or drag to resize

Email

Supported Email file formats (IdClassification.Email - Email document (e.g., Outlook message (IPM.Note), EML, MIME mail, Lotus Notes .dxl, etc).)

  • All entries in table below are supported for file format identification.
  • 'X' in "Text" column indicates text extraction is supported for the file format.
  • 'X**' in "Text" column indicates text extraction is supported BUT binary-to-text filtering is used on partially parsed document records.
  • 'X' in "Metadata" column indicates metadata extraction is supported for the file format.
  • 'X' in "EmbeddedItem" column indicates embedded item/attachment extraction is supported for the file format.
  • 'X' in "ContentHash" column indicates a content hash is supported for the file format (see MD5ContentHash and SHA1ContentHash)

If a file format does not have a supported content extractor that extracts text then, optionally (default), a binary-to-text content extractor will be used to extract UTF-8, UTF-16, Windows-1252, and ASCII from the binary. In many cases, indexable text can be extract from unknown document formats.

Email Supported File Formats

File Format Id Enum Value

Text

Metadata

EmbeddedItem

ContentHash

Description

OutlookMessage

X

X

X

X

Microsoft Outlook .msg normal e-mail message (IPM.Note)

OutlookRightsManagedEmailObject

X

X

X

X

Outlook rights-managed email message object. A rights-managed email protects the email object's content from inappropriate access, use, and distribution (.rpmsg).

OutlookMessageSecure

X

X

X

X

Microsoft Outlook .msg encrypted e-mail message (IPM.Note.Secure)

OutlookMessageSecureSign

X

X

X

X

Microsoft Outlook .msg encrypted, digitally signed e-mail message (IPM.Note.Secure)

OutlookSMIME

X

X

X

X

Microsoft Outlook .msg encrypted e-mail message that can also be signed

OutlookSMIME_MultipartSigned

X

X

X

X

Microsoft Outlook .msg clear signed e-mail message

OutlookResend

X

X

X

X

Microsoft Outlook .msg resend a failed message, message class

TnefEmail

X

X

X

X

Microsoft Transport Neutral Encapsulation Format (TNEF) (also know as winmail.dat) (.dat). The TNEF format enables the encoding of rich properties in electronic mail messages over a serial data stream.The result can be transported as a stream, as a file attachment in an arbitrary transport, or as a MIME entity body on an Internet transport.

MimeEmail

X

X

X

X

Email saved in MIME (RFC 822) format (.eml;.mht;.html;.htm).

MimeEmailPartial

X

X

X

X

Partial email saved in MIME (RFC 822) format. Partial emails have 'Content-Type' header value of 'message/partial' and allows large messages to be sent in pieces and re-assembled by the client.

MimeOutlookEml

X

X

X

X

Microsoft Outlook email saved in MIME (RFC 822) format (.eml).

MimeNews

X

X

X

X

MIME (RFC 822) news group format type (.eml;.mht;.html;.htm).

SMimeEmailClearSigned

X

X

X

X

S/MIME (Secure/Multipurpose Internet Mail Extensions) clear-signed message. Clear-signed messages have MIME media type "multipart/signed" (.eml).

SMimeEmailOpaqueSigned

X

X

X

X

S/MIME (Secure/Multipurpose Internet Mail Extensions) opaque-signed message. Opaque-signed messages have exactly one MIME entity and this MIME entity usually has the media type "application/pkcs7-mime" (.eml).

SMimeEmailEncrypted

X

X

X

X

S/MIME (Secure/Multipurpose Internet Mail Extensions) encrypted (enveloped-data) message (.eml).

MimeAppleEmlx

X

X

X

X

Apple Mail MIME message (.emlx).

MimeAppleEmlxClearSigned

X

X

X

X

Apple Mail MIME clear-signed message. Clear-signed messages have MIME media type "multipart/signed" (.emlx).

MimeAppleEmlxOpaqueSigned

X

X

X

X

Apple Mail MIME opaque-signed message. Opaque-signed messages have exactly one MIME entity and this MIME entity usually has the media type "application/pkcs7-mime" (.emlx).

MimeAppleEmlxEncrypted

X

X

X

X

Apple Mail MIME encrypted (enveloped-data) message (.emlx).

PGPClearSignedMessage

X

X

X

X

Pretty Good Privacy (PGP) clear-signed message (.pgp;.gpg;.asc;.txt).

TextEmail7BitAscii

X

X

X

Email message encoded in 7-bit ASCII text format (.txt).

TextEmailUTF8

X

X

X

Email message encoded in UTF-8 text format (.txt).

TextEmailUTF16LE

X

X

X

Email message encoded in UTF-16LE text format (.txt).

TextEmailUTF16BE

X

X

X

Email message encoded in UTF-16BE text format (.txt).

TextEmailUTF32LE

X

X

X

Email message encoded in UTF-32LE text format (.txt).

TextEmailUTF32BE

X

X

X

Email message encoded in UTF-32BE text format (.txt).

TextEmail_ISO_8859_1

X

X

X

Email message encoded in ISO-8859-1 (Danish, Dutch, English, French, German, Italian, Norwegian, Portuguese, Swedish) text format (.txt).

TextEmail_ISO_8859_2

X

X

X

Email message encoded in ISO-8859-2 (Czech, Hungarian, Polish, Romanian) text format (.txt).

TextEmail_ISO_8859_5

X

X

X

Email message encoded in ISO-8859-5 (Cyrillic) text format (.txt).

TextEmail_ISO_8859_6

X

X

X

Email message encoded in ISO-8859-6 (Arabic) text format (.txt).

TextEmail_ISO_8859_7

X

X

X

Email message encoded in ISO-8859-7 (Greek) text format (.txt).

TextEmail_ISO_8859_8

X

X

X

Email message encoded in ISO-8859-8-I (Hebrew) text format (.txt).

TextEmail_ISO_8859_9

X

X

X

Email message encoded in ISO-8859-9 (Turkish) text format (.txt).

TextEmail_Windows_1250

X

X

X

Email message encoded in Windows-1250 Czech, Hungarian, Polish, Romanian (.txt).

TextEmail_Windows_1251

X

X

X

Email message encoded in Windows-1251 Russian (.txt).

TextEmail_Windows_1252

X

X

X

Email message encoded in Windows-1252 (Danish, Dutch, English, French, German, Italian, Norwegian, Portuguese, Swedish) (.txt).

TextEmail_Windows_1253

X

X

X

Email message encoded in Windows-1253 Greek (.txt).

TextEmail_Windows_1254

X

X

X

Email message encoded in Windows-1254 Turkish (.txt).

TextEmail_Windows_1255

X

X

X

Email message encoded in Windows-1255 Hebrew (.txt).

TextEmail_Windows_1256

X

X

X

Email message encoded in Windows-1256 Arabic (.txt).

TextEmail_KOI8_R

X

X

X

Email message encoded in KOI8-R, designed to cover Russian, which uses a Cyrillic alphabet (.txt).

TextEmail_IBM_424

X

X

X

Email message encoded in IBM 424, Hebrew (.txt).

TextEmail_IBM_420

X

X

X

Email message encoded in IBM 420 Arabic (.txt).

TextEmail_IBM_866

X

X

X

Email message encoded in IBM 866 Russian (.txt).

TextEmail_Shift_JIS

X

X

X

Email message encoded in Shift_JIS Japanese (.txt).

TextEmail_ISO_2022_JP

X

X

X

Email message encoded in ISO-2022-JP Japanese (.txt).

TextEmail_ISO_2022_CN

X

X

X

Email message encoded in ISO-2022-CN Simplified Chinese (.txt).

TextEmail_ISO_2022_KR

X

X

X

Email message encoded in ISO-2022-KR Korean (.txt).

TextEmail_GB18030

X

X

X

Email message encoded in GB18030 Chinese (.txt).

TextEmail_Big5

X

X

X

Email message encoded in Big5 Traditional Chinese (.txt).

TextEmail_EUC_JP

X

X

X

Email message encoded in EUC-JP Japanese (.txt).

TextEmail_EUC_KR

X

X

X

Email message encoded in EUC-KR Korean (.txt).

TextEmail_EBCDIC_500

X

X

X

Email message encoded in EBCDIC 500 (.txt).

DominoXmlMemo

X

X

X

X

Domino XML (DXL) memo document (email) export file format (.dxl).

DominoXmlReply

X

X

X

X

Domino XML (DXL) reply document (email) export file format (.dxl).

AdvansysPortableMessage

Advansys Message Viewer email format for accessing and sharing saved GroupWise messages (.fml).