MailStore

Supported MailStore file formats (IdClassification.MailStore: email container document formats such as PST, OST, MBOX, and Lotus NSF).

  • All entries in the table below are supported for file format identification.
  • 'X' in "Text" column indicates text extraction is supported for the file format.
  • 'X**' in "Text" column indicates text extraction is supported BUT binary-to-text filtering is used on partially parsed document records.
  • 'X' in "Metadata" column indicates metadata extraction is supported for the file format.
  • 'X' in "EmbeddedItem" column indicates embedded item/attachment extraction is supported for the file format.
  • 'X' in "ContentHash" column indicates a content hash is supported for the file format (see MD5ContentHash and SHA1ContentHash).

If a file format has no supported content extractor for text, a binary-to-text content extractor is used instead (this is optional and enabled by default) to extract UTF-8, UTF-16, Windows-1252, and ASCII text from the binary. In many cases, indexable text can be extracted from unknown document formats.
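To illustrate the general idea of binary-to-text filtering, here is a minimal sketch in Python. It is not the SDK's implementation: the function name `extract_text_runs` and the `min_len` threshold are hypothetical, and the sketch only recovers printable ASCII and UTF-16LE runs (multi-byte UTF-8 and Windows-1252 high characters are left out for brevity).

```python
import re


def extract_text_runs(data: bytes, min_len: int = 4) -> list[str]:
    """Return printable text runs found in arbitrary binary data.

    Hypothetical helper for illustration only; a real filter would also
    handle multi-byte UTF-8 and Windows-1252 characters.
    """
    # Runs of printable ASCII bytes (space through tilde).
    ascii_pat = re.compile(rb"[\x20-\x7e]{%d,}" % min_len)
    # Runs of printable ASCII characters encoded as UTF-16LE (char, 0x00).
    utf16_pat = re.compile(rb"(?:[\x20-\x7e]\x00){%d,}" % min_len)

    runs = [(m.start(), m.group().decode("ascii"))
            for m in ascii_pat.finditer(data)]
    runs += [(m.start(), m.group().decode("utf-16-le"))
             for m in utf16_pat.finditer(data)]

    # Report the runs in the order they appear in the file.
    runs.sort(key=lambda item: item[0])
    return [text for _, text in runs]
```

Runs shorter than `min_len` are dropped because short printable sequences in binary data are usually noise rather than indexable text.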

MailStore Supported File Formats

File Format Id Enum Value

Text

Metadata

EmbeddedItem

ContentHash

Description

OutlookPSTAnsi

X

X

Microsoft Outlook 1997-2002 Personal Storage Table (PST) (.pst). This is the ANSI version, with a maximum size of 2 GB.

OutlookPSTUnicode

X

X

Microsoft Outlook 2003-2013 Personal Storage Table (PST) (.pst). This is the Unicode version, with a maximum size of 50 GB.

OutlookOSTAnsi

X

X

Microsoft Outlook 1997-2002 Offline Storage Table (OST) (.ost). Also referred to as the Offline Folder File (OFF) format. This is the ANSI version, with a maximum size of 2 GB.

OutlookOSTUnicode

X

X

Microsoft Outlook 2003-2010 Offline Storage Table (OST) (.ost). Also referred to as the Offline Folder File (OFF) format. This is the Unicode version, with a maximum size of 50 GB.

OutlookOST2013Unicode

X

X

Microsoft Outlook 2013 Offline Storage Table (OST) (.ost). Also referred to as the Offline Folder File (OFF) format. This is the Unicode version, with a maximum size of 50 GB. The internal BTREE page and block sizes are larger than in previous versions, and the blocks can be compressed.

OutlookPersonalAddressBook

Microsoft Outlook Personal Address Book (PAB) (.pab).

OutlookExpressFoldersDbx

X

X

Outlook Express mail folders database.

OutlookExpressMessagesDbx

X

X

Outlook Express messages database.

OutlookExpressPop3uidlDbx

X

X

Outlook Express POP3uidl database, a mail database that tracks messages left on the POP server.

OutlookExpressOfflineDbx

X

X

Outlook Express offline mail database that exists on systems where the user has configured webmail services such as Hotmail.

ExchangeServer2003

Exchange Server 2003 database file (.edb).

ExchangeServer2007

Exchange Server 2007 database file (.edb).

ExchangeServer2010

Exchange Server 2010 database file (.edb).

ExchangeServer2013

Exchange Server 2013 database file (.edb).

Mbox

X

X

MBOX email store. All messages in an mbox mailbox are concatenated and stored as plain text in a single file (.mbox;.mbx).

Foxmail

X

X

Foxmail email store. Foxmail is a freeware e-mail client developed by Tencent and is mainly used in China (.box).

LotusNotesStorageFacility

Lotus Notes Storage Facility (NSF) database (.nsf).

GroupWise

GroupWise messaging database (by Novell) (.db).

OutlookForMacMailbox

X

X

Outlook for Mac Data File, used to archive a user's Outlook folders, messages, calendar, contacts, etc. (.olm).

EmcEmailExtenderArchive

EMC EmailXtender email archive format (.emc).

BloombergMessagesXmlDump

Bloomberg Message (MSGXML) compliance dump format (.xml).
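
The Mbox entry in the table above describes a mail store as a single plain-text file of concatenated messages. As a rough illustration of what that layout means in practice, the sketch below enumerates the messages in such a file using the `mailbox` module from the Python standard library. It is independent of this SDK; the helper name `list_subjects` is hypothetical.

```python
import mailbox


def list_subjects(path: str) -> list[str]:
    """Return the Subject header of every message in an mbox file.

    Hypothetical helper for illustration; it relies on the standard
    library parser, which splits the file on "From " separator lines.
    """
    # create=False avoids silently creating an empty file for a bad path.
    box = mailbox.mbox(path, create=False)
    try:
        return [msg.get("Subject", "") for msg in box]
    finally:
        box.close()
```

Each message in the file starts with a "From " separator line, which is how a parser recovers the individual embedded items from the single container file.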