Click or drag to resize

NotesAndResearch

Supported NotesAndResearch file formats (IdClassification.NotesAndResearch - Notes and research document formats)

  • All entries in table below are supported for file format identification.
  • 'X' in "Text" column indicates text extraction is supported for the file format.
  • 'X**' in "Text" column indicates text extraction is supported BUT binary-to-text filtering is used on partially parsed document records.
  • 'X' in "Metadata" column indicates metadata extraction is supported for the file format.
  • 'X' in "EmbeddedItem" column indicates embedded item/attachment extraction is supported for the file format.
  • 'X' in "ContentHash" column indicates a content hash is supported for the file format (see MD5ContentHash and SHA1ContentHash)

If a file format does not have a supported content extractor that extracts text then, optionally (default), a binary-to-text content extractor will be used to extract UTF-8, UTF-16, Windows-1252, and ASCII from the binary. In many cases, indexable text can be extract from unknown document formats.

NotesAndResearch Supported File Formats

File Format Id Enum Value

Text

Metadata

EmbeddedItem

ContentHash

Description

MSOneNote2003

X

X

X

Microsoft OneNote 2003 file format (.one).

MSOneNote2007

X

X

X

Microsoft OneNote 2007 file format (.one).

MSOneNote2010

X

X

X

Microsoft OneNote 2010, 2013, and 2016 file format (.one).

MSOneNoteTOC2003

Microsoft OneNote 2003 Table of Contents (TOC) file (.onetoc2).

MSOneNoteTOC2007

Microsoft OneNote 2007 Table of Contents (TOC) file (.onetoc2).

MSOneNoteTOC2010

Microsoft OneNote 2010, 2013, and 2016 Table of Contents (TOC) file (.onetoc2).

MSOneNotePackage

X

X

Microsoft OneNote package file that contains multiple OneNote files packaged into a single archive (.onepkg).

MSOneNote2010HTTP

Microsoft OneNote 2010, 2013, and 2016 HTTP transmitted (e.g., from SharePoint or OneDrive) file format (different format packaging than OneNote2010) (.one).

MSOneNoteTOC2010HTTP

Microsoft OneNote 2010, 2013, and 2016 Table of Contents (TOC) HTTP transmitted (e.g., from SharePoint or OneDrive) file format (different format packaging than MSOneNoteTOC2010) (.onetoc2).

MSOneNoteMhtml

X

X

X

Microsoft OneNote exported as MHTML (.mht).

SNote

S Note, an advanced note taking application developed by Samsung for use with Samsung mobile devices (.snb).

MindManagerMapFile

X

X

MindManager, by MindJet, mind mapping software file format (.mmap).

SmartNotebook

X

X

SMART Notebook file. Used by teachers to create classroom lecture materials. It may contain notes, diagrams, images, audio, and video (.notebook).