DesktopPublishing |
Supported DesktopPublishing file formats (IdClassification.DesktopPublishing - Desktop publishing document formats)
If a file format does not have a supported content extractor that extracts text then, optionally (default), a binary-to-text content extractor will be used to extract UTF-8, UTF-16, Windows-1252, and ASCII from the binary. In many cases, indexable text can be extract from unknown document formats.
File Format Id Enum Value | Text | Metadata | EmbeddedItem | ContentHash | Description |
|---|---|---|---|---|---|
Microsoft Publisher compound file corrupted. Unable to determine specific format version (.pub). | |||||
X | X | X | X | Microsoft Publisher 98-2003 (.pub). | |
X | X | X | X | Microsoft Publisher 2007-2016 (.pub). | |
X | X | X | Microsoft Publisher exported as MHTML (.mht). | ||
X | X | X | Serif PagePlus desktop publishing (page layout) program developed by Serif (.ppp). | ||
X | X | X | Serif WebPlus website design program for Microsoft Windows (.wpp). | ||
X | X | X | Adobe PageMaker desktop publishing file format (.pm3;.pm4;.pm5;.pm6;.p65;.pm7;.pmd). |