Click or drag to resize

WordProcessing

Supported WordProcessing file formats (IdClassification.WordProcessing - Word processing document formats)

  • All entries in table below are supported for file format identification.
  • 'X' in "Text" column indicates text extraction is supported for the file format.
  • 'X**' in "Text" column indicates text extraction is supported BUT binary-to-text filtering is used on partially parsed document records.
  • 'X' in "Metadata" column indicates metadata extraction is supported for the file format.
  • 'X' in "EmbeddedItem" column indicates embedded item/attachment extraction is supported for the file format.
  • 'X' in "ContentHash" column indicates a content hash is supported for the file format (see MD5ContentHash and SHA1ContentHash)

If a file format does not have a supported content extractor that extracts text then, optionally (default), a binary-to-text content extractor will be used to extract UTF-8, UTF-16, Windows-1252, and ASCII from the binary. In many cases, indexable text can be extract from unknown document formats.

WordProcessing Supported File Formats

File Format Id Enum Value

Text

Metadata

EmbeddedItem

ContentHash

Description

MSWorksWordProcessor2

Microsoft Works Word Processor DOS versions 1-3 and version 2.0 for Windows (.wps).

MSWorksWordProcessor3

Microsoft Works Word Processor version 3 for Windows (.wps)

MSWorksWordProcessor4

Microsoft Works Word Processor version 4 for Windows (.wps)

WordDos1to4

X

Microsoft Word for DOS versions 1.0 - 4.0 (.doc).

WordDos5

X

Microsoft Word 5.0 for DOS (.doc).

WordDos55

X

Microsoft Word 5.5 for DOS (.doc).

WordDos6

X

Microsoft Word 6.0 for DOS (.doc).

Word1

X

Word for Windows 1.0 (version 1, 1989)

Word2

X

Word for Windows 2.0 (version 2, 1991)

Word6

X

X

X

Word for Windows 6.0 (version 6, 1993). Versions skipped from 2 to 6 to bring Windows version numbering in line with that of DOS.

Word95

X

X

X

Microsoft Word 95 (version 7, 1995)

Word95Encrypted

X

X

X

Encrypted Microsoft Word 95 (version 7, 1995)

Word97

X

X

X

X

Microsoft Word 97 (version 8, 1997)

Word97Encrypted

X

X

X

X

Encrypted Microsoft Word 97 (version 8, 1997)

Word2000

X

X

X

X

Microsoft Word 2000 (version 9, 1999)

Word2000Encrypted

X

X

X

X

Encrypted Microsoft Word 2000 (version 9, 1999)

Word2002

X

X

X

X

Microsoft Word 2002 (version 10, 2001)

Word2002Encrypted

X

X

X

X

Encrypted Microsoft Word 2002 (version 10, 2001)

Word2003

X

X

X

X

Microsoft Word 2003 (version 11, 2003)

Word2003Encrypted

X

X

X

X

Encrypted Microsoft Word 2003 (version 11, 2003)

Word2007

X

X

X

X

Microsoft Word 2007 (version 12, 2006)

Word2007Macro

X

X

X

X

Microsoft Word 2007 macro-enabled document (version 12, 2006)

Word2007Template

X

X

X

X

Microsoft Word 2007 document template (version 12, 2006)

Word2007TemplateMacro

X

X

X

X

Microsoft Word 2007 macro-enabled document template (version 12, 2006)

Word2010

X

X

X

X

Microsoft Word 2010 (version 14, 2010)

Word2010Macro

X

X

X

X

Microsoft Word 2010 macro-enabled document (version 14, 2010)

Word2010Template

X

X

X

X

Microsoft Word 2010 document template (version 14, 2010)

Word2010TemplateMacro

X

X

X

X

Microsoft Word 2010 macro-enabled document template (version 14, 2010)

Word2013

X

X

X

X

Microsoft Word 2013 (version 15, 2013)

Word2013Macro

X

X

X

X

Microsoft Word 2013 macro-enabled document (version 15, 2013)

Word2013Template

X

X

X

X

Microsoft Word 2013 document template (version 15, 2013)

Word2013TemplateMacro

X

X

X

X

Microsoft Word 2013 macro-enabled document template (version 15, 2013)

Word2016

X

X

X

X

Microsoft Word 2016 (version 16, 2015)

Word2016Macro

X

X

X

X

Microsoft Word 2016 macro-enabled document (version 16, 2015)

Word2016Template

X

X

X

X

Microsoft Word 2016 document template (version 16, 2015)

Word2016TemplateMacro

X

X

X

X

Microsoft Word 2016 macro-enabled document template (version 16, 2015)

Word2007Corrupted

X

X

X

X

Microsoft Word 2007 or higher that is potentially corrupted. The format's zip container failed inspection and format had to be identified using an alternate means (.docx).

Word2007OnwardEncrypted

X

X

X

X

Encrypted Microsoft Word 2007-2013

Word2007OnwardEncryptedIRM

X

X

X

X

Encrypted and information rights management protected (IRM) Microsoft Word 2007-2016 format.IRM(what Microsoft calls DRM) uses permissions and authorization to help prevent sensitive information from being printed, forwarded, or copied by authorized users, or accessed by unauthorized people.

Word2003Xml

X

X

X

Microsoft Word 2003 (version 11, 2003) saved as XML file (.xml).

Word2007Xml

X

X

X

Microsoft Word 2007 (version 12, 2006) saved as XML file (.xml).

Word2000Html

X

X

Microsoft Word 2007 (version 12, 2006) saved as XML file (.xml).

WordMhtml

X

X

X

Microsoft Word saved as (MIME) MHTML (.mht;.mhtml).

WordCompoundFileCorrupted

Microsoft Word 97-2003 compound file format corrupted. Unable to determine specific format version (.doc).

MicrosoftWordPicture6

X

X

X

Microsoft Word Picture 6.0 metafile (usually embedded metafiles in Microsoft Word 6 documents) (.doc).

MicrosoftWordPicture95

X

X

X

Microsoft Word Picture 95 metafile (usually embedded metafiles in Microsoft Word 95 documents) (.doc).

MicrosoftWordPicture

X

X

X

X

Microsoft Word Picture metafile (usually embedded metafiles in Microsoft Word 97-2003 documents) (.doc).

MacWord1

X

Microsoft Word 1.0 for Mac OS (.mcw;.clx;.doc; or no extension).

MacWord3

X

Microsoft Word 3.0 for Mac OS (.mcw;.clx;.doc; or no extension).

MacWord4

X

Microsoft Word 4.0 for Mac OS (.mcw;.clx;.doc; or no extension).

MacWord5

X

Microsoft Word 5.0 for Mac OS (.mcw;.clx;.doc; or no extension).

StarOfficeWriter52

X

X

X

StarOffice Writer version 5.2.

StarOfficeWriter6to7

X

X

X

StarOffice Writer versions 6.0 and 7 (.sxw;.odt).

StarOfficeWriter6to7Encrypted

Encrypted StarOffice Writer versions 6.0 and 7 (.sxw;.odt).

StarOfficeWriter8

X

X

X

StarOffice Writer version 8.0 .sxw;(.odt).

StarOfficeWriter8Encrypted

Encrypted StarOffice Writer version 8.0 (.sxw;.odt).

StarOfficeWriter9

X

X

X

StarOffice Writer version 9.0 (.sxw;.odt).

StarOfficeWriter9Encrypted

Encrypted StarOffice Writer version 9.0 (.sxw;.odt).

OpenOfficeOrgWriter1

X

X

X

OpenOffice.org Writer versions 1.x by Sun Microsystems (.sxw;.odt).

OpenOfficeOrgWriterTemplate1

X

X

X

OpenOffice.org Writer Template versions 1.x by Sun Microsystems (.stw;.ott).

OpenOfficeOrgWriter2

X

X

X

OpenOffice.org Writer versions 2.x by Sun Microsystems (.sxw;.odt).

OpenOfficeOrgWriterTemplate2

X

X

X

OpenOffice.org Writer Template versions 2.x by Sun Microsystems (.stw;.ott).

OpenOfficeOrgWriter3

X

X

X

OpenOffice.org Writer versions 3.x by Sun Microsystems (Last version of OpenOffice.org until it became Oracle OpenOffice Writer 3.3) (.sxw;.odt).

OpenOfficeOrgWriterTemplate3

X

X

X

OpenOffice.org Writer Template versions 3.x by Sun Microsystems (Last version of OpenOffice.org until it became Oracle OpenOffice Writer 3.3) (.stw;.ott).

OracleOpenOfficeWriter33

X

X

X

Oracle OpenOffice Writer version 3.3 (Last version of Oracle OpenOffice until it became Apache OpenOffice) (.odt).

OracleOpenOfficeWriterTemplate33

X

X

X

Oracle OpenOffice Writer Template version 3.3 (Last version of Oracle OpenOffice until it became Apache OpenOffice) (.ott).

ApacheOpenOfficeWriter34

X

X

X

Apache OpenOffice Writer version 3.4+ (First version of Apache OpenOffice which came from open-sourced Oracle OpenOffice 3.3) (.odt).

ApacheOpenOfficeWriterTemplate34

X

X

X

Apache OpenOffice Writer Template version 3.4+ (First version of Apache OpenOffice which came from open-sourced Oracle OpenOffice 3.3) (.ott).

ApacheOpenOfficeWriter4

X

X

X

Apache OpenOffice Writer version 4.x (.odt).

ApacheOpenOfficeWriterTemplate4

X

X

X

Apache OpenOffice Writer Template version 4.x (.ott).

LibreOfficeWriter3

X

X

X

LibreOffice Writer version 3.x (3.3 is first version of LibreOffice after fork from Apache OpenOffice) (.odt).

LibreOfficeWriterTemplate3

X

X

X

LibreOffice Writer Template version 3.x (3.3 is first version of LibreOffice after fork from Apache OpenOffice) (.ott).

LibreOfficeWriter4

X

X

X

LibreOffice Writer version 4.x (.odt).

LibreOfficeWriterTemplate4

X

X

X

LibreOffice Writer Template version 4.x (.ott).

LibreOfficeWriter5

X

X

X

LibreOffice Writer version 5.x (.odt).

LibreOfficeWriterTemplate5

X

X

X

LibreOffice Writer Template version 5.x (.ott).

LibreOfficeWriter6

X

X

X

LibreOffice Writer version 6.x (.odt).

LibreOfficeWriterTemplate6

X

X

X

LibreOffice Writer Template version 6.x (.ott).

OpenDocumentText1

X

X

X

OpenDocument Text version 1.0. Developed by OASIS, based on the original OpenOffice document format. This is a generic format as Apache OpenOffice, LibreOffice, and other applications use this format (.odt;fodt).

OpenDocumentTextEncrypted1

X

X

X

Encrypted OpenDocument Text version 1.0. Developed by OASIS, based on the original OpenOffice document format. This is a generic format as Apache OpenOffice, LibreOffice, and other applications use this format (.odt;fodt).

OpenDocumentTextTemplate1

X

X

X

OpenDocument Text version 1.0. Developed by OASIS, based on the original OpenOffice document format. This is a generic format as Apache OpenOffice, LibreOffice, and other applications use this format (.ott;fott).

OpenDocumentTextTemplateEncrypted1

X

X

X

Encrypted OpenDocument Text version 1.0. Developed by OASIS, based on the original OpenOffice document format. This is a generic format as Apache OpenOffice, LibreOffice, and other applications use this format (.ott;fott).

OpenDocumentText11

X

X

X

OpenDocument Text version 1.1. Developed by OASIS, based on the original OpenOffice document format. This is a generic format as Apache OpenOffice, LibreOffice, and other applications use this format (.odt;fodt).

OpenDocumentTextEncrypted11

X

X

X

Encrypted OpenDocument Text version 1.1. Developed by OASIS, based on the original OpenOffice document format. This is a generic format as Apache OpenOffice, LibreOffice, and other applications use this format (.odt;fodt).

OpenDocumentTextTemplate11

X

X

X

OpenDocument Text version 1.1. Developed by OASIS, based on the original OpenOffice document format. This is a generic format as Apache OpenOffice, LibreOffice, and other applications use this format (.ott;fott).

OpenDocumentTextTemplateEncrypted11

X

X

X

Encrypted OpenDocument Text version 1.1. Developed by OASIS, based on the original OpenOffice document format. This is a generic format as Apache OpenOffice, LibreOffice, and other applications use this format (.ott;fott).

OpenDocumentText12

X

X

X

OpenDocument Text version 1.2. Developed by OASIS, based on the original OpenOffice document format. This is a generic format as Apache OpenOffice, LibreOffice, and other applications use this format (.odt;fodt).

OpenDocumentTextEncrypted12

X

X

X

Encrypted OpenDocument Text version 1.2. Developed by OASIS, based on the original OpenOffice document format. This is a generic format as Apache OpenOffice, LibreOffice, and other applications use this format (.odt;fodt).

OpenDocumentTextTemplate12

X

X

X

OpenDocument Text version 1.2. Developed by OASIS, based on the original OpenOffice document format. This is a generic format as Apache OpenOffice, LibreOffice, and other applications use this format (.ott;fott).

OpenDocumentTextTemplateEncrypted12

X

X

X

Encrypted OpenDocument Text version 1.2. Developed by OASIS, based on the original OpenOffice document format. This is a generic format as Apache OpenOffice, LibreOffice, and other applications use this format (.ott;fott).

OpenDocumentText1_Embedded

X

X

X

Embedded OpenDocument Text version 1.0. Developed by OASIS, based on the original OpenOffice document format. Embedded/extracted ODF documents that are missing their own "manifest.xml" and "mimetype" zip entries. They can result from improper extraction from their parent OpenDocument formats (.odt).

OpenDocumentText11_Embedded

X

X

X

Embedded OpenDocument Text version 1.1. Developed by OASIS, based on the original OpenOffice document format. Embedded/extracted ODF documents that are missing their own "manifest.xml" and "mimetype" zip entries. They can result from improper extraction from their parent OpenDocument formats (.odt).

OpenDocumentText12_Embedded

X

X

X

Embedded OpenDocument Text version 1.2. Developed by OASIS, based on the original OpenOffice document format. Embedded/extracted ODF documents that are missing their own "manifest.xml" and "mimetype" zip entries. They can result from improper extraction from their parent OpenDocument formats (.odt).

HancomHWP2

Hancom HWP2 word processor file format version 2.0 (.hwp).

HancomHWP21

Hancom HWP2.1 word processor file format version 2.1 (.hwp).

HancomHWP3

X

X

Hancom HWP3 word processor file format version 3.0 (.hwp).

HancomHWP5

X

X

X

X

Hancom HWP5 (Hanword) word processor file format version 5.0 (.hwp).

HancomHWP5Encrypted

Encrypted Hancom HWP5 (Hanword) word processor file format version 5.0 (.hwp).

HancomHWPML

X

Hancom Word Processor Markup Language (HWPML) (.hml).

Ichitaro3

Ichitaro version 3 Japanese word processor produced by JustSystems (.jaw;.jbw;.jtw;.juw).

Ichitaro4

Ichitaro version 4 Japanese word processor produced by JustSystems (.jaw;.jbw;.jtw;.juw).

Ichitaro5

X

Ichitaro version 5 Japanese word processor produced by JustSystems (.jaw;.jbw;.jtw;.juw).

Ichitaro6

X

Ichitaro version 6 Japanese word processor produced by JustSystems (.jaw;.jbw;.jtw;.juw).

Ichitaro7

X

X

X

Ichitaro version 7 Japanese word processor produced by JustSystems (.jfw;.jvw;.jtw;.juw).

Ichitaro8

X

X

X

Ichitaro version 8 Japanese word processor produced by JustSystems (.jtd;.jtdc;.jtt;.jttc)

IchitaroCompressed8

X

X

X

Ichitaro compressed version 8 Japanese word processor produced by JustSystems (.jtd;.jtdc;.jtt;.jttc).

PerfectWorks

Novell PerfectWorks for Windows (.wpw).

WordPerfectMac1

X

X

X

Corel WordPerfect version 1.0 for Mac.

WordPerfectMac2

X

X

X

Corel WordPerfect version 2.0 for Mac.

WordPerfectMac3

X

X

X

Corel WordPerfect version 3.0 for Mac.

WordPerfectMac35e

X

X

X

Corel WordPerfect version 3.5e for Mac.

WordPerfect4

X

X

X

Corel WordPerfect version 4.0 (.wp4;.wpf).

WordPerfect42

X

X

X

Corel WordPerfect version 4.2 (.wp4;.wpf).

WordPerfect5

X

X

X

Corel WordPerfect version 5.0 (.wp5;.wp).

WordPerfect5Encrypted

Encrypted Corel WordPerfect version 5.0 (.wp5;.wp).

WordPerfect51

X

X

X

Corel WordPerfect version 5.1 (.wp5;.wp).

WordPerfect51Encrypted

Encrypted Corel WordPerfect version 5.1 (.wp5;.wp).

WordPerfect51FarEast

X

X

X

Corel WordPerfect version 5.1 Far East (.wp5;.wp).

WordPerfect6toX8

X

X

X

Corel WordPerfect versions 6.0 to X8 (.wpd;.wp;.wp6;.wp7).

WordPerfect6toX8Encrypted

Encrypted Corel WordPerfect versions 6.0 to X8 (.wpd;.wp;.wp6;.wp7).

WordPerfectCompoundFile6toX8

X

X

X

Corel WordPerfect versions 6.0 to X8 saved in compound file format (.wp;.wp6;.wp7).

WordPerfectCompoundFile6toX8Encrypted

Encrypted Corel WordPerfect versions 6.0 to X8 saved in compound file format (.wp;.wp6;.wp7).

WordPerfectTemplateFile

Corel WordPerfect Template File. Used by Corel WordPerfect to create automated templates (.wpx).

WordPerfectCompoundFile61

X

X

X

Corel WordPerfect compound file version 6.1(.wpd;.wp;.wp6).

MicrosoftWrite

X

Microsoft Write (.wri).

XyWrite

XyWrite for DOS and Windows versions 1-4. The final version for DOS was 4.18 (1993); for Windows, 4.13 (.xy;.xy3;.xyp;.xy4;.xyw).

WordStar5

WordStar word processor version 5 (.wsd;.ws5;.ws).

WordStar55

WordStar word processor version 5.5 (.wsd;.ws5;.ws).

WordStar6

WordStar word processor version 6 (.wsd;.ws6;.ws).

WordStar7

WordStar word processor version 7 (.wsd;.ws7;.ws).

WordStar2000

WordStar word processor version 2000 (version 1) (.wsd;.wsw;.ws).

LegacyWordProcessor

Legacy (purchased by WordStar) for Windows (.chp).

WordStarWindows

WordStar for Windows (last version of WordStar and was an altered version of LegacyWordProcessor and released as WordStar, 1991) (.wsd).

AmiPro

X

X

Lotus Ami Pro (originally by Samna, Samna was purchased by Lotus Software in 1990) (.sam).

LotusWordPro97

Lotus Word Pro 97 word processor (based on Lotus Ami Pro) (.lwp).

LotusWordPro97Encrypted

Encrypted Lotus Word Pro 87 word processor (based on Lotus Ami Pro) (.lwp).

LotusWordPro9

Lotus Word Pro 9 word processor (.lwp).

FirstChoiceWP

First Choice word processor (.doc).

FirstChoiceWP3

First Choice word processor version 3.0 (.doc).

DisplayWrite

IBM DisplayWrite versions 3.0, 4.0, and 5.0 (.txt;.doc).

DisplayWriteFFT

IBM DisplayWrite Final Form Text (FFT) (.fft;.txt;.doc).

DisplayWriteRFT

IBM DisplayWrite Reversible Format Text (RFT) (.fft;.txt;.doc).

JustWrite

Symantec JustWrite word processor versions 1.0 and 2.0 (.jw).

MultiMate36

MultiMate word processor version 3.3 - 3.6 (.dox;.doc).

MultiMate4

MultiMate word processor version 4.0 (.dox;.doc).

NavyDIFStandard

Navy Data Interchange Format (DIF) is a historical word processor/spreadsheet standard format.

OfficeWriter

OfficeWriter word processor version 6.x (.wp).

Volkswriter

Volkswriter word processor (.vw;.vw3;.vw4).

WangIWP

Wang IWP (.doc).

EnableWP4

Enable WP 4 (.wpf;.en4).

ProfessionalWrite1

Professional Write 1 (.pfs).

ProfessionalWrite2

Professional Write 2 (.pfs).

AdobeFrameMaker

Adobe FrameMaker document (all versions) (.fm).

AdobeFrameMakerMIF3

Adobe FrameMaker Interchange Format document version 3.0 (.mif)..

AdobeFrameMakerMIF4

Adobe FrameMaker Interchange Format document version 4.0 (.mif)..

AdobeFrameMakerMIF5

Adobe FrameMaker Interchange Format document version 5.0 (.mif)..

AdobeFrameMakerMIF55

Adobe FrameMaker Interchange Format document version 5.5 (.mif)..

AdobeFrameMakerMIF6

Adobe FrameMaker Interchange Format document version 6.0 (.mif)..

AdobeFrameMakerMIF

Adobe FrameMaker Interchange Format document (all versions) (.mif).

AbiWord

X

AbiWord Document (open-source word processor similar to Microsoft Word) (.abw).

AbiWordCompressed

X

AbiWord Document (open-source word processor similar to Microsoft Word) (.abw;.zabw;.gz).

AbiWordTemplate

AbiWord Document Template (open-source word processor similar to Microsoft Word) (.abw;.awt).

AbiWordTemplateCompressed

AbiWord Document Template (open-source word processor similar to Microsoft Word) (.abw;.awt;.zawt;.gz).

StarOfficeMath52

X

X

X

StarOffice Formula 5.x (.sxf;.sxm).

StarOfficeMath6to7

X

X

X

StarOffice Math versions 6 (beta) to 7 (.sxm;.sxf).

StarOfficeMath6to7Encrypted

Encrypted StarOffice Math versions 6 (beta) to 7 (.sxm;.sxf).

OpenDocumentMath

X

X

X

OpenDocument Math (formula) document (.sxm;.odf).

OpenDocumentMathEncrypted

X

X

X

Encrypted OpenDocument Math (formula) document (.sxm;.odf).

OpenDocumentMath_Embedded

X

X

X

Embedded OpenDocument Math document. Embedded/extracted ODF documents are missing their own "META-INF/manifest.xml" and "mimetype" zip entries. They can result from improper extraction from their parent OpenDocument formats (.sxm;.odf).

iWorkPages

X

X

Apple iWork '05 - '09 Productivity Suite Pages word processor versions 1.0 - 4.0 (.pages;.pages.zip;.zip).

iWorkPagesEncrypted

Encrypted Apple iWork '05 - '09 Productivity Suite Pages word processor versions 1.0 - 4.0 (.pages;.pages.zip;.zip).

iWorkPages2013

X**

Apple iWork 2013-2016 Productivity Suite Pages word processor versions 5.0 - 6.0 (.pages;.pages.zip;.zip).

iWorkPages2013Encrypted

Encrypted Apple iWork 2013-2016 Productivity Suite Pages word processor versions 5.0 - 6.0 (.pages;.pages.zip;.zip).

ClarisWorksWordProcessor1

ClarisWorks Word Processor versions 1 (.cwk).

ClarisWorksWordProcessor2

ClarisWorks Word Processor versions 2-3 (.cwk).

ClarisWorksWordProcessor4

ClarisWorks Word Processor version 4 (.cwk).

ClarisWorksWordProcessor5

ClarisWorks Word Processor version 5 (.cwk).

AppleWorksWordProcessor6

AppleWorks Word Processor version 6 (originally ClarisWorks and was renamed AppleWorks after version 5) (.cwk).

AbilityWrite

X

X

X

Ability Write word processor format version 4.0-6.0 by Ability Plus Software (.aww).

Scrivener

X

Scrivener word-processing and outliner designed for authors (XML format) (.scrivx).