Click or drag to resize

ScientificDataFormat

Supported ScientificDataFormat file formats (IdClassification.ScientificDataFormat - Scientific/mathematic/bioinformatic data formats)

  • All entries in table below are supported for file format identification.
  • 'X' in "Text" column indicates text extraction is supported for the file format.
  • 'X**' in "Text" column indicates text extraction is supported BUT binary-to-text filtering is used on partially parsed document records.
  • 'X' in "Metadata" column indicates metadata extraction is supported for the file format.
  • 'X' in "EmbeddedItem" column indicates embedded item/attachment extraction is supported for the file format.
  • 'X' in "ContentHash" column indicates a content hash is supported for the file format (see MD5ContentHash and SHA1ContentHash)

If a file format does not have a supported content extractor that extracts text then, optionally (default), a binary-to-text content extractor will be used to extract UTF-8, UTF-16, Windows-1252, and ASCII from the binary. In many cases, indexable text can be extract from unknown document formats.

ScientificDataFormat Supported File Formats

File Format Id Enum Value

Text

Metadata

EmbeddedItem

ContentHash

Description

GRIBFormat1

Gridded Binary (GRIB) file format version 1 (created by World Meteorological Organization) (.grb;.wmo).

GRIBFormat2

Gridded Binary (GRIB) file format version 2 (created by World Meteorological Organization) (.grb2;.grb;.grib;.grib2;.wmo).

CDFBinaryFormatPreVer26

Common Data Format (CDF) pre-version 2.6 (self-describing scientific data format for the storage of scalar and multidimensional data in a platform- and discipline-independent way) (.cdf).

CDFBinaryFormat27

Common Data Format (CDF) version 2.6/2.7 (self-describing scientific data format for the storage of scalar and multidimensional data in a platform- and discipline-independent way) (.cdf).

CDFCompressedBinaryFormat27

Compressed Common Data Format (CDF) version 2.6/2.7 (self-describing scientific data format for the storage of scalar and multidimensional data in a platform- and discipline-independent way) (.cdf).

CDFBinaryFormat3

Common Data Format (CDF) version 3.0 (self-describing scientific data format for the storage of scalar and multidimensional data in a platform- and discipline-independent way) (.cdf).

CDFCompressedBinaryFormat3

Compressed Common Data Format (CDF) version 3.0 (self-describing scientific data format for the storage of scalar and multidimensional data in a platform- and discipline-independent way) (.cdf).

CDFXmlFormat

X

Common Data Format (CDF) Markup Language (CDFML) (self-describing scientific data format for the storage of scalar and multidimensional data in a platform- and discipline-independent way) (.xml;.cdf).

NetCDF3

netCDF-3 (classic) data file format (.nc;.cdf).

NetCDF3_64

netCDF-3 64-bit data file format (.nc;.cdf).

SigmaPlotGeneric

Generic SigmaPlot (unknown compound file version) scientific graphing and data analysis version.

SigmaPlot6

X

X

X

SigmaPlot scientific graphing and data analysis version 6.0.

SigmaPlotNotebook6

X

X

X

SigmaPlot Notebook scientific graphing and data analysis version 6.0.

SigmaPlot8

X

X

X

SigmaPlot scientific graphing and data analysis version 8.0.

SigmaPlot11

X

X

X

SigmaPlot scientific graphing and data analysis version 11.0.

MatlabMATFileLevel4

MathWorks MATLAB Level 4 MAT-file binary file format (stores workspace variables) (.mat;.fig).

MatlabMATFileLevel5

MathWorks MATLAB Level 5 MAT-file binary file format (stores workspace variables) (.mat;.fig).

MatlabMATFile7

MathWorks MATLAB version 7.0 MAT-file binary file format (stores workspace variables) (.mat;.fig).

MatlabMATFile73

MathWorks MATLAB version 7.3 MAT-file binary file format (stores workspace variables) (.mat;.fig).

MatlabLiveScript

X

X

MathWorks MATLAB Live Script (Live scripts contain code, output, and graphics) (.mxl).

MinitabPortableWorksheet

Minitab statistical software Portable Worksheet Format (.mtp).

MinitabWorksheetCompoundFile

X

X

X

Minitab statistical software worksheet file (compound document format)(.mtw).

MinitabWorksheet61

Minitab statistical software worksheet file version 6.1 (.mtw).

MinitabGraphCompoundFile

Minitab data analysis and statistical software graph file (compound file format) (.mgf).

SPlusAsciiDataDumpFile

S-Plus statistical software ASCII data dump file (.sdd).

SPlusBinaryDataDumpFile

S-Plus statistical software binary data dump file.

StataDataFile104

Stata data analysis and statisical software data file format 104 (.dta).

StataDataFile105

Stata data analysis and statisical software data file format 105 (.dta).

StataDataFile106

Stata data analysis and statisical software data file format 106 (.dta).

StataDataFile107

Stata data analysis and statisical software data file format 107 (.dta).

StataDataFile108

Stata data analysis and statisical software data file format 108 (.dta).

StataDataFile109

Stata data analysis and statisical software data file format 109 (.dta).

StataDataFile110

Stata data analysis and statisical software data file format 110 (associated with a Stata version earlier than version 8) (.dta).

StataDataFile111

Stata data analysis and statisical software data file format 111 (associated with a Stata version earlier than version 8) (.dta).

StataDataFile113

Stata data analysis and statisical software data file format 113 (associated with Stata version 8) (.dta).

StataDataFile114

Stata data analysis and statisical software data file format 114 (associated with Stata version 10) (.dta).

StataDataFile115

Stata data analysis and statisical software data file format 115 (associated with Stata version 12) (.dta).

StataDataFile117

Stata data analysis and statisical software data file format 117 (associated with Stata version 13, no external customer format 116 ever released) (.dta).

StataDataFile118

Stata data analysis and statisical software data file format 118 (associated with Stata version 14) (.dta).

StataDataFile119

Stata data analysis and statisical software data file format 119 (associated with Stata version 15) (.dta).

SAS7BDataFile

Statistical Analysis System (SAS), data analysis and statisical software, binary data file (.sas7bdat;.sd7).

SASTransportFormatV5V6

Statistical Analysis System (SAS), data analysis and statisical software, SAS Transport Format Version 5 or 6 (.xpt;.tpt).

SASTransportFormatV8V9

Statistical Analysis System (SAS), data analysis and statisical software, SAS Transport Format Version 8 or 9 (.xpt;.tpt).

SPSSPortableDataFormatAscii

SPSS (Statistical Package for the Social Sciences), data analysis and statisical software, Portable Data Format (.por).

SPSSDataFile

SPSS (Statistical Package for the Social Sciences), data analysis and statisical software, Data File (.sav).

OriginProject

OriginLab Origin project file. Origin is used for advanced graphing and data analysis (.opj).