Click or drag to resize

eBook

Supported eBook file formats (IdClassification.eBook - Electronic book (e-book, eBook, etc.) is a book publication in digital form)

  • All entries in table below are supported for file format identification.
  • 'X' in "Text" column indicates text extraction is supported for the file format.
  • 'X**' in "Text" column indicates text extraction is supported BUT binary-to-text filtering is used on partially parsed document records.
  • 'X' in "Metadata" column indicates metadata extraction is supported for the file format.
  • 'X' in "EmbeddedItem" column indicates embedded item/attachment extraction is supported for the file format.
  • 'X' in "ContentHash" column indicates a content hash is supported for the file format (see MD5ContentHash and SHA1ContentHash)

If a file format does not have a supported content extractor that extracts text then, optionally (default), a binary-to-text content extractor will be used to extract UTF-8, UTF-16, Windows-1252, and ASCII from the binary. In many cases, indexable text can be extract from unknown document formats.

eBook Supported File Formats

File Format Id Enum Value

Text

Metadata

EmbeddedItem

ContentHash

Description

ePub

X

X

Open eBook Forum (of the International Digital Publishing Forum (IDPF)) ePub book format (.epub).

ePub_Encrypted

X

X

Encrypted Open eBook Forum (of the International Digital Publishing Forum (IDPF)) ePub book format (.epub).

iBooks

X

Apple iBooks electronic book (eBook) file format (.ibooks).

iBooksAuthor

Apple iBooks Author electronic book (eBook) authoring application file format (.iba).

FictionBook2

X

FictionBook 2.0 file saved in an eBook format. This XML based format was developed in Russia and specifies the structure of the eBook instead of the appearance (.fb2).

MicrosoftReaderEBook

Microsoft Reader software (discontinued) eBook format (.lit).

MobiPocketEBook

Mobipocket eBook format (purchased by Amazon.com) (.mobi).

AmazonKindleEBook

Amazon Kindle eBook format (.azw;.azw3;.azw4;.azw6).

BroadBandEBook

BroadBand eBook (created by Sony for Sony ebook readers) (.lrf).

RocketBookEBook

Rocket Book eBook (.rb).