OpenDiscoverSDK.Interfaces.Content Namespace

This namespace contains the types that represent the content extracted by the interfaces derived from IContentExtractor.

Remarks

The classes and enumerations in this namespace, starting with the DocumentContent base class, define the content extracted by each interface derived from IContentExtractor.

Classes

ArchiveContent Archive top level content.
BooleanListProperty Document metadata property whose value is a list of Boolean values.
BooleanProperty Boolean-valued document metadata property.
ChildDocument Represents an attachment or embedded item that is extracted from another document such as an email, document container (e.g., archive or media image), or word processing document.
ColumnInfo Database table column information.
ContainerFolder Represents an internal folder in a container format such as an archive directory or Microsoft Outlook PST/OST folder.
DatabaseContent Database top level content.
DateTimeListProperty Document metadata property whose value is a list of DateTime values.
DateTimeProperty DateTime-valued document metadata property.
DocumentContent Represents extracted document content.
DocumentPropertyBase<T> Base class for all document metadata properties.
DoubleListProperty Document metadata property whose value is a list of double values.
DoubleProperty Double-valued document metadata property.
EmailAddress Email address information for an email sender or recipient.
EmailDocumentContent Extracted email content.
HtmlDocumentContent Extracted HTML document content.
HtmlImage Represents an HTML 'img' tag.
Hyperlink Represents a document hyperlink.
Int32ListProperty Document metadata property whose value is a list of Int32 values.
Int32Property Int32-valued document metadata property.
Int64ListProperty Document metadata property whose value is a list of Int64 values.
Int64Property Int64-valued document metadata property.
JoinedMimeMessage Represents a joined (complete) MIME message produced by joining all of the message's partial parts.
LanguageIdResult Language identification result.
MailStoreContent Mail store content. PST, OST, and MBOX are examples of mail store (email container) formats.
MessagingStoreContent Messaging store content. Slack, Teams, and Skype exports are examples of messaging store (container) formats.
PdfDocumentContent Extracted PDF document content.
PdfPageInfo Information on a PDF page that failed content extraction.
ScriptRegion Represents a detected Unicode script region in text.
StringListProperty Document metadata property whose value is a list of string values.
StringProperty String-valued document metadata property.
TableInfo Database table information.

Interfaces

IDocumentProperty Common interface for document metadata properties.

Enumerations

ColumnDataType Column data type.
DocumentAttributes Document attributes. These give extra information about a document, such as whether it has hidden content, is password protected (encrypted), has macros, is an inline image (e.g., an image embedded inline in an email), or has external document references.
EmailAddressType Email address type.
EmailBodyType Email body format type.
PropertyType Supported document metadata property data types.
TextSourceType Method by which the document text was extracted.