ContentExtractorFactory Class

Document content extractor factory API.
Inheritance Hierarchy
System.Object
  OpenDiscoverSDK.ContentExtractorFactory

Namespace: OpenDiscoverSDK
Assembly: OpenDiscoverSDK (in OpenDiscoverSDK.dll) Version: 2025.4.4.0 (2025.4.4)
Syntax
C#
public static class ContentExtractorFactory

The ContentExtractorFactory type exposes the following members.

Properties
Name (modifiers): Description
CustomEntityDefinitionsLoaded (public static): Returns true if the static method LoadCustomEntityDefinitions(List<CustomEntityDefinition>) has already been called successfully by the application.
NumCustomEntityDefinitionsLoaded (public static): The number of CustomEntityDefinition items previously loaded via the static method LoadCustomEntityDefinitions(List<CustomEntityDefinition>).
Methods
Name (modifiers): Description
ClearCustomEntityDefinitions (public static): Clears the CustomEntityDefinition items loaded by an earlier call to LoadCustomEntityDefinitions(List<CustomEntityDefinition>). See the remarks on thread safety.
GetAllSplitArchiveFilenames (public static): Given the file path to the main part of a split archive or media image file, returns, in proper order, the file names of all split segments found in the main part's directory.
GetContentExtractor (public static, code example): Returns a content extractor result for the given document using its file format identification result (see IdResult).
JoinMimePartialMessages (public static): Joins MIME partial messages (see MimeEmailPartial) into a single MIME message whose content can then be extracted as one whole message.
LoadCustomEntityDefinitions (public static): Loads user-defined custom item definitions that can be detected in extracted text and metadata (see CustomEntityExtractionEnabled and CustomEntityDefinition). See the remarks: custom item definitions can only be loaded once.
ParseCustomEntityDefinitions (public static): Parses a string containing custom entity definitions into a list of CustomEntityDefinition. This list can then be passed as the argument to LoadCustomEntityDefinitions(List<CustomEntityDefinition>).
ValidateCustomEntityDefinitions (public static): Validates user-defined custom item definitions before attempting to load them into the SDK (see LoadCustomEntityDefinitions(List<CustomEntityDefinition>)).
Remarks
The content extractor factory returns specific interfaces for extracting content from documents (such as text, metadata, and attachments) and for extracting items from archive and mail store containers.
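Because custom item definitions can only be loaded once, an application may want to check CustomEntityDefinitionsLoaded before calling LoadCustomEntityDefinitions. The following is a minimal, self-contained sketch of that guard. It does not reference OpenDiscoverSDK.dll: StubFactory and StubEntityDefinition are hypothetical stand-ins that reuse only the member names listed on this page. The return types, and the choice to return false on a second load, are assumptions, since this page does not show the real signatures or the behavior of a repeated load.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical stand-in for the SDK's CustomEntityDefinition type.
public sealed class StubEntityDefinition
{
    public string Name { get; }
    public StubEntityDefinition(string name) { Name = name; }
}

// Hypothetical stand-in that mirrors only the member names listed on this page.
// Return types and second-load behavior are assumptions, not the SDK's contract.
public static class StubFactory
{
    private static readonly object Gate = new object();
    private static List<StubEntityDefinition> _loaded;

    public static bool CustomEntityDefinitionsLoaded
    {
        get { lock (Gate) { return _loaded != null; } }
    }

    public static int NumCustomEntityDefinitionsLoaded
    {
        get { lock (Gate) { return _loaded == null ? 0 : _loaded.Count; } }
    }

    // Assumption: a second load is rejected by returning false.
    public static bool LoadCustomEntityDefinitions(List<StubEntityDefinition> definitions)
    {
        if (definitions == null) throw new ArgumentNullException(nameof(definitions));
        lock (Gate)
        {
            if (_loaded != null) return false;
            _loaded = new List<StubEntityDefinition>(definitions);
            return true;
        }
    }
}

public static class Program
{
    // Loads the definitions only if nothing has been loaded yet.
    public static bool EnsureLoaded(List<StubEntityDefinition> definitions)
    {
        if (StubFactory.CustomEntityDefinitionsLoaded) return false;
        return StubFactory.LoadCustomEntityDefinitions(definitions);
    }

    public static void Main()
    {
        var defs = new List<StubEntityDefinition>
        {
            new StubEntityDefinition("CaseNumber"),
            new StubEntityDefinition("EmployeeId"),
        };

        Console.WriteLine(EnsureLoaded(defs)); // first call performs the load
        Console.WriteLine(EnsureLoaded(defs)); // second call is skipped by the guard
        Console.WriteLine(StubFactory.NumCustomEntityDefinitionsLoaded);
    }
}
```

With the real SDK, the same guard would wrap ContentExtractorFactory.LoadCustomEntityDefinitions, ideally after ParseCustomEntityDefinitions and ValidateCustomEntityDefinitions have been run on the input. Consult each member's own page for its actual signature and thread-safety remarks.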
See Also