LanguageIdSettings Class

Extracted text language identification settings.

Definition

Namespace: OpenDiscoverSDK.Interfaces.Settings
Assembly: OpenDiscoverSDK.Interfaces (in OpenDiscoverSDK.Interfaces.dll) Version: 2026.2.6.0 (2026.02.06)
C#
[DataContractAttribute]
public class LanguageIdSettings
Inheritance
Object    LanguageIdSettings

Remarks

If ExtractionType is set to MetadataOnly then the properties of this class are ignored because no text is extracted in metadata only mode.

Constructors

LanguageIdSettings Constructor.

Properties

IdentifyLanguages Determines if languages present in extract text are to be identified by content extractors.
LatinScriptRegionPartitionSize Used by language identification algorithm, see IdentifyLanguages, to partition detected Latin script regions into smaller character ranges of this size.
MaxLanguageIdWords Maximum number of words (alpha only tokens) in extracted text to use for language identification. Note: common language Unicode scripts like numbers and punctuation are not included, only all alpha "words" are used for language detection.
PartitionLatinScriptRegions Determines if Latin script regions detected during the language identification process (see IdentifyLanguages) are partitioned into smaller regions of LatinScriptRegionPartitionSize number of words.

Methods

EqualsDetermines whether the specified object is equal to the current object.
(Inherited from Object)
GetHashCodeServes as the default hash function.
(Inherited from Object)
GetTypeGets the Type of the current instance.
(Inherited from Object)
ToStringReturns a string that represents the current object.
(Inherited from Object)

See Also