Language Id SettingsMax Language Id Words Property
Definition
Assembly: OpenDiscoverSDK.Interfaces (in OpenDiscoverSDK.Interfaces.dll) Version: 2026.2.6.0 (2026.02.06)
[DataMemberAttribute]
public int MaxLanguageIdWords { get; set; }Property Value
Int32Remarks
This property is ignored if IdentifyLanguages is false.
Only the first 'MaxLanguageIdWords' words are used in language identification if extracted text length exceeds this length.
Take into consideration that if processing documents with 100's of thousands of words that there is a performance penalty for identifying the languages present in all of this text.
Default property value: 10,000 [words] - up to the first 10,000 extracted words are used for language identification. For English, 10K words is approximately 45,000 alpha characters if the average word token size is 4.5 characters. Minimum allowed property value: 500 [words] Maximum allowed property value: 100,000 [words].