Package opennlp.tools.langdetect
Class DefaultLanguageDetectorContextGenerator
java.lang.Object
opennlp.tools.langdetect.DefaultLanguageDetectorContextGenerator
- All Implemented Interfaces:
Serializable, LanguageDetectorContextGenerator
public class DefaultLanguageDetectorContextGenerator
extends Object
implements LanguageDetectorContextGenerator
A context generator for the language detector.
Constructor Summary
Constructors
- Constructor: DefaultLanguageDetectorContextGenerator(int minLength, int maxLength, CharSequenceNormalizer... normalizers)
  Description: Creates a customizable DefaultLanguageDetectorContextGenerator that computes ngrams from text.
Method Summary
- Modifier and Type: <T extends CharSequence> T[]
  Method: getContext(CharSequence document)
  Description: Retrieves the contexts for a document using character ngrams.
-
Constructor Details
-
DefaultLanguageDetectorContextGenerator
public DefaultLanguageDetectorContextGenerator(int minLength, int maxLength, CharSequenceNormalizer... normalizers)
Creates a customizable DefaultLanguageDetectorContextGenerator that computes ngrams from text.
- Parameters:
minLength - The minimum number of characters in an ngram. Must be greater than 0.
maxLength - The maximum number of characters in an ngram. Must be greater than 0 and greater than minLength.
normalizers - Zero or more normalizers to be applied to the text before extracting ngrams.
-
-
Method Details
-
getContext
Description copied from interface: LanguageDetectorContextGenerator
Retrieves the contexts for a document using character ngrams.
- Specified by:
getContext in interface LanguageDetectorContextGenerator
- Parameters:
document - The textual input from which contexts are extracted.
- Returns:
An array of contexts on which a model bases its decisions.
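To illustrate what "contexts using character ngrams" means, here is a minimal, self-contained sketch of character ngram extraction over a length range. It is not the OpenNLP implementation: the class CharNgramSketch and the method charNgrams are hypothetical names, the argument check simply mirrors the constraints documented for minLength and maxLength, and a plain lower-casing step stands in for a CharSequenceNormalizer. Collecting distinct ngrams in first-seen order is also an assumption; the library's ordering and handling of duplicates may differ.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

/** Hypothetical helper, not part of OpenNLP: extracts character ngrams from a text. */
final class CharNgramSketch {

  /**
   * Returns the distinct character ngrams of {@code text} whose length lies in
   * [minLength, maxLength], shorter lengths first, in first-seen order.
   */
  static List<String> charNgrams(CharSequence text, int minLength, int maxLength) {
    // Assumption: enforce the constraints stated in the parameter documentation.
    if (minLength <= 0 || maxLength <= minLength) {
      throw new IllegalArgumentException("Require 0 < minLength < maxLength");
    }
    Set<String> grams = new LinkedHashSet<>();
    for (int n = minLength; n <= maxLength; n++) {
      // Slide a window of n characters across the text.
      for (int start = 0; start + n <= text.length(); start++) {
        grams.add(text.subSequence(start, start + n).toString());
      }
    }
    return new ArrayList<>(grams);
  }

  public static void main(String[] args) {
    // Stand-in for a normalizer: lower-case the input before extracting ngrams.
    String normalized = "Hello".toLowerCase(Locale.ROOT);
    System.out.println(charNgrams(normalized, 1, 2));
    // prints [h, e, l, o, he, el, ll, lo]
  }
}
```

In this sketch, a text shorter than minLength yields an empty list, and widening the range from (1, 2) to (1, 3) adds the three-character windows after the shorter ones.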
-