Class HunmorphAnnotationTokenizer
java.lang.Object
com.github.szgabsz91.morpher.languagehandlers.hunmorph.impl.HunmorphAnnotationTokenizer
Tokenizes Hunmorph-Ocamorph token strings.
-
Field Summary
FieldsModifier and TypeFieldDescriptionThe list of known tokens. -
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptioncom.github.szgabsz91.morpher.languagehandlers.api.model.AnnotationTokenizerResultpreprocess(com.github.szgabsz91.morpher.languagehandlers.api.model.AnnotationTokenizerResult originalResult) Preprocesses the given result and returns the processed version.com.github.szgabsz91.morpher.languagehandlers.api.model.AnnotationTokenizerResultTokenizes the given expression and grammatical form and returns the result.
-
Field Details
-
KNOWN_TOKENS
The list of known tokens.
-
-
Constructor Details
-
HunmorphAnnotationTokenizer
public HunmorphAnnotationTokenizer()
-
-
Method Details
-
tokenize
public com.github.szgabsz91.morpher.languagehandlers.api.model.AnnotationTokenizerResult tokenize(String expression, String grammaticalForm, int frequency) Tokenizes the given expression and grammatical form and returns the result.- Parameters:
expression- the expression containing the tokensgrammaticalForm- the grammatical formfrequency- the frequency- Returns:
- the
AnnotationTokenizerResultinstance
-
getSupportedAffixTypes
-
preprocess
public com.github.szgabsz91.morpher.languagehandlers.api.model.AnnotationTokenizerResult preprocess(com.github.szgabsz91.morpher.languagehandlers.api.model.AnnotationTokenizerResult originalResult) Preprocesses the given result and returns the processed version.- Parameters:
originalResult- the original result- Returns:
- the processed result
-