public class Tokenizer extends Object
| Modifier and Type | Field and Description |
|---|---|
protected static TokenizationStep |
ts |
| Constructor and Description |
|---|
Tokenizer() |
| Modifier and Type | Method and Description |
|---|---|
static Tokens |
tokenize(ITextUnit textUnit,
LocaleId language,
String... tokenNames) |
static Tokens |
tokenize(String string,
LocaleId language,
String... tokenNames) |
static Tokens |
tokenize(TextContainer textContainer,
LocaleId language,
String... tokenNames) |
static Tokens |
tokenize(TextFragment textFragment,
LocaleId language,
String... tokenNames) |
protected static Tokens |
tokenizeString(String text,
LocaleId language,
String... tokenNames)
Extracts tokens from the given text.
|
protected static TokenizationStep ts
protected static Tokens tokenizeString(String text, LocaleId language, String... tokenNames)
text - Text to tokenize.language - Language of the text.tokenNames - Optional list of token names. If omitted, all tokens will be extracted.public static Tokens tokenize(TextContainer textContainer, LocaleId language, String... tokenNames)
public static Tokens tokenize(TextFragment textFragment, LocaleId language, String... tokenNames)
Copyright © 2018. All rights reserved.