Converts a language tag (a language ID and an optional region/country part) to the ICU format: the language in lower case,
the region in upper case, and the parts delimited by an underscore.
Don't use this. It only "understands" language + region,
which is just a small part of what BCP 47 supports.
Also, LocaleId now does case normalization out of the box.
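As a rough illustration of the conversion described above, the following is a minimal sketch and not the library's implementation. The class name IcuTagSketch and the method toIcuStyle are hypothetical names chosen for this example, and the sketch assumes the input uses either a hyphen or an underscore as its delimiter.

```java
import java.util.Locale;

// Hypothetical illustration only; this class is not part of the Okapi API.
public class IcuTagSketch {

    // Lower-cases the language, upper-cases whatever follows the first
    // delimiter, and joins the two parts with an underscore.
    static String toIcuStyle(String tag) {
        if (tag == null || tag.isEmpty()) {
            return tag;
        }
        // Assumption: '-' or '_' separates the language from the region.
        String[] parts = tag.split("[-_]", 2);
        String language = parts[0].toLowerCase(Locale.ROOT);
        if (parts.length == 1 || parts[1].isEmpty()) {
            return language; // no region part present
        }
        return language + "_" + parts[1].toUpperCase(Locale.ROOT);
    }

    public static void main(String[] args) {
        System.out.println(toIcuStyle("EN-us"));
        System.out.println(toIcuStyle("fr"));
    }
}
```

Because a sketch like this only distinguishes a language part from a region part, a tag carrying a script subtag such as zh-Hant-TW is handled incorrectly: everything after the first delimiter is treated as the region. This is the kind of limitation the note above warns about.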
NUM - Static variable in interface net.sf.okapi.steps.tokenization.engine.javacc.WordTokenizerConstants