| Package | Description |
|---|---|
| com.aliasi.lm |
Classes for character- and token-based language models.
|
| Modifier and Type | Class and Description |
|---|---|
class |
CompiledTokenizedLM
A
CompiledTokenizedLM implements a tokenized bounded
sequence language model. |
class |
TokenizedLM
A
TokenizedLM provides a dynamic sequence language
model which models token sequences with an n-gram model, and
whitespace and unknown tokens with their own sequence language
models. |
| Modifier and Type | Method and Description |
|---|---|
SortedSet<ScoredObject<String[]>> |
TokenizedLM.newTermSet(int nGram,
int minCount,
int maxReturned,
LanguageModel.Tokenized backgroundLM)
Returns a list of scored n-grams ordered by the significance
of the degree to which their counts in this model exceed their
expected counts in a specified background model.
|
SortedSet<ScoredObject<String[]>> |
TokenizedLM.oldTermSet(int nGram,
int minCount,
int maxReturned,
LanguageModel.Tokenized backgroundLM)
Returns a list of scored n-grams ordered in reverse order
of significance with respect to the background model.
|
Copyright © 2016 Alias-i, Inc.. All rights reserved.