| Package | Description |
|---|---|
| com.aliasi.classify |
Classes for classifying data and evaluation.
|
| com.aliasi.lm |
Classes for character- and token-based language models.
|
| Modifier and Type | Class and Description |
|---|---|
class |
LMClassifier<L extends LanguageModel,M extends MultivariateDistribution>
An
LMClassifier performs joint probability-based
classification of character sequences into non-overlapping
categories based on language models for each category and a
multivariate distribution over categories. |
| Constructor and Description |
|---|
LMClassifier(String[] categories,
L[] languageModels,
M categoryDistribution)
Construct a joint classifier for character sequences
classifying over a specified set of categories, with a
multivariate distribution over those categories and a language
model per category.
|
| Modifier and Type | Interface and Description |
|---|---|
static interface |
LanguageModel.Conditional
A
LanguageModel.Conditional is a language model
that implements conditional estimates of characters given
previous characters. |
static interface |
LanguageModel.Dynamic
A
LanguageModel.Dynamic accepts training events in
the form of character slices or sequences. |
static interface |
LanguageModel.Process
A
LanguageModel.Process is normalized by length. |
static interface |
LanguageModel.Sequence
A
LanguageModel.Sequence is normalized over all
character sequences. |
static interface |
LanguageModel.Tokenized
A
LanguageModel.Tokenized provides a means of
estimating the probability of a sequence of tokens. |
| Modifier and Type | Class and Description |
|---|---|
class |
CompiledNGramBoundaryLM
A
CompiledNGramBoundaryLM is constructed by reading
the serialized form of an instance of NGramBoundaryLM. |
class |
CompiledNGramProcessLM
A
CompiledNGramProcessLM implements a conditional
process language model. |
class |
CompiledTokenizedLM
A
CompiledTokenizedLM implements a tokenized bounded
sequence language model. |
class |
NGramBoundaryLM
An
NGramBoundaryLM provides a dynamic sequence
language model for which training, estimation and pruning may be
interleaved. |
class |
NGramProcessLM
An
NGramProcessLM provides a dynamic conditional
process language model process for which training, estimation, and
pruning may be interleaved. |
class |
TokenizedLM
A
TokenizedLM provides a dynamic sequence language
model which models token sequences with an n-gram model, and
whitespace and unknown tokens with their own sequence language
models. |
class |
UniformBoundaryLM
A
UniformBoundaryLM implements a uniform sequence
language model with a specified number of outcomes and the same
probability assigned to the end-of-stream marker. |
class |
UniformProcessLM
A
UniformLM.Sequence implements a uniform sequence
language model with a specified number of outcomes and the same
probability assigned to the end-of-stream marker. |
Copyright © 2016 Alias-i, Inc.. All rights reserved.