| Package | Description |
|---|---|
| com.aliasi.chunk | Classes for extracting meaningful chunks (spans) of text. |
| com.aliasi.classify | Classes for classifying data and evaluation. |
| com.aliasi.hmm | Classes for estimating and decoding hidden Markov models. |
| com.aliasi.lm | Classes for character- and token-based language models. |
| com.aliasi.spell | Classes for spelling correction and edit distance. |
| com.aliasi.test.unit.lm | |

Classes in com.aliasi.lm used by com.aliasi.chunk

| Class | Description |
|---|---|
| LanguageModel.Process | A LanguageModel.Process is normalized by length. |
| LanguageModel.Sequence | A LanguageModel.Sequence is normalized over all character sequences. |
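
Read as normalization conditions, the two descriptions above differ in what the probabilities sum over. The following is one reading, not a statement taken from the source; the alphabet symbol $\Sigma$ and the names $P_{\mathrm{proc}}$ and $P_{\mathrm{seq}}$ are notation assumed here for illustration:

```latex
\sum_{s \in \Sigma^{n}} P_{\mathrm{proc}}(s) = 1 \quad \text{for each length } n \ge 0,
\qquad
\sum_{s \in \Sigma^{*}} P_{\mathrm{seq}}(s) = 1 .
```

Under this reading, a process model defines a separate distribution for each sequence length, while a sequence model defines a single distribution across sequences of all lengths.
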

Classes in com.aliasi.lm used by com.aliasi.classify

| Class | Description |
|---|---|
| LanguageModel | A LanguageModel provides an estimate of the probability of a sequence of characters. |
| LanguageModel.Dynamic | A LanguageModel.Dynamic accepts training events in the form of character slices or sequences. |
| NGramBoundaryLM | An NGramBoundaryLM provides a dynamic sequence language model for which training, estimation, and pruning may be interleaved. |
| NGramProcessLM | An NGramProcessLM provides a dynamic conditional process language model for which training, estimation, and pruning may be interleaved. |
| TokenizedLM | A TokenizedLM provides a dynamic sequence language model which models token sequences with an n-gram model, and whitespace and unknown tokens with their own sequence language models. |

Classes in com.aliasi.lm used by com.aliasi.hmm

| Class | Description |
|---|---|
| NGramBoundaryLM | An NGramBoundaryLM provides a dynamic sequence language model for which training, estimation, and pruning may be interleaved. |

Classes in com.aliasi.lm used by com.aliasi.lm

| Class | Description |
|---|---|
| CharSeqCounter | A CharSeqCounter provides counts for sequences of characters. |
| IntSeqCounter | An IntSeqCounter provides counts for sequences of integers. |
| LanguageModel | A LanguageModel provides an estimate of the probability of a sequence of characters. |
| LanguageModel.Conditional | A LanguageModel.Conditional is a language model that implements conditional estimates of characters given previous characters. |
| LanguageModel.Dynamic | A LanguageModel.Dynamic accepts training events in the form of character slices or sequences. |
| LanguageModel.Process | A LanguageModel.Process is normalized by length. |
| LanguageModel.Sequence | A LanguageModel.Sequence is normalized over all character sequences. |
| LanguageModel.Tokenized | A LanguageModel.Tokenized provides a means of estimating the probability of a sequence of tokens. |
| NGramBoundaryLM | An NGramBoundaryLM provides a dynamic sequence language model for which training, estimation, and pruning may be interleaved. |
| NGramProcessLM | An NGramProcessLM provides a dynamic conditional process language model for which training, estimation, and pruning may be interleaved. |
| TrieCharSeqCounter | A TrieCharSeqCounter stores counts for substrings of strings. |
| TrieIntSeqCounter | A TrieIntSeqCounter implements an integer sequence counter with a trie structure of counts. |
| TrieReader | The TrieReader interface provides a means to read a trie structure with counts. |
| TrieWriter | The TrieWriter interface provides a means to write an arbitrary trie structure with positive node counts. |
| UniformBoundaryLM | A UniformBoundaryLM implements a uniform sequence language model with a specified number of outcomes and the same probability assigned to the end-of-stream marker. |
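
The UniformBoundaryLM description can be made concrete with a little arithmetic. The sketch below is a hypothetical stand-in, not the LingPipe class: the name `UniformBoundarySketch` and its method are invented for illustration, and it assumes that "the same probability" means each of the `numOutcomes` characters and the end-of-stream marker receives probability 1/(numOutcomes + 1).

```java
// Hypothetical stand-in, NOT the LingPipe UniformBoundaryLM class.
// Assumption: numOutcomes characters plus one end-of-stream marker
// all share the same probability, 1 / (numOutcomes + 1).
final class UniformBoundarySketch {
    private final double log2PerEvent;

    UniformBoundarySketch(int numOutcomes) {
        if (numOutcomes < 1) {
            throw new IllegalArgumentException("numOutcomes must be positive");
        }
        // log2 of the shared per-event probability.
        this.log2PerEvent = -Math.log(numOutcomes + 1.0) / Math.log(2.0);
    }

    // Log (base 2) probability of the characters followed by the
    // end-of-stream marker: one event per character, plus one for the marker.
    double log2Estimate(CharSequence cs) {
        return log2PerEvent * (cs.length() + 1);
    }

    public static void main(String[] args) {
        UniformBoundarySketch lm = new UniformBoundarySketch(255);
        System.out.println(lm.log2Estimate("abc"));
    }
}
```

With 255 outcomes there are 256 equally likely events, so each costs about 8 bits, and a three-character sequence plus the end-of-stream marker costs about 32 bits under this assumption.
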

Classes in com.aliasi.lm used by com.aliasi.spell

| Class | Description |
|---|---|
| CompiledNGramProcessLM | A CompiledNGramProcessLM implements a conditional process language model. |
| NGramProcessLM | An NGramProcessLM provides a dynamic conditional process language model for which training, estimation, and pruning may be interleaved. |

Classes in com.aliasi.lm used by com.aliasi.test.unit.lm

| Class | Description |
|---|---|
| CompiledNGramBoundaryLM | A CompiledNGramBoundaryLM is constructed by reading the serialized form of an instance of NGramBoundaryLM. |
| LanguageModel.Conditional | A LanguageModel.Conditional is a language model that implements conditional estimates of characters given previous characters. |
| LanguageModel.Dynamic | A LanguageModel.Dynamic accepts training events in the form of character slices or sequences. |
| NGramBoundaryLM | An NGramBoundaryLM provides a dynamic sequence language model for which training, estimation, and pruning may be interleaved. |
| NGramProcessLM | An NGramProcessLM provides a dynamic conditional process language model for which training, estimation, and pruning may be interleaved. |
| TokenizedLM | A TokenizedLM provides a dynamic sequence language model which models token sequences with an n-gram model, and whitespace and unknown tokens with their own sequence language models. |
| TrieCharSeqCounter | A TrieCharSeqCounter stores counts for substrings of strings. |
| TrieReader | The TrieReader interface provides a means to read a trie structure with counts. |
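
To illustrate what "stores counts for substrings of strings" can look like in a trie, here is a minimal, self-contained sketch. It is not LingPipe's TrieCharSeqCounter and does not reproduce its API: the class `SubstringTrieSketch`, its methods, and the `maxLength` bound are all hypothetical names chosen for this example.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch, NOT LingPipe's TrieCharSeqCounter.
// Each trie node holds the count of the character sequence spelled
// by the path from the root to that node.
final class SubstringTrieSketch {
    private static final class Node {
        long count;
        final Map<Character, Node> children = new HashMap<>();
    }

    private final Node root = new Node();
    private final int maxLength;

    SubstringTrieSketch(int maxLength) {
        if (maxLength < 1) {
            throw new IllegalArgumentException("maxLength must be positive");
        }
        this.maxLength = maxLength;
    }

    // Increment the count of every non-empty substring of text
    // whose length is at most maxLength.
    void incrementSubstrings(CharSequence text) {
        for (int start = 0; start < text.length(); start++) {
            Node node = root;
            int end = Math.min(text.length(), start + maxLength);
            for (int i = start; i < end; i++) {
                node = node.children.computeIfAbsent(text.charAt(i), c -> new Node());
                node.count++;
            }
        }
    }

    // Count of the given sequence; 0 if it was never seen. The root is never
    // incremented, so the empty sequence also reports 0 in this sketch.
    long count(CharSequence cs) {
        Node node = root;
        for (int i = 0; i < cs.length(); i++) {
            node = node.children.get(cs.charAt(i));
            if (node == null) {
                return 0L;
            }
        }
        return node.count;
    }
}
```

Because substrings that share a prefix share a path, one pass from each start position updates all of the substrings beginning there. Bounding the depth by `maxLength` keeps the trie from growing with the full length of each training string.
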

Copyright © 2016 Alias-i, Inc. All rights reserved.