| Package | Description |
|---|---|
| com.aliasi.chunk | Classes for extracting meaningful chunks (spans) of text. |
| com.aliasi.classify | Classes for classifying data and evaluation. |
| com.aliasi.corpus | Classes for parsing and handling various corpora. |
| com.aliasi.hmm | Classes for estimating and decoding hidden Markov models. |
| com.aliasi.lm | Classes for character- and token-based language models. |
| com.aliasi.sentences | Classes for sentence-boundary detection. |
| com.aliasi.spell | Classes for spelling correction and edit distance. |
| com.aliasi.tag | Classes and interfaces for sequence tagging, including evaluators. |

| Modifier and Type | Class and Description |
|---|---|
| class | CharLmHmmChunker: A CharLmHmmChunker employs a hidden Markov model estimator and tokenizer factory to learn a chunker. |
| class | CharLmRescoringChunker: A CharLmRescoringChunker provides a long-distance character language model-based chunker that operates by rescoring the output of a contained character language model HMM chunker. |
| class | ChunkerEvaluator: The ChunkerEvaluator class provides an evaluation framework for chunkers. |
| class | TrainTokenShapeChunker: A TrainTokenShapeChunker is used to train a token and shape-based chunker. |

| Modifier and Type | Class and Description |
|---|---|
| class | BaseClassifierEvaluator<E>: A BaseClassifierEvaluator provides an evaluation harness for first-best classifiers. |
| class | BernoulliClassifier<E>: A BernoulliClassifier provides a feature-based classifier where feature values are reduced to booleans based on a specified threshold. |
| class | BinaryLMClassifier: A BinaryLMClassifier is a boolean dynamic language model classifier for use when there are two categories, but training data is only available for one of the categories. |
| class | ConditionalClassifierEvaluator<E>: A ConditionalClassifierEvaluator provides an evaluation harness for conditional probability-based n-best classifiers. |
| class | DynamicLMClassifier<L extends LanguageModel.Dynamic>: A DynamicLMClassifier is a language model classifier that accepts training events of categorized character sequences. |
| class | JointClassifierEvaluator<E>: A JointClassifierEvaluator provides an evaluation harness for joint probability-based n-best classifiers. |
| class | KnnClassifier<E>: A KnnClassifier implements k-nearest-neighbor classification based on feature extraction and a vector proximity or distance. |
| class | NaiveBayesClassifier: A NaiveBayesClassifier provides a trainable naive Bayes text classifier, with tokens as features. |
| class | RankedClassifierEvaluator<E>: A RankedClassifierEvaluator provides an evaluation harness for ranked classifiers. |
| class | ScoredClassifierEvaluator<E>: A ScoredClassifierEvaluator provides an evaluation harness for score-based classifiers. |
| class | TfIdfClassifierTrainer<E>: A TfIdfClassifierTrainer provides a framework for training discriminative classifiers based on term-frequency (TF) and inverse document frequency (IDF) weighting of features. |
| class | TradNaiveBayesClassifier: A TradNaiveBayesClassifier implements a traditional token-based approach to naive Bayes text classification. |

| Modifier and Type | Class and Description |
|---|---|
| class | Corpus<H extends Handler>: The Corpus abstract class provides a basis for passing training and testing data to data handlers. |
| class | DiskCorpus<H extends Handler>: A DiskCorpus reads data from a specified training and test directory using a specified parser. |
| class | InputSourceParser<H extends Handler>: An InputSourceParser is an abstract parser based on an abstract method for parsing from an input source. |
| class | Parser<H extends Handler>: The Parser abstract class provides methods for parsing content from an input source or character sequence and passing extracted events to a content handler. |
| class | StringParser<H extends Handler>: A StringParser is an abstract parser based on an abstract method for parsing from a character slice. |
| class | XMLParser<H extends Handler>: An XMLParser adapts a handler to be used to handle text extracted from an XML source. |

| Modifier and Type | Interface and Description |
|---|---|
| interface | ObjectHandler<E>: The ObjectHandler interface specifies a handler with a single method that takes a single argument of the type of the generic parameter. |

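
The single-method handler described above can be illustrated with a small, self-contained sketch. It does not use the LingPipe classes: `SimpleObjectHandler`, `HandlerSketch`, and `visit` are hypothetical names introduced here, and the method name `handle` is an assumption about how such an interface is typically shaped.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-in for a single-method generic handler.
// This is an illustration, not the LingPipe source.
interface SimpleObjectHandler<E> {
    void handle(E e);
}

class HandlerSketch {
    // Pushes every item to the handler, roughly the way a corpus or
    // parser would pass extracted objects to a handler.
    static <E> void visit(List<E> items, SimpleObjectHandler<E> handler) {
        for (E item : items) {
            handler.handle(item);
        }
    }

    public static void main(String[] args) {
        List<String> seen = new ArrayList<>();
        // Because the interface has one method, a method reference
        // or lambda can serve as the handler.
        visit(Arrays.asList("a", "b", "c"), seen::add);
        System.out.println(seen);
    }
}
```

Keeping the interface to one method means callers can collect, count, or train on the items without the producer knowing which of these is happening.
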
| Modifier and Type | Class and Description |
|---|---|
| class | XValidatingObjectCorpus<E>: An XValidatingObjectCorpus holds a list of items which it uses to provide training and testing items using cross-validation. |

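
As a rough illustration of how a list of items can be split into training and testing sections by fold, here is a minimal sketch. It is not the LingPipe implementation: `CrossValidationSketch`, `foldStart`, `testItems`, and `trainItems` are hypothetical names, and the contiguous-slice scheme is an assumption chosen for simplicity.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

class CrossValidationSketch {
    // Index of the first item in the given fold; fold == numFolds
    // yields the list size, so folds tile the list without gaps.
    static int foldStart(int fold, int numFolds, int size) {
        return (int) ((long) fold * size / numFolds);
    }

    // Items in the fold's slice are held out for testing.
    static <E> List<E> testItems(List<E> items, int fold, int numFolds) {
        int start = foldStart(fold, numFolds, items.size());
        int end = foldStart(fold + 1, numFolds, items.size());
        return new ArrayList<>(items.subList(start, end));
    }

    // Everything outside the fold's slice is used for training.
    static <E> List<E> trainItems(List<E> items, int fold, int numFolds) {
        int start = foldStart(fold, numFolds, items.size());
        int end = foldStart(fold + 1, numFolds, items.size());
        List<E> train = new ArrayList<>(items.subList(0, start));
        train.addAll(items.subList(end, items.size()));
        return train;
    }

    public static void main(String[] args) {
        List<Integer> items = Arrays.asList(0, 1, 2, 3, 4, 5, 6, 7, 8, 9);
        int numFolds = 3;
        for (int fold = 0; fold < numFolds; fold++) {
            System.out.println("fold " + fold
                + " test=" + testItems(items, fold, numFolds)
                + " train=" + trainItems(items, fold, numFolds));
        }
    }
}
```

In practice the items are usually shuffled once before folding so that each fold is a representative sample.
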
| Modifier and Type | Class and Description |
|---|---|
| class | AbstractHmmEstimator: An HmmEstimator may be used to train a hidden Markov model (HMM). |
| class | HmmCharLmEstimator: An HmmCharLmEstimator employs a maximum a posteriori transition estimator and a bounded character language model emission estimator. |

| Modifier and Type | Interface and Description |
|---|---|
| static interface | LanguageModel.Dynamic: A LanguageModel.Dynamic accepts training events in the form of character slices or sequences. |

| Modifier and Type | Class and Description |
|---|---|
| class | NGramBoundaryLM: An NGramBoundaryLM provides a dynamic sequence language model for which training, estimation, and pruning may be interleaved. |
| class | NGramProcessLM: An NGramProcessLM provides a dynamic conditional process language model for which training, estimation, and pruning may be interleaved. |
| class | TokenizedLM: A TokenizedLM provides a dynamic sequence language model which models token sequences with an n-gram model, and whitespace and unknown tokens with their own sequence language models. |
| class | UniformBoundaryLM: A UniformBoundaryLM implements a uniform sequence language model with a specified number of outcomes and the same probability assigned to the end-of-stream marker. |
| class | UniformProcessLM: A UniformLM.Sequence implements a uniform sequence language model with a specified number of outcomes and the same probability assigned to the end-of-stream marker. |

| Modifier and Type | Class and Description |
|---|---|
| class | SentenceEvaluator: A SentenceEvaluator handles reference chunkings by constructing a response chunking and adding them to a sentence evaluation. |

| Modifier and Type | Class and Description |
|---|---|
| class | TfIdfDistance: The TfIdfDistance class provides a string distance based on term frequency (TF) and inverse document frequency (IDF). |
| class | TrainSpellChecker: A TrainSpellChecker instance provides a mechanism for collecting training data for a compiled spell checker. |

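
To make the idea of a TF/IDF-based string distance concrete, the following sketch computes a textbook cosine proximity over TF-IDF weighted term vectors. It is not the LingPipe `TfIdfDistance` implementation, whose exact weighting may differ: `TfIdfSketch` and its methods are hypothetical, whitespace tokenization is assumed, and the weight used here is raw count times `log(numDocs / docFreq)`.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;

class TfIdfSketch {
    private final Map<String, Integer> docFreq = new HashMap<>();
    private int numDocs = 0;

    // Assumption: lowercase, whitespace-separated tokens.
    static String[] tokenize(String s) {
        return s.toLowerCase().trim().split("\\s+");
    }

    // Records one training document, counting each term once per document.
    void train(String doc) {
        numDocs++;
        for (String term : new HashSet<>(Arrays.asList(tokenize(doc)))) {
            docFreq.merge(term, 1, Integer::sum);
        }
    }

    // Maps each known term to count * log(numDocs / docFreq).
    Map<String, Double> vector(String text) {
        Map<String, Integer> counts = new HashMap<>();
        for (String term : tokenize(text)) {
            counts.merge(term, 1, Integer::sum);
        }
        Map<String, Double> weights = new HashMap<>();
        for (Map.Entry<String, Integer> e : counts.entrySet()) {
            int df = docFreq.getOrDefault(e.getKey(), 0);
            if (df == 0) continue; // terms unseen in training carry no weight
            weights.put(e.getKey(), e.getValue() * Math.log((double) numDocs / df));
        }
        return weights;
    }

    // Cosine of the two weighted vectors; 0.0 if either vector is empty.
    double proximity(String a, String b) {
        Map<String, Double> va = vector(a);
        Map<String, Double> vb = vector(b);
        double dot = 0.0, normA = 0.0, normB = 0.0;
        for (Map.Entry<String, Double> e : va.entrySet()) {
            normA += e.getValue() * e.getValue();
            dot += e.getValue() * vb.getOrDefault(e.getKey(), 0.0);
        }
        for (double w : vb.values()) {
            normB += w * w;
        }
        if (normA == 0.0 || normB == 0.0) return 0.0;
        return dot / (Math.sqrt(normA) * Math.sqrt(normB));
    }

    double distance(String a, String b) {
        return 1.0 - proximity(a, b);
    }

    public static void main(String[] args) {
        TfIdfSketch tfIdf = new TfIdfSketch();
        tfIdf.train("the cat sat");
        tfIdf.train("the dog ran");
        tfIdf.train("a bird flew");
        System.out.println(tfIdf.distance("the cat", "the dog"));
    }
}
```

Because rare terms receive larger IDF weights, two strings that share only a common word such as "the" end up farther apart than two strings that share a rare word.
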
| Modifier and Type | Class and Description |
|---|---|
| class | MarginalTaggerEvaluator<E>: A MarginalTaggerEvaluator evaluates marginal taggers either directly or by adding their outputs. |
| class | NBestTaggerEvaluator<E>: An NBestTaggerEvaluator provides an evaluation framework for n-best taggers. |
| class | TaggerEvaluator<E>: A TaggerEvaluator provides evaluation for first-best taggers implementing the Tagger interface. |

Copyright © 2019 Alias-i, Inc. All rights reserved.