| Package | Description |
|---|---|
| com.aliasi.dict |
Classes for handling dictionaries.
|
| com.aliasi.spell |
Classes for spelling correction and edit distance.
|
| Modifier and Type | Field and Description |
|---|---|
static WeightedEditDistance |
ApproxDictionaryChunker.TT_DISTANCE
This is a weighted edit distance defined by Tsuruoka and Tsujii
for matching protein names in biomedical texts.
|
| Modifier and Type | Method and Description |
|---|---|
WeightedEditDistance |
ApproxDictionaryChunker.editDistance()
Returns the weighted edit distance for matching with
this chunker.
|
| Constructor and Description |
|---|
ApproxDictionaryChunker(TrieDictionary<String> dictionary,
TokenizerFactory tokenizerFactory,
WeightedEditDistance editDistance,
double distanceThreshold)
Construct an approximate dictionary chunker from the specified
dictionary, tokenizer factory, weighted edit distance and
distance bound.
|
| Modifier and Type | Class and Description |
|---|---|
class |
FixedWeightEditDistance
A
FixedWeightEditDistance sets constant weights for
the edit operations for weighted edit distance. |
| Modifier and Type | Field and Description |
|---|---|
static WeightedEditDistance |
CompiledSpellChecker.CASE_RESTORING
A weighted edit distance ordered by similarity that treats case
variants as zero cost and all other edits as infinite cost.
|
static WeightedEditDistance |
CompiledSpellChecker.TOKENIZING
A weighted edit distance ordered by similarity that allows free
space insertion.
|
| Modifier and Type | Method and Description |
|---|---|
WeightedEditDistance |
AutoCompleter.editDistance()
Returns the weighted edit distance for this auto-completer.
|
WeightedEditDistance |
CompiledSpellChecker.editDistance()
Returns the weighted edit distance for this compiled spell
checker.
|
WeightedEditDistance |
TrainSpellChecker.editDistance()
Returns the weighted edit distance (channel model) underlying this spell checker
trainer.
|
| Modifier and Type | Method and Description |
|---|---|
void |
CompiledSpellChecker.setEditDistance(WeightedEditDistance editDistance)
Sets the edit distance for this spell checker to the
specified value.
|
| Constructor and Description |
|---|
AutoCompleter(Map<String,? extends Number> phraseCounts,
WeightedEditDistance editDistance,
int maxResultsPerPrefix,
int maxSearchQueueSize,
double minScore)
Construct an automatic completer from the specified phrases,
phrase counts, edit distance, and search parameters.
|
CompiledSpellChecker(CompiledNGramProcessLM lm,
WeightedEditDistance editDistance,
Set<String> tokenSet)
Construct a compiled spell checker based on the specified
language model and edit distance, with a null tokenizer
factory, the specified set of valid output tokens, with default
value for n-best size, known token edit cost and first and
second character edit costs.
|
CompiledSpellChecker(CompiledNGramProcessLM lm,
WeightedEditDistance editDistance,
Set<String> tokenSet,
int nBestSize)
Construct a compiled spell checker based on the specified
language model and edit distance, a null tokenizer factory, the
set of valid output tokens, and maximum n-best size, with
default known token and first and second character edit costs.
|
CompiledSpellChecker(CompiledNGramProcessLM lm,
WeightedEditDistance editDistance,
TokenizerFactory factory,
Set<String> tokenSet,
int nBestSize)
Construct a compiled spell checker based on the specified
language model and edit distance, tokenizer factory, the
set of valid output tokens, and maximum n-best size, with
default known token and first and second character edit costs.
|
CompiledSpellChecker(CompiledNGramProcessLM lm,
WeightedEditDistance editDistance,
TokenizerFactory factory,
Set<String> tokenSet,
int nBestSize,
double knownTokenEditCost,
double firstCharEditCost,
double secondCharEditCost)
Construct a compiled spell checker based on the specified
language model and similarity edit distance, set of valid
output tokens, maximum n-best size per character, and the
specified edit penalities for editing known tokens or the first
or second characters of tokens.
|
TrainSpellChecker(NGramProcessLM lm,
WeightedEditDistance editDistance)
Construct a non-tokenizing spell checker trainer from the
specified language model and edit distance.
|
TrainSpellChecker(NGramProcessLM lm,
WeightedEditDistance editDistance,
TokenizerFactory tokenizerFactory)
Construct a spell checker trainer from the specified n-gram
process language model, tokenizer factory and edit distance.
|
Copyright © 2016 Alias-i, Inc.. All rights reserved.