| Package | Description |
|---|---|
| com.aliasi.chunk |
Classes for extracting meaningful chunks (spans) of text.
|
| com.aliasi.crf |
Classes and interfaces for conditional random fields.
|
| Modifier and Type | Class and Description |
|---|---|
class |
BioTagChunkCodec
The
BioTagChunkCodec implements a chunk to tag
coder/decoder based on the BIO encoding scheme and a
specified tokenizer factory. |
class |
IoTagChunkCodec
The
IoTagChunkCodec implements a chunk to tag
coder/decoder based on the IO encoding scheme and a
specified tokenizer factory. |
| Modifier and Type | Method and Description |
|---|---|
static ObjectHandler<StringTagging> |
TagChunkCodecAdapters.chunkingToStringTagging(TagChunkCodec codec,
ObjectHandler<Chunking> handler)
Return the string tagging handler that converts string taggings
to chunkings.
|
static ObjectHandler<Tagging<String>> |
TagChunkCodecAdapters.chunkingToTagging(TagChunkCodec codec,
ObjectHandler<Chunking> handler)
Returns the tagging handler that converts taggings to chunkings
using the specified codec.
|
static ObjectHandler<Chunking> |
TagChunkCodecAdapters.stringTaggingToChunking(TagChunkCodec codec,
ObjectHandler<StringTagging> handler)
Return the chunking handler that converts chunkings to taggings
using the specified codec.
|
static ObjectHandler<Chunking> |
TagChunkCodecAdapters.taggingToChunking(TagChunkCodec codec,
ObjectHandler<Tagging<String>> handler)
Return the chunking handler that converts chunkings to simple
taggings using the specified codec.
|
| Modifier and Type | Method and Description |
|---|---|
TagChunkCodec |
ChainCrfChunker.codec()
Returns the tag/chunk coder/decoder for this chunker.
|
| Modifier and Type | Method and Description |
|---|---|
static ChainCrfChunker |
ChainCrfChunker.estimate(Corpus<ObjectHandler<Chunking>> chunkingCorpus,
TagChunkCodec codec,
TokenizerFactory tokenizerFactory,
ChainCrfFeatureExtractor<String> featureExtractor,
boolean addInterceptFeature,
int minFeatureCount,
boolean cacheFeatureVectors,
RegressionPrior prior,
int priorBlockSize,
AnnealingSchedule annealingSchedule,
double minImprovement,
int minEpochs,
int maxEpochs,
Reporter reporter)
Return the chain CRF-based chunker estimated from the specified
corpus, which is converted to a tagging corpus using the
specified coder/decoder and tokenizer factory, then passed to
the chain CRF estimate method along with the rest of the
arguments.
|
| Constructor and Description |
|---|
ChainCrfChunker(ChainCrf<String> crf,
TokenizerFactory tokenizerFactory,
TagChunkCodec codec)
Construct a chunker based on the specified chain conditional
random field, tokenizer factory and tag-chunk coder/decoder.
|
Copyright © 2019 Alias-i, Inc.. All rights reserved.