| Package | Description |
|---|---|
| edu.umass.cs.mallet.base.extract |
| Modifier and Type | Class and Description |
|---|---|
| class | StringTokenization |
| Modifier and Type | Method and Description |
|---|---|
| Tokenization | DocumentExtraction.getInput() |
| Modifier and Type | Method and Description |
|---|---|
| LabeledSpans | TokenizationFilter.constructLabeledSpans(LabelAlphabet dict, Object document, Label backgroundTag, Tokenization input, Sequence seq) Converts the sequence of labels into a set of labeled spans. |
| LabeledSpans | HierarchicalTokenizationFilter.constructLabeledSpans(LabelAlphabet dict, Object document, Label backgroundTag, Tokenization input, Sequence seq) |
| LabeledSpans | DefaultTokenizationFilter.constructLabeledSpans(LabelAlphabet dict, Object document, Label backgroundTag, Tokenization input, Sequence seq) |
| LabeledSpans | ConfidenceTokenizationFilter.constructLabeledSpans(LabelAlphabet dict, Object document, Label backgroundTag, Tokenization input, Sequence seq) |
| LabeledSpans | BIOTokenizationFilter.constructLabeledSpans(LabelAlphabet dict, Object document, Label backgroundTag, Tokenization input, Sequence seq) |
| protected Span | BIOTokenizationFilter.createSpan(Tokenization input, int startTokenIdx, int endTokenIdx) |
| Extraction | Extractor.extract(Tokenization toks) Performs extraction from an object that has already been tokenized. |
| Extraction | CRFExtractor.extract(Tokenization spans) |
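
The `constructLabeledSpans` methods listed above turn a per-token label sequence into labeled spans. The sketch below illustrates that idea only: it merges each maximal run of identical non-background labels into one span. `LabeledSpanSketch`, its nested `LabeledSpan` class, and the use of plain strings for tokens and labels are hypothetical stand-ins, not the MALLET types, and the merging rule is an assumption rather than the behavior of any particular filter class.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-in types; this is NOT MALLET's implementation.
class LabeledSpanSketch {

    /** A labeled run of tokens covering token indices [start, end). */
    static final class LabeledSpan {
        final String label;
        final int start; // inclusive token index
        final int end;   // exclusive token index

        LabeledSpan(String label, int start, int end) {
            this.label = label;
            this.start = start;
            this.end = end;
        }

        /** Joins the covered tokens with single spaces. */
        String text(List<String> tokens) {
            return String.join(" ", tokens.subList(start, end));
        }

        @Override
        public String toString() {
            return label + "[" + start + "," + end + ")";
        }
    }

    /** Merges each maximal run of one non-background label into a span. */
    static List<LabeledSpan> constructLabeledSpans(List<String> labels, String backgroundTag) {
        List<LabeledSpan> spans = new ArrayList<>();
        String current = backgroundTag;
        int start = -1;
        // Iterate one step past the end; the sentinel background label closes any open span.
        for (int i = 0; i <= labels.size(); i++) {
            String label = i < labels.size() ? labels.get(i) : backgroundTag;
            if (!label.equals(current)) {
                if (!current.equals(backgroundTag)) {
                    spans.add(new LabeledSpan(current, start, i));
                }
                current = label;
                start = i;
            }
        }
        return spans;
    }

    public static void main(String[] args) {
        List<String> tokens = Arrays.asList("Yesterday", "John", "Smith", "visited", "Boston");
        List<String> labels = Arrays.asList("O", "PER", "PER", "O", "LOC");
        for (LabeledSpan span : constructLabeledSpans(labels, "O")) {
            System.out.println(span + ": " + span.text(tokens));
        }
        // prints:
        // PER[1,3): John Smith
        // LOC[4,5): Boston
    }
}
```

Tokens tagged with the background label never appear in a span, which is presumably the role of the `backgroundTag` parameter in the signatures above.
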
| Constructor and Description |
|---|
| DocumentExtraction(String name, LabelAlphabet dict, Tokenization input, LabeledSpans predictedSpans, LabeledSpans trueSpans, String background) |
| DocumentExtraction(String name, LabelAlphabet dict, Tokenization input, Sequence predicted, Sequence target, String background) |
| DocumentExtraction(String name, LabelAlphabet dict, Tokenization input, Sequence predicted, Sequence target, String background, TokenizationFilter filter) |
| DocumentExtraction(String name, LabelAlphabet dict, Tokenization input, Sequence predicted, String background) |
| Extraction(Extractor extractor, LabelAlphabet dict, String name, Tokenization input, Sequence output, String background) Creates an extraction given a sequence output by some kind of per-sequence labeler, such as an HMM or a CRF. |
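
The constructors above pair a tokenized input with a predicted label sequence (and optionally a target sequence) plus a background label. As a rough sketch of how such an object might be assembled from a per-sequence labeler's output, the example below uses a hypothetical `ExtractionSketch` class with plain string lists in place of `Tokenization` and `Sequence`, and a `java.util.function.Function` in place of an HMM or CRF. None of these names or behaviors are taken from MALLET.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;
import java.util.function.Function;

// Hypothetical stand-in for a document-level extraction; NOT MALLET's class.
class ExtractionSketch {
    final String name;
    final List<String> input;      // tokenized document
    final List<String> predicted;  // one predicted label per token
    final List<String> target;     // one true label per token, or null if unknown
    final String background;       // label meaning "not part of any span"

    ExtractionSketch(String name, List<String> input, List<String> predicted,
                     List<String> target, String background) {
        if (predicted.size() != input.size()) {
            throw new IllegalArgumentException("need exactly one predicted label per token");
        }
        if (target != null && target.size() != input.size()) {
            throw new IllegalArgumentException("need exactly one target label per token");
        }
        this.name = name;
        this.input = input;
        this.predicted = predicted;
        this.target = target;
        this.background = background;
    }

    /** Runs an arbitrary per-sequence labeler on already tokenized input. */
    static ExtractionSketch extract(String name, List<String> tokens,
                                    Function<List<String>, List<String>> labeler,
                                    String background) {
        return new ExtractionSketch(name, tokens, labeler.apply(tokens), null, background);
    }

    List<String> getInput() { return input; }

    List<String> predictedSpans() { return spans(predicted); }

    /** Empty when no target labels were supplied. */
    List<String> trueSpans() { return target == null ? new ArrayList<>() : spans(target); }

    /** Renders each maximal run of one non-background label as "LABEL: text". */
    private List<String> spans(List<String> labels) {
        List<String> out = new ArrayList<>();
        int i = 0;
        while (i < labels.size()) {
            String label = labels.get(i);
            int j = i + 1;
            while (j < labels.size() && labels.get(j).equals(label)) j++;
            if (!label.equals(background)) {
                out.add(label + ": " + String.join(" ", input.subList(i, j)));
            }
            i = j;
        }
        return out;
    }

    public static void main(String[] args) {
        // Toy labeler: tags tokens found in a fixed name list as PER, everything else as O.
        Set<String> names = new HashSet<>(Arrays.asList("Ada", "Lovelace"));
        Function<List<String>, List<String>> labeler = tokens -> {
            List<String> labels = new ArrayList<>();
            for (String t : tokens) labels.add(names.contains(t) ? "PER" : "O");
            return labels;
        };
        ExtractionSketch ex = extract("doc1",
                Arrays.asList("Call", "Ada", "Lovelace", "today"), labeler, "O");
        System.out.println(ex.predictedSpans()); // prints [PER: Ada Lovelace]
    }
}
```

Keeping the target sequence optional mirrors the listing above, where one `DocumentExtraction` constructor takes both `predicted` and `target` and another takes only `predicted`.
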
Copyright © 2019 JULIE Lab, Germany. All rights reserved.