Package opennlp.tools.parser.treeinsert
Class Parser
java.lang.Object
opennlp.tools.parser.AbstractBottomUpParser
opennlp.tools.parser.treeinsert.Parser
- All Implemented Interfaces:
Parser
A built-attach Parser implementation.
Nodes are built when their left-most child is encountered. Subsequent children are attached as daughters. Attachment sites are chosen from the nodes on the right frontier of the tree. After each build or attach step, nodes are assessed as either complete or incomplete. Complete nodes are no longer eligible for daughter attachment.
Complex modifiers which produce additional node levels of the same type are attached with sister-adjunction. Attachment cannot take place higher in the right frontier than an incomplete node.
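The two restrictions above (complete nodes take no further daughters, and nothing above an incomplete node may receive an attachment) can be illustrated with a small, self-contained sketch. The class AttachmentSitesSketch and its FrontierNode type are hypothetical and not part of OpenNLP; the sketch assumes the frontier is already given from deepest to shallowest and reflects only one simplified reading of the description, not the parser's actual implementation.

```java
import java.util.ArrayList;
import java.util.List;

/** Toy illustration only; these types are hypothetical and not part of OpenNLP. */
final class AttachmentSitesSketch {

    /** A right-frontier node reduced to a label and a completeness flag. */
    static final class FrontierNode {
        final String label;
        final boolean complete;

        FrontierNode(String label, boolean complete) {
            this.label = label;
            this.complete = complete;
        }
    }

    /**
     * Walks the frontier from deepest to shallowest and collects the nodes that
     * may still be considered as attachment sites. The walk stops after the first
     * incomplete node, because nothing above it may receive an attachment.
     */
    static List<FrontierNode> candidateSites(List<FrontierNode> deepestFirst) {
        List<FrontierNode> sites = new ArrayList<>();
        for (FrontierNode node : deepestFirst) {
            sites.add(node);
            if (!node.complete) {
                break;
            }
        }
        return sites;
    }

    /** Only incomplete nodes may still take new daughters. */
    static boolean canTakeDaughter(FrontierNode node) {
        return !node.complete;
    }

    public static void main(String[] args) {
        List<FrontierNode> frontier = List.of(
                new FrontierNode("NP", true),
                new FrontierNode("VP", false),
                new FrontierNode("S", false));
        for (FrontierNode site : candidateSites(frontier)) {
            System.out.println(site.label + " daughter attachment allowed: " + canTakeDaughter(site));
        }
    }
}
```

In this toy frontier the walk considers NP and VP but never reaches S, since VP is incomplete.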
Field Summary
Fields
Modifier and Type | Field | Description
static final String | ATTACH_DAUGHTER | Outcome used when a node should be attached as a daughter to another node.
static final String | ATTACH_SISTER | Outcome used when a node should be attached as a sister to another node.
static final String | BUILT | Label used to distinguish built nodes from non-built nodes.
static final String | DONE | Outcome used when a constituent needs no additional parent node/building.
static final String | NON_ATTACH | Outcome used when a node should not be attached to another node.
Fields inherited from class opennlp.tools.parser.AbstractBottomUpParser
COMPLETE, CONT, defaultAdvancePercentage, defaultBeamSize, INC_NODE, INCOMPLETE, OTHER, START, TOK_NODE, TOP_NODE
Constructor Summary
Constructors
Constructor | Description
Parser(ParserModel model) | Instantiates a Parser via a given model.
Parser(ParserModel model, int beamSize, double advancePercentage) | Instantiates a Parser via a given model and other configuration parameters.
Method Summary
Modifier and Type | Method | Description
 | getRightFrontier(Parse root, Set<String> punctSet) | Returns the right frontier of the specified tree with nodes ordered from deepest to shallowest.
static ParserModel | train(String languageCode, ObjectStream<Parse> parseSamples, HeadRules rules, int iterations, int cutoff) | Starts a training of a ParserModel.
static ParserModel | train(String languageCode, ObjectStream<Parse> parseSamples, HeadRules rules, TrainingParameters mlParams) | Starts a training of a ParserModel.
Methods inherited from class opennlp.tools.parser.AbstractBottomUpParser
buildDictionary, buildDictionary, collapsePunctuation, parse, parse, setErrorReporting, setParents
-
Field Details
-
DONE
Outcome used when a constituent needs no additional parent node/building.
-
ATTACH_SISTER
Outcome used when a node should be attached as a sister to another node.
-
ATTACH_DAUGHTER
Outcome used when a node should be attached as a daughter to another node.
-
NON_ATTACH
Outcome used when a node should not be attached to another node.
-
BUILT
Label used to distinguish built nodes from non-built nodes.
-
-
Constructor Details
-
Parser
Instantiates a Parser via a given model and other configuration parameters. Uses the default implementations of POSTaggerME and ChunkerME.
- Parameters:
model - The ParserModel to use.
beamSize - The number of different parses kept during parsing.
advancePercentage - The minimal amount of probability mass which advanced outcomes must represent. Only outcomes which contribute to the top advancePercentage will be explored.
- Throws:
IllegalStateException - Thrown if the ParserType is not supported.
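The advancePercentage parameter describes pruning by probability mass. A minimal sketch of that idea follows, assuming the outcomes are ranked by probability and kept until their cumulative mass reaches the threshold. The class AdvanceMassSketch and the method outcomesToAdvance are hypothetical names, and the parser's actual pruning logic may differ in detail.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Toy illustration only; not part of OpenNLP. */
final class AdvanceMassSketch {

    /**
     * Returns the indices of the outcomes to explore, most probable first.
     * Outcomes are added until the accumulated probability mass reaches
     * advancePercentage (assumed here to be a fraction between 0 and 1).
     */
    static List<Integer> outcomesToAdvance(double[] probs, double advancePercentage) {
        Integer[] order = new Integer[probs.length];
        for (int i = 0; i < order.length; i++) {
            order[i] = i;
        }
        // Sort outcome indices by descending probability.
        Arrays.sort(order, (a, b) -> Double.compare(probs[b], probs[a]));

        List<Integer> kept = new ArrayList<>();
        double mass = 0.0;
        for (int idx : order) {
            if (mass >= advancePercentage) {
                break;
            }
            kept.add(idx);
            mass += probs[idx];
        }
        return kept;
    }

    public static void main(String[] args) {
        double[] probs = {0.1, 0.6, 0.3};
        System.out.println(outcomesToAdvance(probs, 0.85));
    }
}
```

A lower threshold explores fewer outcomes per step, which trades accuracy for speed in the same way a smaller beamSize does.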
-
Parser
Instantiates a Parser via a given model. Uses the default implementations of POSTaggerME and ChunkerME and default values for beamSize and advancePercentage.
- Parameters:
model - The ParserModel to use.
- Throws:
IllegalStateException - Thrown if the ParserType is not supported.
-
-
Method Details
-
getRightFrontier
Returns the right frontier of the specified tree with nodes ordered from deepest to shallowest.
- Parameters:
root - The root of the parse tree.
punctSet - A set of punctuation symbols to be used.
- Returns:
The right frontier of the specified parse tree.
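The ordering "from deepest to shallowest" can be shown on a toy tree. The sketch below uses a hypothetical Node type rather than the OpenNLP Parse class, ignores punctuation handling (the punctSet parameter has no counterpart here), and assumes that only nodes with children count as frontier nodes. It illustrates the concept and is not the library's implementation.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

/** Toy illustration only; not part of OpenNLP. */
final class RightFrontierSketch {

    /** Minimal tree node: a label plus an ordered list of children. */
    static final class Node {
        final String label;
        final List<Node> children = new ArrayList<>();

        Node(String label, Node... kids) {
            this.label = label;
            this.children.addAll(Arrays.asList(kids));
        }
    }

    /**
     * Follows the right-most child from the root downwards, recording every
     * node that has children, then reverses the path so that the deepest
     * node comes first.
     */
    static List<String> rightFrontier(Node root) {
        List<String> frontier = new ArrayList<>();
        Node current = root;
        while (current != null && !current.children.isEmpty()) {
            frontier.add(current.label);
            current = current.children.get(current.children.size() - 1);
        }
        Collections.reverse(frontier);
        return frontier;
    }

    public static void main(String[] args) {
        Node tree = new Node("S",
                new Node("NP", new Node("she")),
                new Node("VP",
                        new Node("V", new Node("saw")),
                        new Node("NP", new Node("the"), new Node("cat"))));
        System.out.println(rightFrontier(tree)); // prints [NP, VP, S]
    }
}
```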
-
train
public static ParserModel train(String languageCode, ObjectStream<Parse> parseSamples, HeadRules rules, TrainingParameters mlParams) throws IOException
Starts a training of a ParserModel.
- Parameters:
languageCode - An ISO-conformant language code.
parseSamples - The samples as input.
rules - The HeadRules to use.
mlParams - The parameters for training.
- Returns:
A valid ParserModel.
- Throws:
IOException - Thrown if IO errors occurred during training.
-
train
public static ParserModel train(String languageCode, ObjectStream<Parse> parseSamples, HeadRules rules, int iterations, int cutoff) throws IOException
Starts a training of a ParserModel.
- Parameters:
languageCode - An ISO-conformant language code.
parseSamples - The samples as input.
rules - The HeadRules to use.
iterations - The number of iterations to be conducted.
cutoff - The cut-off parameter to be used.
- Returns:
A valid ParserModel.
- Throws:
IOException - Thrown if IO errors occurred during training.
-