public abstract class AbstractConceptExtractor extends Object implements ConceptExtractor
Abstract class for concept extraction which is the super class of AbstractPhraseExtractor, AbstractTermExtractor, AbstractTokenExtrator, and AbstractTripleExtractor
Copyright: Copyright (c) 2005
Company: IST, Drexel University
| Modifier and Type | Field and Description |
|---|---|
protected ConceptFilter |
cf |
protected boolean |
conceptFilter_enabled |
protected ArrayList |
conceptList |
protected DocumentParser |
parser |
protected boolean |
subconcept_enabled |
| Constructor and Description |
|---|
AbstractConceptExtractor() |
| Modifier and Type | Method and Description |
|---|---|
ArrayList |
extractFromDoc(Document doc)
Extracts concepts from a parsed document
|
ArrayList |
extractFromDoc(String doc)
Extracts concepts from a raw document
|
ConceptFilter |
getConceptFilter()
Gets the concept filter used for this extractor.
|
ArrayList |
getConceptList() |
DocumentParser |
getDocumentParser()
Gets document parser.
|
boolean |
getFilteringOption()
Tests if the extractor applies concept filtering.
|
boolean |
getSubConceptOption() |
SortedArray |
mergeConceptByEntryID(ArrayList termList)
The concepts with identical entry id will be merged.
|
SortedArray |
mergeConceptByName(ArrayList termList)
The concepts with identical names will be merged.
|
void |
print(PrintWriter out)
Print out the extract concepts to the speficid print writer.
|
void |
print(PrintWriter out,
ArrayList conceptList)
Print out the given list of concepts to the speficid print writer.
|
void |
setConceptFilter(ConceptFilter cf)
Sets the concept filter for the concept extatractor
|
void |
setDocumentParser(DocumentParser parser)
Sets the document parser for the concept extractor.
|
void |
setFilteringOption(boolean option)
Sets the option of concept filtering
|
void |
setSubConceptOption(boolean option) |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitextractFromSentence, getLemmatiser, initDocExtraction, setLemmatiser, supportConceptEntry, supportConceptNameprotected ArrayList conceptList
protected boolean conceptFilter_enabled
protected boolean subconcept_enabled
protected ConceptFilter cf
protected DocumentParser parser
public void setSubConceptOption(boolean option)
setSubConceptOption in interface ConceptExtractorpublic boolean getSubConceptOption()
getSubConceptOption in interface ConceptExtractorpublic boolean getFilteringOption()
ConceptExtractorgetFilteringOption in interface ConceptExtractorpublic void setFilteringOption(boolean option)
ConceptExtractorsetFilteringOption in interface ConceptExtractoroption - the option of concept filteringpublic void setConceptFilter(ConceptFilter cf)
ConceptExtractorsetConceptFilter in interface ConceptExtractorcf - the concept filterpublic ConceptFilter getConceptFilter()
ConceptExtractorgetConceptFilter in interface ConceptExtractorpublic ArrayList getConceptList()
getConceptList in interface ConceptExtractorpublic void print(PrintWriter out)
ConceptExtractorprint in interface ConceptExtractorout - the print writerpublic void print(PrintWriter out, ArrayList conceptList)
ConceptExtractorprint in interface ConceptExtractorout - the print writerconceptList - a list concepts for outputpublic SortedArray mergeConceptByEntryID(ArrayList termList)
ConceptExtractormergeConceptByEntryID in interface ConceptExtractortermList - a list of conceptspublic SortedArray mergeConceptByName(ArrayList termList)
ConceptExtractormergeConceptByName in interface ConceptExtractortermList - a list of conceptspublic ArrayList extractFromDoc(String doc)
ConceptExtractorextractFromDoc in interface ConceptExtractordoc - the content of the documentpublic ArrayList extractFromDoc(Document doc)
ConceptExtractorextractFromDoc in interface ConceptExtractordoc - a parsed documentpublic DocumentParser getDocumentParser()
ConceptExtractorgetDocumentParser in interface ConceptExtractorpublic void setDocumentParser(DocumentParser parser)
ConceptExtractorsetDocumentParser in interface ConceptExtractorparser - document parserCopyright © 2018 JULIE Lab, Germany. All rights reserved.