public abstract class AbstractDualConceptExtractor extends Object implements DualConceptExtractor
Dual concept extractor for dual indexing
Copyright: Copyright (c) 2005
Company: IST, Drexel University
| Modifier and Type | Field and Description |
|---|---|
protected ArrayList |
firstConceptList |
protected DocumentParser |
parser |
protected ArrayList |
secondConceptList |
| Constructor and Description |
|---|
AbstractDualConceptExtractor() |
| Modifier and Type | Method and Description |
|---|---|
boolean |
extractFromDoc(Article article)
Extracts concepts from a semi-structured article
|
boolean |
extractFromDoc(String doc)
Extracts concepts from a raw document
|
protected String |
getArticleContent(Article article) |
DocumentParser |
getDocumentParser()
Gets document parser.
|
ArrayList |
getFirstConceptList()
Gets the first set of concepts extracted before
|
ArrayList |
getSecondConceptList()
Gets the second set of concepts extracted before
|
void |
initDocExtraction()
It is required to call this method before one calls extractFromDoc method.
|
boolean |
isExtractionMerged() |
void |
setDocumentParser(DocumentParser parser)
Sets the document parser for the concept extractor.
|
boolean |
supportConceptEntry()
Tests if the extracted concept has an entry ID.
|
boolean |
supportConceptName()
Tests if the extracted concept has a name.
|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitextractFromDoc, extractFromSentenceprotected ArrayList firstConceptList
protected ArrayList secondConceptList
protected DocumentParser parser
public boolean extractFromDoc(Article article)
DualConceptExtractorextractFromDoc in interface DualConceptExtractorarticle - a semi-structured article for extractionpublic boolean extractFromDoc(String doc)
DualConceptExtractorextractFromDoc in interface DualConceptExtractordoc - a raw document for extractionpublic ArrayList getFirstConceptList()
DualConceptExtractorgetFirstConceptList in interface DualConceptExtractorpublic ArrayList getSecondConceptList()
DualConceptExtractorgetSecondConceptList in interface DualConceptExtractorpublic boolean isExtractionMerged()
public boolean supportConceptName()
DualConceptExtractorsupportConceptName in interface DualConceptExtractorpublic boolean supportConceptEntry()
DualConceptExtractorsupportConceptEntry in interface DualConceptExtractorpublic void initDocExtraction()
DualConceptExtractorinitDocExtraction in interface DualConceptExtractorpublic DocumentParser getDocumentParser()
DualConceptExtractorgetDocumentParser in interface DualConceptExtractorpublic void setDocumentParser(DocumentParser parser)
DualConceptExtractorsetDocumentParser in interface DualConceptExtractorparser - document parserCopyright © 2018 JULIE Lab, Germany. All rights reserved.