Package de.julielab.jcore.reader.ign
Class IGNReader
- java.lang.Object
-
- org.apache.uima.resource.Resource_ImplBase
-
- org.apache.uima.resource.ConfigurableResource_ImplBase
-
- org.apache.uima.collection.CollectionReader_ImplBase
-
- de.julielab.jcore.reader.ign.IGNReader
-
- All Implemented Interfaces:
org.apache.uima.collection.base_cpm.BaseCollectionReader,org.apache.uima.collection.CollectionReader,org.apache.uima.resource.ConfigurableResource,org.apache.uima.resource.Resource
public class IGNReader extends org.apache.uima.collection.CollectionReader_ImplBaseThe IGNReader reads corpus files in BioC-format.
There are XML files comprising the actual text (as well as passage and sentence annotations) and there are separate XML files comprising the annotations.- Author:
- engelmann
-
-
Field Summary
Fields Modifier and Type Field Description static StringPARAM_INPUTDIR_ANNOString parameter indicating path to the directory containing files in BioC-format that comprise the annotations.static StringPARAM_INPUTDIR_TEXTString parameter indicating path to the directory containing files in BioC-format that comprise the actual text.static StringPUBLICATION_DATES_FILEoptional Parameter providing the file path to a file mapping the article ids to the corresponding publication years
-
Constructor Summary
Constructors Constructor Description IGNReader()
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description voidclose()voidgetNext(org.apache.uima.cas.CAS aCas)org.apache.uima.util.Progress[]getProgress()booleanhasNext()voidinitialize()-
Methods inherited from class org.apache.uima.collection.CollectionReader_ImplBase
destroy, getCasInitializer, getProcessingResourceMetaData, initialize, isConsuming, reconfigure, setCasInitializer, typeSystemInit
-
Methods inherited from class org.apache.uima.resource.ConfigurableResource_ImplBase
getConfigParameterValue, getConfigParameterValue, setConfigParameterValue, setConfigParameterValue
-
Methods inherited from class org.apache.uima.resource.Resource_ImplBase
getCasManager, getLogger, getMetaData, getRelativePathResolver, getResourceManager, getUimaContext, getUimaContextAdmin, setLogger, setMetaData
-
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
-
-
-
-
Field Detail
-
PARAM_INPUTDIR_TEXT
public static final String PARAM_INPUTDIR_TEXT
String parameter indicating path to the directory containing files in BioC-format that comprise the actual text.- See Also:
- Constant Field Values
-
PARAM_INPUTDIR_ANNO
public static final String PARAM_INPUTDIR_ANNO
String parameter indicating path to the directory containing files in BioC-format that comprise the annotations.- See Also:
- Constant Field Values
-
PUBLICATION_DATES_FILE
public static final String PUBLICATION_DATES_FILE
optional Parameter providing the file path to a file mapping the article ids to the corresponding publication years- See Also:
- Constant Field Values
-
-
Method Detail
-
initialize
public void initialize() throws org.apache.uima.resource.ResourceInitializationException- Overrides:
initializein classorg.apache.uima.collection.CollectionReader_ImplBase- Throws:
org.apache.uima.resource.ResourceInitializationException
-
getNext
public void getNext(org.apache.uima.cas.CAS aCas) throws IOException, org.apache.uima.collection.CollectionException- Throws:
IOExceptionorg.apache.uima.collection.CollectionException
-
hasNext
public boolean hasNext() throws IOException, org.apache.uima.collection.CollectionException- Throws:
IOExceptionorg.apache.uima.collection.CollectionException
-
getProgress
public org.apache.uima.util.Progress[] getProgress()
-
close
public void close() throws IOException- Throws:
IOException
-
-