Class IGNReader

  • All Implemented Interfaces:
    org.apache.uima.collection.base_cpm.BaseCollectionReader, org.apache.uima.collection.CollectionReader, org.apache.uima.resource.ConfigurableResource, org.apache.uima.resource.Resource

    public class IGNReader
    extends org.apache.uima.collection.CollectionReader_ImplBase
    The IGNReader reads corpus files in BioC-format.
    There are XML files comprising the actual text (as well as passage and sentence annotations) and there are separate XML files comprising the annotations.
    Author:
    engelmann
    • Field Summary

      Fields 
      Modifier and Type Field Description
      static String PARAM_INPUTDIR_ANNO
      String parameter indicating path to the directory containing files in BioC-format that comprise the annotations.
      static String PARAM_INPUTDIR_TEXT
      String parameter indicating path to the directory containing files in BioC-format that comprise the actual text.
      static String PUBLICATION_DATES_FILE
      optional Parameter providing the file path to a file mapping the article ids to the corresponding publication years
      • Fields inherited from interface org.apache.uima.resource.Resource

        PARAM_AGGREGATE_SOFA_MAPPINGS, PARAM_CONFIG_MANAGER, PARAM_CONFIG_PARAM_SETTINGS, PARAM_EXTERNAL_OVERRIDE_SETTINGS, PARAM_PERFORMANCE_TUNING_SETTINGS, PARAM_RESOURCE_MANAGER, PARAM_UIMA_CONTEXT
    • Constructor Summary

      Constructors 
      Constructor Description
      IGNReader()  
    • Method Summary

      All Methods Instance Methods Concrete Methods 
      Modifier and Type Method Description
      void close()  
      void getNext​(org.apache.uima.cas.CAS aCas)  
      org.apache.uima.util.Progress[] getProgress()  
      boolean hasNext()  
      void initialize()  
      • Methods inherited from class org.apache.uima.collection.CollectionReader_ImplBase

        destroy, getCasInitializer, getProcessingResourceMetaData, initialize, isConsuming, reconfigure, setCasInitializer, typeSystemInit
      • Methods inherited from class org.apache.uima.resource.ConfigurableResource_ImplBase

        getConfigParameterValue, getConfigParameterValue, setConfigParameterValue, setConfigParameterValue
      • Methods inherited from class org.apache.uima.resource.Resource_ImplBase

        getCasManager, getLogger, getMetaData, getRelativePathResolver, getResourceManager, getUimaContext, getUimaContextAdmin, setLogger, setMetaData
      • Methods inherited from interface org.apache.uima.resource.ConfigurableResource

        getConfigParameterValue, getConfigParameterValue, setConfigParameterValue, setConfigParameterValue
      • Methods inherited from interface org.apache.uima.resource.Resource

        getLogger, getMetaData, getResourceManager, getUimaContext, getUimaContextAdmin, setLogger
    • Field Detail

      • PARAM_INPUTDIR_TEXT

        public static final String PARAM_INPUTDIR_TEXT
        String parameter indicating path to the directory containing files in BioC-format that comprise the actual text.
        See Also:
        Constant Field Values
      • PARAM_INPUTDIR_ANNO

        public static final String PARAM_INPUTDIR_ANNO
        String parameter indicating path to the directory containing files in BioC-format that comprise the annotations.
        See Also:
        Constant Field Values
      • PUBLICATION_DATES_FILE

        public static final String PUBLICATION_DATES_FILE
        optional Parameter providing the file path to a file mapping the article ids to the corresponding publication years
        See Also:
        Constant Field Values
    • Constructor Detail

      • IGNReader

        public IGNReader()
    • Method Detail

      • initialize

        public void initialize()
                        throws org.apache.uima.resource.ResourceInitializationException
        Overrides:
        initialize in class org.apache.uima.collection.CollectionReader_ImplBase
        Throws:
        org.apache.uima.resource.ResourceInitializationException
      • getNext

        public void getNext​(org.apache.uima.cas.CAS aCas)
                     throws IOException,
                            org.apache.uima.collection.CollectionException
        Throws:
        IOException
        org.apache.uima.collection.CollectionException
      • hasNext

        public boolean hasNext()
                        throws IOException,
                               org.apache.uima.collection.CollectionException
        Throws:
        IOException
        org.apache.uima.collection.CollectionException
      • getProgress

        public org.apache.uima.util.Progress[] getProgress()