Package corpora

Class DataLoader


  • public class DataLoader
    extends Object
    Author:
    Chinh
    • Constructor Detail

      • DataLoader

        public DataLoader()
    • Method Detail

      • loadData

        public void loadData​(String path,
                             boolean train)
      • Txt2Db

        public void Txt2Db​(String path,
                           String dest,
                           boolean train)
      • Txt2Db

        public DBUtils Txt2Db​(String pid,
                              String text,
                              List<String> proteins)
        Creates a database for event extraction as an programmatic API-call where all values are given directly rather then reading the values from files. In contrast to Txt2Db(String, String, boolean), created in-memory database is returned for further processing and not persisted to file.
        The protein lines have to match the Shared Task 2011 format:
        ID<tab>Entity-Type[Protein]<tab>start<tab>end<tab>Mention name
        Example: T3 Protein 166 174 TGF-beta
        Parameters:
        pid -
        text -
        proteins -
        Returns:
      • main

        public static void main​(String[] args)