Class TransformerDisambiguationDataUtils


  • public class TransformerDisambiguationDataUtils
    extends Object
    • Constructor Detail

      • TransformerDisambiguationDataUtils

        public TransformerDisambiguationDataUtils()
    • Method Detail

      • getGmMarkedDocumentText

        public static String getGmMarkedDocumentText(de.julielab.geneexpbase.genemodel.GeneMention gm,
                                                     int maxContextTokens,
                                                     boolean onlyGenes,
                                                     boolean uniqueGenes)
      • getCandidateQueryString

        public static String getCandidateQueryString(de.julielab.geneexpbase.candidateretrieval.SynHit sh,
                                                     CandidateRetrieval candidateRetrieval)
                                              throws ExecutionException
        Creates a single string describing the given gene database candidate. The string consists of the candidate's names and synonyms and, optionally, textual descriptions of the gene. It serves as the "query" or "question" for the transformer: given this string together with a part of the document context of the current gene, the transformer should decide whether the gene description belongs to the current gene or not.
        Parameters:
        sh - The gene candidate to create a query string for.
        Returns:
        The candidate "query" to compare to the document context.
        Throws:
        ExecutionException
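
The description above can be illustrated with a minimal, self-contained sketch. It does not use the library's classes and is not the actual implementation of getCandidateQueryString: the class CandidateQuerySketch, the Candidate type, the buildQuery method, the separators, and the sample values are all hypothetical. It only shows the general idea of joining names, synonyms, and an optional description into one query string.

```java
import java.util.Arrays;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

public class CandidateQuerySketch {

    /** Hypothetical stand-in for a gene database candidate; not the library's SynHit. */
    public static final class Candidate {
        final String preferredName;
        final List<String> synonyms;
        final String description; // may be null when no description is available

        public Candidate(String preferredName, List<String> synonyms, String description) {
            this.preferredName = preferredName;
            this.synonyms = synonyms;
            this.description = description;
        }
    }

    /**
     * Builds one query string from names, synonyms and, optionally, the description.
     * The ", " and ". " separators are assumptions made for this sketch.
     */
    public static String buildQuery(Candidate candidate, boolean includeDescription) {
        // LinkedHashSet drops duplicate names while keeping their original order.
        Set<String> names = new LinkedHashSet<>();
        names.add(candidate.preferredName);
        names.addAll(candidate.synonyms);

        StringBuilder query = new StringBuilder(String.join(", ", names));
        if (includeDescription && candidate.description != null
                && !candidate.description.trim().isEmpty()) {
            query.append(". ").append(candidate.description.trim());
        }
        return query.toString();
    }

    public static void main(String[] args) {
        // Placeholder values only; they do not come from any gene database.
        Candidate candidate = new Candidate(
                "GENE1",
                Arrays.asList("gene one", "G1", "GENE1"),
                "A placeholder description.");
        System.out.println(buildQuery(candidate, true));
        System.out.println(buildQuery(candidate, false));
    }
}
```

In a setup like the one described, such a string would be paired with the document context of the gene mention and passed to the transformer as a two-part input. How the real method obtains names and descriptions through its CandidateRetrieval argument is not covered by this sketch.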
      • addDocumentLevelGeneAnnotations

        public static void addDocumentLevelGeneAnnotations(de.julielab.geneexpbase.genemodel.GeneDocument document,
                                                           com.google.common.collect.Multimap<String,String> docid2geneid)