Actually used and required files in this directory:

* desc2tax - can be updated/complemented because it is not complete
* modifiers
* non_descriptives
* organisms_in_gene_info.txt.gz - should be updated regularly with the index updates
* reference_species.txt
* speciesprefixes.map - can be complemented, is incomplete
* unspecified_proteins
* familyfilterGnormplusTrainNoAgglomerations.mod.gz - created by de.julielab.jules.ae.genemapper.filtering.families.GeneFamilyTagger
* speciesAssignmentGnormplusTrain.mod.gz - created by de.julielab.jules.ae.genemapper.disambig.org.mlcandidateranker.SpeciesAssignmentMaxEntTrain
* synonym_species_occurrences.tsv.gz - created by jcore-gene-mapper-resources/pipelines/synonym-species-occurrence-counting either run on PubMed, PMC or both.

