Actually used and required files in this directory:

* desc2tax - can be updated/complemented because it is not complete
* modifiers
* non_descriptives
* organisms_in_gene_info.txt.gz - should be updated regularly with the index updates
* reference_species.txt
* speciesprefixes.map - can be complemented, is incomplete
* unspecified_proteins
* familyfilterGnormplusTrainNoAgglomerations.mod.gz - created by de.julielab.jules.ae.genemapping.filtering.families.GeneFamilyTagger
* speciesAssignmentGnormplusTrain.mod.gz - created by de.julielab.jules.ae.genemapping.disambig.org.mlcandidateranker.SpeciesAssignmentMaxEntTrain