Abstract
There is a gap in the conceptual framework linking genes to phenotypes (G2P) for non-model organisms, as most non-model organisms do not yet have genomic resources readily availa-ble. To address this, researchers often perform literature reviews to understand G2P linkages by curating a list of likely gene candidates, hinging upon other studies already conducted in closely related systems. Sifting through hundreds to thousands of articles is a cumbersome task that slows down the scientific process and may introduce bias into a study. To fill this gap, we created G2PMineR, a free and open source literature mining tool developed specifically for G2P research. This R package uses automation to make the G2P review process efficient and unbiased, while also generating hypothesized associations between genes and phenotypes within a taxonomical frame-work. We applied the package to a literature review for drought-tolerance in plants. The analysis provides biologically meaningful results within the known framework of drought tolerance in plants. Overall, the package is useful for conducting literature reviews for genome-to-phenome pro-jects, and also has broad appeal to scientists investigating a wide range of study systems as it can conduct analyses under the auspices of three different kingdoms (Plantae, Animalia, and Fungi).
| Original language | American English |
|---|---|
| Article number | 293 |
| Pages (from-to) | 1-16 |
| Number of pages | 16 |
| Journal | Genes |
| Volume | 12 |
| Issue number | 2 |
| DOIs | |
| State | Published - Feb 2021 |
Keywords
- G2P
- Genotype
- Literature mining
- Literature review
- Phenotype
EGS Disciplines
- Biology