KnetMiner: a comprehensive approach for supporting evidence-based gene discovery and complex trait analysis across species.
Keywan Hassani-PakAjit SinghMarco BrandiziJoseph HearnshawJeremy D ParsonsSandeep AmberkarAndrew L PhillipsJohn H DoonanChris RawlingsPublished in: Plant biotechnology journal (2021)
The generation of new ideas and scientific hypotheses is often the result of extensive literature and database searches, but, with the growing wealth of public and private knowledge, the process of searching diverse and interconnected data to generate new insights into genes, gene networks, traits and diseases is becoming both more complex and more time-consuming. To guide this technically challenging data integration task and to make gene discovery and hypotheses generation easier for researchers, we have developed a comprehensive software package called KnetMiner which is open-source and containerized for easy use. KnetMiner is an integrated, intelligent, interactive gene and gene network discovery platform that supports scientists explore and understand the biological stories of complex traits and diseases across species. It features fast algorithms for generating rich interactive gene networks and prioritizing candidate genes based on knowledge mining approaches. KnetMiner is used in many plant science institutions and has been adopted by several plant breeding organizations to accelerate gene discovery. The software is generic and customizable and can therefore be readily applied to new species and data types; for example, it has been applied to pest insects and fungal pathogens; and most recently repurposed to support COVID-19 research. Here, we give an overview of the main approaches behind KnetMiner and we report plant-centric case studies for identifying genes, gene networks and trait relationships in Triticum aestivum (bread wheat), as well as, an evidence-based approach to rank candidate genes under a large Arabidopsis thaliana QTL. KnetMiner is available at: https://knetminer.org.
Keyphrases
- genome wide
- genome wide identification
- copy number
- dna methylation
- healthcare
- small molecule
- high throughput
- sars cov
- machine learning
- genome wide analysis
- emergency department
- public health
- systematic review
- arabidopsis thaliana
- electronic health record
- transcription factor
- gene expression
- data analysis
- health insurance
- multidrug resistant
- genetic diversity