MIDESP: Mutual Information-Based Detection of Epistatic SNP Pairs for Qualitative and Quantitative Phenotypes.
Felix HeinrichFaisal RamzanAbirami RajavelArmin Otto SchmittMehmet GültasPublished in: Biology (2021)
The interactions between SNPs result in a complex interplay with the phenotype, known as epistasis. The knowledge of epistasis is a crucial part of understanding genetic causes of complex traits. However, due to the enormous number of SNP pairs and their complex relationship to the phenotype, identification still remains a challenging problem. Many approaches for the detection of epistasis have been developed using mutual information (MI) as an association measure. However, these methods have mainly been restricted to case-control phenotypes and are therefore of limited applicability for quantitative traits. To overcome this limitation of MI-based methods, here, we present an MI-based novel algorithm, MIDESP, to detect epistasis between SNPs for qualitative as well as quantitative phenotypes. Moreover, by incorporating a dataset-dependent correction technique, we deal with the effect of background associations in a genotypic dataset to separate correct epistatic interaction signals from those of false positive interactions resulting from the effect of single SNP×phenotype associations. To demonstrate the effectiveness of MIDESP, we apply it on two real datasets with qualitative and quantitative phenotypes, respectively. Our results suggest that by eliminating the background associations, MIDESP can identify important genes, which play essential roles for bovine tuberculosis or the egg weight of chickens.
Keyphrases
- genome wide
- dna methylation
- high resolution
- copy number
- systematic review
- case control
- healthcare
- randomized controlled trial
- body mass index
- mycobacterium tuberculosis
- gene expression
- machine learning
- label free
- loop mediated isothermal amplification
- physical activity
- transcription factor
- weight loss
- health information
- real time pcr
- pulmonary tuberculosis
- hepatitis c virus
- hiv aids
- sensitive detection
- electronic health record