A novel transcriptional risk score for risk prediction of complex human diseases.
Nayang ShanYuhan XieShuang SongWei JiangZuoheng WangLin HouPublished in: Genetic epidemiology (2021)
Recently polygenetic risk score (PRS) has been successfully used in the risk prediction of complex human diseases. Many studies incorporated internal information, such as effect size distribution, or external information, such as linkage disequilibrium, functional annotation, and pleiotropy among multiple diseases, to optimize the performance of PRS. To leverage on multiomics datasets, we developed a novel flexible transcriptional risk score (TRS), in which messenger RNA expression levels were imputed and weighted for risk prediction. In simulation studies, we demonstrated that single-tissue TRS has greater prediction power than LDpred, especially when there is a large effect of gene expression on the phenotype. Multitissue TRS improves prediction accuracy when there are multiple tissues with independent contributions to disease risk. We applied our method to complex traits, including Crohn's disease, type 2 diabetes, and so on. The single-tissue TRS method outperformed LDpred and AnnoPred across the tested traits. The performance of multitissue TRS is trait-dependent. Moreover, our method can easily incorporate information from epigenomic and proteomic data upon the availability of reference datasets.
Keyphrases
- gene expression
- genome wide
- endothelial cells
- type diabetes
- health information
- dna methylation
- rna seq
- induced pluripotent stem cells
- transcription factor
- poor prognosis
- pluripotent stem cells
- cardiovascular disease
- magnetic resonance imaging
- healthcare
- computed tomography
- single cell
- social media
- contrast enhanced
- machine learning
- skeletal muscle
- oxidative stress
- deep learning
- big data
- heat shock
- network analysis