SR-TWAS: leveraging multiple reference panels to improve transcriptome-wide association study power by ensemble machine learning.

Randy L Parrish, Aron S Buchman, Shinya Tasaki, Yangling Wang, Denis Avey, Jishu Xu, Philip Lawrence De Jager, David A Bennett, Michael P Epstein, Jingjing Yang
Published in: Nature Communications (2024)
Multiple reference panels of a given tissue or multiple tissues often exist, and multiple regression methods could be used for training gene expression imputation models for transcriptome-wide association studies (TWAS). To leverage expression imputation models (i.e., base models) trained with multiple reference panels, regression methods, and tissues, we develop a Stacked Regression based TWAS (SR-TWAS) tool which can obtain optimal linear combinations of base models for a given validation transcriptomic dataset. Both simulation and real studies show that SR-TWAS improves power, due to increased training sample sizes and borrowed strength across multiple regression methods and tissues. Leveraging base models across multiple reference panels, tissues, and regression methods, our real studies identify 6 independent significant risk genes for Alzheimer's disease (AD) dementia for supplementary motor area tissue and 9 independent significant risk genes for Parkinson's disease (PD) for substantia nigra tissue. Relevant biological interpretations are found for these significant risk genes.
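
The idea of finding a linear combination of base models against a validation dataset can be illustrated with a small sketch. This is not the SR-TWAS implementation. It assumes non-negative weights that sum to one and a mean-squared-error criterion, neither of which is stated in the abstract, and it uses simulated genotypes and hypothetical base-model effect sizes. The function names `project_to_simplex` and `stack_weights` are made up for this illustration.

```python
import numpy as np

def project_to_simplex(v):
    """Euclidean projection of v onto {w : w >= 0, sum(w) = 1}."""
    u = np.sort(v)[::-1]
    css = np.cumsum(u) - 1.0
    k = np.arange(1, len(v) + 1)
    rho = np.nonzero(u - css / k > 0)[0][-1]
    theta = css[rho] / (rho + 1)
    return np.maximum(v - theta, 0.0)

def stack_weights(base_preds, y, n_iter=5000):
    """Weights for K base models by projected gradient descent on the MSE.

    base_preds: (n_samples, K) expression predicted by each base model
    y:          (n_samples,) observed expression in the validation data
    """
    n, n_models = base_preds.shape
    w = np.full(n_models, 1.0 / n_models)  # start from equal weights
    # Step size 1/L, where L bounds the curvature of the MSE objective.
    lipschitz = 2.0 * np.linalg.norm(base_preds, 2) ** 2 / n
    step = 1.0 / max(lipschitz, 1e-12)
    for _ in range(n_iter):
        grad = 2.0 * base_preds.T @ (base_preds @ w - y) / n
        w = project_to_simplex(w - step * grad)
    return w

# Simulated validation data (all values are illustrative assumptions).
rng = np.random.default_rng(0)
n_samples, n_snps = 300, 20
genotypes = rng.binomial(2, 0.3, size=(n_samples, n_snps)).astype(float)
genotypes -= genotypes.mean(axis=0)  # center each SNP
true_effects = rng.normal(size=n_snps)
expression = genotypes @ true_effects + rng.normal(size=n_samples)

# Three hypothetical base models: two noisy estimates of the true effects
# (e.g. trained on different reference panels) and one uninformative model.
base_effects = np.column_stack([
    true_effects + rng.normal(scale=0.5, size=n_snps),
    true_effects + rng.normal(scale=0.5, size=n_snps),
    rng.normal(size=n_snps),
])
base_preds = genotypes @ base_effects

weights = stack_weights(base_preds, expression)
stacked_mse = np.mean((expression - base_preds @ weights) ** 2)
single_mse = np.mean((expression[:, None] - base_preds) ** 2, axis=0)
print("weights:", weights)
print("stacked MSE:", stacked_mse, "single-model MSEs:", single_mse)
```

Because each single base model corresponds to a corner of the weight simplex, the stacked predictor in this sketch can do no worse on the validation data than the best individual model, which is one way to read the abstract's claim of borrowing strength across panels, methods, and tissues.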