Opportunities and challenges for transcriptome-wide association studies.
Michael WainbergNicholas A Sinnott-ArmstrongNicholas MancusoAlvaro N BarbeiraDavid A KnowlesDavid GolanRaili ErmelArno RuusaleppThomas QuertermousKe HaoJohan L M BjorkegrenHae Kyung ImBogdan PasaniucManuel A RivasAnshul KundajePublished in: Nature genetics (2019)
Transcriptome-wide association studies (TWAS) integrate genome-wide association studies (GWAS) and gene expression datasets to identify gene-trait associations. In this Perspective, we explore properties of TWAS as a potential approach to prioritize causal genes at GWAS loci, by using simulations and case studies of literature-curated candidate causal genes for schizophrenia, low-density-lipoprotein cholesterol and Crohn's disease. We explore risk loci where TWAS accurately prioritizes the likely causal gene as well as loci where TWAS prioritizes multiple genes, some likely to be non-causal, owing to sharing of expression quantitative trait loci (eQTL). TWAS is especially prone to spurious prioritization with expression data from non-trait-related tissues or cell types, owing to substantial cross-cell-type variation in expression levels and eQTL strengths. Nonetheless, TWAS prioritizes candidate causal genes more accurately than simple baselines. We suggest best practices for causal-gene prioritization with TWAS and discuss future opportunities for improvement. Our results showcase the strengths and limitations of using eQTL datasets to determine causal genes at GWAS loci.
Keyphrases
- genome wide
- dna methylation
- gene expression
- copy number
- poor prognosis
- genome wide association
- genome wide identification
- healthcare
- primary care
- genome wide association study
- single cell
- systematic review
- case control
- binding protein
- rna seq
- bipolar disorder
- mass spectrometry
- molecular dynamics
- cell therapy
- machine learning
- health information
- deep learning