Login / Signup

A statistical framework for cross-tissue transcriptome-wide association analysis.

Yiming HuMo LiQiongshi LuHaoyi WengJiawei WangSeyedeh Maryam ZekavatZhaolong YuBoyang LiJianlei GuSydney MuchnikYu ShiBrian W KunkleShubhabrata MukherjeePradeep NatarajanAdam NajAmanda KuzmaYi ZhaoPaul K Cranenull nullHui LuHongyu Zhao
Published in: Nature genetics (2019)
Transcriptome-wide association analysis is a powerful approach to studying the genetic architecture of complex traits. A key component of this approach is to build a model to impute gene expression levels from genotypes by using samples with matched genotypes and gene expression data in a given tissue. However, it is challenging to develop robust and accurate imputation models with a limited sample size for any single tissue. Here, we first introduce a multi-task learning method to jointly impute gene expression in 44 human tissues. Compared with single-tissue methods, our approach achieved an average of 39% improvement in imputation accuracy and generated effective imputation models for an average of 120% more genes. We describe a summary-statistic-based testing framework that combines multiple single-tissue associations into a powerful metric to quantify the overall gene-trait association. We applied our method, called UTMOST (unified test for molecular signatures), to multiple genome-wide-association results and demonstrate its advantages over single-tissue strategies.
Keyphrases
  • gene expression
  • genome wide
  • dna methylation
  • single cell
  • high resolution
  • machine learning
  • big data
  • deep learning
  • electronic health record
  • genome wide identification