Login / Signup

Detecting disease-associated genomic outcomes using constrained mixture of Bayesian hierarchical models for paired data.

Yunfeng LiJarrett MorrowBenjamin RabyKelan TantisiraScott T WeissWei HuangWeiliang Qiu
Published in: PloS one (2017)
Detecting disease-associated genomic outcomes is one of the key steps in precision medicine research. Cutting-edge high-throughput technologies enable researchers to unbiasedly test if genomic outcomes are associated with disease of interest. However, these technologies also include the challenges associated with the analysis of genome-wide data. Two big challenges are (1) how to reduce the effects of technical noise; and (2) how to handle the curse of dimensionality (i.e., number of variables are way larger than the number of samples). To tackle these challenges, we propose a constrained mixture of Bayesian hierarchical models (MBHM) for detecting disease-associated genomic outcomes for data obtained from paired/matched designs. Paired/matched designs can effectively reduce effects of confounding factors. MBHM does not involve multiple testing, hence does not have the problem of the curse of dimensionality. It also could borrow information across genes so that it can be used for whole genome data with small sample sizes.
Keyphrases
  • genome wide
  • big data
  • electronic health record
  • copy number
  • high throughput
  • type diabetes
  • adipose tissue
  • gene expression
  • air pollution
  • data analysis
  • transcription factor
  • weight loss
  • bioinformatics analysis