Login / Signup

A method for identifying genetic heterogeneity within phenotypically defined disease subgroups.

James LileyJohn A ToddChris Wallace
Published in: Nature genetics (2016)
Many common diseases show wide phenotypic variation. We present a statistical method for determining whether phenotypically defined subgroups of disease cases represent different genetic architectures, in which disease-associated variants have different effect sizes in two subgroups. Our method models the genome-wide distributions of genetic association statistics with mixture Gaussians. We apply a global test without requiring explicit identification of disease-associated variants, thus maximizing power in comparison to standard variant-by-variant subgroup analysis. Where evidence for genetic subgrouping is found, we present methods for post hoc identification of the contributing genetic variants. We demonstrate the method on a range of simulated and test data sets, for which expected results are already known. We investigate subgroups of individuals with type 1 diabetes (T1D) defined by autoantibody positivity, establishing evidence for differential genetic architecture with positivity for thyroid-peroxidase-specific antibody, driven generally by variants in known T1D-associated genomic regions.
Keyphrases
  • copy number
  • genome wide
  • dna methylation
  • randomized controlled trial
  • clinical trial
  • machine learning
  • big data
  • hydrogen peroxide
  • single cell
  • open label
  • artificial intelligence