Login / Signup

Generalized Structured Component Analysis in candidate gene association studies: applications and limitations.

Paul A ThompsonDorothy V M BishopElse EisingSimon E FisherDianne F Newbury
Published in: Wellcome open research (2019)
Background: Generalized Structured Component Analysis (GSCA) is a component-based alternative to traditional covariance-based structural equation modelling. This method has previously been applied to test for association between candidate genes and clinical phenotypes, contrasting with traditional genetic association analyses that adopt univariate testing of many individual single nucleotide polymorphisms (SNPs) with correction for multiple testing. Methods: We first evaluate the ability of the GSCA method to replicate two previous findings from a genetics association study of developmental language disorders. We then present the results of a simulation study to test the validity of the GSCA method under more restrictive data conditions, using smaller sample sizes and larger numbers of SNPs than have previously been investigated. Finally, we compare GSCA performance against univariate association analysis conducted using PLINK v1.9. Results: Results from simulations show that power to detect effects depends not just on sample size, but also on the ratio of SNPs with effect to number of SNPs tested within a gene. Inclusion of many SNPs in a model dilutes true effects. Conclusions: We propose that GSCA is a useful method for replication studies, when candidate SNPs have been identified, but should not be used for exploratory analysis.
Keyphrases
  • genome wide
  • dna methylation
  • copy number
  • genome wide association
  • gene expression
  • machine learning
  • electronic health record
  • big data
  • high resolution
  • autism spectrum disorder
  • deep learning