Login / Signup

Splicing QTL analysis focusing on coding sequences reveals mechanisms for disease susceptibility loci.

Kensuke YamaguchiKazuyoshi IshigakiAkari SuzukiYumi TsuchidaHaruka TsuchiyaShuji SumitomoYasuo NagafuchiFuyuki MiyaTatushiko TsunodaHirofumi ShodaKeishi FujioKazuhiko YamamotoYuta Kochi
Published in: Nature communications (2022)
Splicing quantitative trait loci (sQTLs) are one of the major causal mechanisms in genome-wide association study (GWAS) loci, but their role in disease pathogenesis is poorly understood. One reason is the complexity of alternative splicing events producing many unknown isoforms. Here, we propose two approaches, namely integration and selection, for this complexity by focusing on protein-structure of isoforms. First, we integrate isoforms with the same coding sequence (CDS) and identify 369-601 integrated-isoform ratio QTLs (i 2 -rQTLs), which altered protein-structure, in six immune subsets. Second, we select CDS incomplete isoforms annotated in GENCODE and identify 175-337 isoform-ratio QTL (i-rQTL). By comprehensive long-read capture RNA-sequencing among these incomplete isoforms, we reveal 29 full-length isoforms with unannotated CDSs associated with GWAS traits. Furthermore, we show that disease-causal sQTL genes can be identified by evaluating their trans-eQTL effects. Our approaches highlight the understudied role of protein-altering sQTLs and are broadly applicable to other tissues and diseases.
Keyphrases
  • genome wide association study
  • genome wide
  • dna methylation
  • quantum dots
  • amino acid
  • gene expression
  • genome wide association
  • small molecule
  • high density
  • genetic diversity