A computational method for cell type-specific expression quantitative trait loci mapping using bulk RNA-seq data.
Paul LittleSi LiuVasyl ZhabotynskyYun LiDan-Yu LinWei SunPublished in: Nature communications (2023)
Mapping cell type-specific gene expression quantitative trait loci (ct-eQTLs) is a powerful way to investigate the genetic basis of complex traits. A popular method for ct-eQTL mapping is to assess the interaction between the genotype of a genetic locus and the abundance of a specific cell type using a linear model. However, this approach requires transforming RNA-seq count data, which distorts the relation between gene expression and cell type proportions and results in reduced power and/or inflated type I error. To address this issue, we have developed a statistical method called CSeQTL that allows for ct-eQTL mapping using bulk RNA-seq count data while taking advantage of allele-specific expression. We validated the results of CSeQTL through simulations and real data analysis, comparing CSeQTL results to those obtained from purified bulk RNA-seq data or single cell RNA-seq data. Using our ct-eQTL findings, we were able to identify cell types relevant to 21 categories of human traits.
Keyphrases
- rna seq
- single cell
- genome wide
- data analysis
- gene expression
- high resolution
- electronic health record
- dna methylation
- computed tomography
- high throughput
- image quality
- big data
- dual energy
- contrast enhanced
- poor prognosis
- endothelial cells
- magnetic resonance imaging
- positron emission tomography
- bone marrow
- mass spectrometry
- long non coding rna
- induced pluripotent stem cells
- molecular dynamics