Tissue-specific enhancer-gene maps from multimodal single-cell data identify causal disease alleles.
Saori SakaueKathryn WeinandShakson IsaacKushal K DeyKarthik A JagadeeshMasahiro KanaiGerald F M WattsZhu Zhunull nullMichael B BrennerAndrew McDavidLaura T DonlinKevin WeiAlkes L PriceSoumya RaychaudhuriPublished in: Nature genetics (2024)
Translating genome-wide association study (GWAS) loci into causal variants and genes requires accurate cell-type-specific enhancer-gene maps from disease-relevant tissues. Building enhancer-gene maps is essential but challenging with current experimental methods in primary human tissues. Here we developed a nonparametric statistical method, SCENT (single-cell enhancer target gene mapping), that models association between enhancer chromatin accessibility and gene expression in single-cell or nucleus multimodal RNA sequencing and ATAC sequencing data. We applied SCENT to 9 multimodal datasets including >120,000 single cells or nuclei and created 23 cell-type-specific enhancer-gene maps. These maps were highly enriched for causal variants in expression quantitative loci and GWAS for 1,143 diseases and traits. We identified likely causal genes for both common and rare diseases and linked somatic mutation hotspots to target genes. We demonstrate that application of SCENT to multimodal data from disease-relevant human tissue enables the scalable construction of accurate cell-type-specific enhancer-gene maps, essential for defining noncoding variant function.
Keyphrases
- genome wide
- single cell
- copy number
- genome wide identification
- transcription factor
- gene expression
- dna methylation
- binding protein
- rna seq
- genome wide association study
- genome wide analysis
- high resolution
- endothelial cells
- pain management
- high throughput
- electronic health record
- big data
- cell death
- oxidative stress
- cell proliferation
- mass spectrometry
- chronic pain
- data analysis
- machine learning
- pluripotent stem cells
- artificial intelligence