Identifying genetic variants that influence the abundance of cell states in single-cell data.
Laurie RumkerSaori SakaueYakir ReshefJoyce B KangSeyhan YazarJose Alquicira-HernandezCristian ValenciaKaitlyn A LagattutaAnnelise Mah-SomAparna NathanJoseph E PowellPo-Ru LohSoumya RaychaudhuriPublished in: bioRxiv : the preprint server for biology (2023)
To understand genetic mechanisms driving disease, it is essential but difficult to map how risk alleles affect the composition of cells present in the body. Single-cell profiling quantifies granular information about tissues, but variant-associated cell states may reflect diverse combinations of the profiled cell features that are challenging to predefine. We introduce GeNA (Genotype-Neighborhood Associations), a statistical tool to identify cell state abundance quantitative trait loci (csaQTLs) in high-dimensional single-cell datasets. Instead of testing associations to predefined cell states, GeNA flexibly identifies the cell states whose abundance is most associated with genetic variants. In a genome-wide survey of scRNA-seq peripheral blood profiling from 969 individuals, 1 GeNA identifies five independent loci associated with shifts in the relative abundance of immune cell states. For example, rs3003-T (p=1.96×10 -11 ) associates with increased abundance of NK cells expressing TNF-α response programs. This csaQTL colocalizes with increased risk for psoriasis, an autoimmune disease that responds to anti-TNF treatments. Flexibly characterizing csaQTLs for granular cell states may help illuminate how genetic background alters cellular composition to confer disease risk.
Keyphrases
- single cell
- genome wide
- rna seq
- high throughput
- dna methylation
- gene expression
- peripheral blood
- rheumatoid arthritis
- healthcare
- public health
- antibiotic resistance genes
- mesenchymal stem cells
- oxidative stress
- stem cells
- mass spectrometry
- bone marrow
- copy number
- cell cycle arrest
- machine learning
- cell proliferation
- big data
- atopic dermatitis
- nk cells
- cross sectional
- genome wide association study