Predicting causal genes from psychiatric genome-wide association studies using high-level etiological knowledge.
Michael WainbergDaniele MericoMatthew C KellerEric B FaumanShreejoy J TripathyPublished in: Molecular psychiatry (2022)
Genome-wide association studies have discovered hundreds of genomic loci associated with psychiatric traits, but the causal genes underlying these associations are often unclear, a research gap that has hindered clinical translation. Here, we present a Psychiatric Omnilocus Prioritization Score (PsyOPS) derived from just three binary features encapsulating high-level assumptions about psychiatric disease etiology - namely, that causal psychiatric disease genes are likely to be mutationally constrained, be specifically expressed in the brain, and overlap with known neurodevelopmental disease genes. To our knowledge, PsyOPS is the first method specifically tailored to prioritizing causal genes at psychiatric GWAS loci. We show that, despite its extreme simplicity, PsyOPS achieves state-of-the-art performance at this task, comparable to a prior domain-agnostic approach relying on tens of thousands of features. Genes prioritized by PsyOPS are substantially more likely than other genes at the same loci to have convergent evidence of direct regulation by the GWAS variant according to both DNA looping assays and expression or splicing quantitative trait locus (QTL) maps. We provide examples of genes hundreds of kilobases away from the lead variant, like GABBR1 for schizophrenia, that are prioritized by all three of PsyOPS, DNA looping and QTLs. Our results underscore the power of incorporating high-level knowledge of trait etiology into causal gene prediction at GWAS loci, and comprise a resource for researchers interested in experimentally characterizing psychiatric gene candidates.
Keyphrases
- genome wide
- genome wide association
- dna methylation
- genome wide identification
- mental health
- copy number
- healthcare
- bioinformatics analysis
- genome wide analysis
- genome wide association study
- gene expression
- transcription factor
- high resolution
- mass spectrometry
- poor prognosis
- high throughput
- smoking cessation
- climate change
- case control