Tissue-wide cell-specific proteogenomic modeling reveals novel candidate risk genes in autism spectrum disorders.
Abolfazl Doostparast Torshizi, Kai Wang. Published in: npj Systems Biology and Applications (2022)
Autism spectrum disorders (ASD) are a set of complex neurodevelopmental diseases characterized by repetitive behavioral patterns and communication disabilities. Using a systems biology method called MAPSD (Markov Affinity-based Proteogenomic Signal Diffusion) to jointly model proteome dynamics and a wide array of omics datasets, we identified a list of candidate ASD risk genes. Leveraging the collected biological signals together with a large-scale protein-protein interaction network adjusted for single-cell-resolution proteome properties in four brain regions, we observed agreement between the known and the newly identified candidate genes, which are spatially enriched in neuronal cells within the cerebral cortex at the protein level. Moreover, we created a detailed subcellular localization enrichment map of the known and the identified genes across 32 micro-domains and showed that neuronal cells and neuropils account for the largest fraction of signal enrichment in the cerebral cortex. Notably, we showed that the identified genes are among the transcriptional biomarkers of inhibitory and excitatory neurons in the human frontal cortex. Intersecting the identified genes with a single-cell RNA-seq dataset from ASD brains provided further evidence that 20 candidate genes, including GRIK1, EMX2, STXBP6, and KCNJ3, are disrupted in distinct cell types. We also showed that ASD risk genes are predominantly distributed in certain human interactome modules, and that the identified genes may act as regulators of some of the known ASD loci. In summary, our study demonstrates how tissue-wide cell-specific proteogenomic modeling can reveal candidate genes for brain disorders that are supported by convergent lines of evidence.
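The abstract does not describe how MAPSD propagates signal, so the following is only a loose illustration of the general idea of Markov-style signal diffusion over an interaction network. It implements a generic random walk with restart, which is a swapped-in textbook technique and not the MAPSD algorithm. The function name `diffuse_signal`, the toy network with nodes `A` to `E`, the seed score, and the restart probability are all hypothetical.

```python
def diffuse_signal(adjacency, seed_scores, restart=0.5, tol=1e-10, max_iter=1000):
    """Generic random walk with restart (NOT the MAPSD method itself).

    adjacency:   dict node -> dict neighbor -> non-negative edge weight
                 (every neighbor must also appear as a key)
    seed_scores: dict node -> non-negative prior signal
    Returns a dict node -> steady-state score; scores sum to 1.
    """
    nodes = sorted(adjacency)
    total = sum(seed_scores.get(n, 0.0) for n in nodes)
    if total <= 0:
        raise ValueError("seed_scores must contain some positive signal")

    # Normalised prior: where the walker restarts.
    prior = {n: seed_scores.get(n, 0.0) / total for n in nodes}
    # Total outgoing weight per node, used to build transition probabilities.
    out_weight = {n: sum(adjacency[n].values()) for n in nodes}

    current = dict(prior)
    for _ in range(max_iter):
        # Restart step: a fraction of the mass jumps back to the prior.
        updated = {n: restart * prior[n] for n in nodes}
        # Walk step: the rest moves to neighbors in proportion to edge weight.
        for src in nodes:
            if out_weight[src] == 0:
                # Isolated node: its remaining mass stays where it is.
                updated[src] += (1 - restart) * current[src]
                continue
            for dst, weight in adjacency[src].items():
                updated[dst] += (1 - restart) * current[src] * weight / out_weight[src]
        change = sum(abs(updated[n] - current[n]) for n in nodes)
        current = updated
        if change < tol:
            break
    return current


# Hypothetical toy network: a chain A-B-C-D plus an isolated node E.
network = {
    "A": {"B": 1.0},
    "B": {"A": 1.0, "C": 1.0},
    "C": {"B": 1.0, "D": 1.0},
    "D": {"C": 1.0},
    "E": {},
}
seeds = {"A": 1.0}  # pretend A is the only node with prior evidence

scores = diffuse_signal(network, seeds)
ranking = sorted(scores, key=scores.get, reverse=True)
```

In this toy setting, nodes closer to the seed end up with higher scores and the disconnected node receives none, which is the intuition behind ranking candidate genes by their network proximity to known risk genes.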
Keyphrases
- single cell
- RNA-seq
- autism spectrum disorder
- genome wide
- high throughput
- bioinformatics analysis
- attention deficit hyperactivity disorder
- protein protein
- intellectual disability
- genome wide identification
- induced apoptosis
- endothelial cells
- cerebral ischemia
- small molecule
- DNA methylation
- gene expression
- transcription factor
- oxidative stress
- resting state
- high frequency
- subarachnoid hemorrhage
- big data
- binding protein
- cell death
- amino acid
- cell proliferation
- heat stress
- spinal cord
- endoplasmic reticulum stress
- spinal cord injury
- high density
- cerebral blood flow