Evaluating Methods for the Prediction of Cell Type-Specific Enhancers in the Mammalian Cortex.
Nelson J JohansenNiklas KempynckNathan R ZemkeSaroja SomasundaramSeppe De WinterMarcus J HooperDeepanjali DwivediRuchi LohiaFabien WehbeBocheng LiDarina AbaffyováEthan J ArmandJulie De ManEren Can EksiNikolai HeckerGert HulselmansVasileios KonstantakosDavid MauduitJohn K MichGabriele PartelTanya L DaigleBoaz P LeviKai ZhangYoshiaki TanakaJonathan M WernerJonathan T TingYoav Ben-SimonJeremy A MillerJoseph R EckerBing RenStein AertsEd S LeinBosiljka TasicTrygve E BakkenPublished in: bioRxiv : the preprint server for biology (2024)
Identifying cell type-specific enhancers in the brain is critical to building genetic tools for investigating the mammalian brain. Computational methods for functional enhancer prediction have been proposed and validated in the fruit fly and not yet the mammalian brain. We organized the 'Brain Initiative Cell Census Network (BICCN) Challenge: Predicting Functional Cell Type-Specific Enhancers from Cross-Species Multi-Omics' to assess machine learning and feature-based methods designed to nominate enhancer DNA sequences to target cell types in the mouse cortex. Methods were evaluated based on in vivo validation data from hundreds of cortical cell type-specific enhancers that were previously packaged into individual AAV vectors and retro-orbitally injected into mice. We find that open chromatin was a key predictor of functional enhancers, and sequence models improved prediction of non-functional enhancers that can be deprioritized as opposed to pursued for in vivo testing. Sequence models also identified cell type-specific transcription factor codes that can guide designs of in silico enhancers. This community challenge establishes a benchmark for enhancer prioritization algorithms and reveals computational approaches and molecular information that are crucial for the identification of functional enhancers for mammalian cortical cell types. The results of this challenge bring us closer to understanding the complex gene regulatory landscape of the mammalian brain and help us design more efficient genetic tools and potential gene therapies for human neurological diseases.
Keyphrases
- transcription factor
- machine learning
- resting state
- single cell
- functional connectivity
- white matter
- genome wide
- cerebral ischemia
- cell therapy
- healthcare
- endothelial cells
- copy number
- mental health
- stem cells
- binding protein
- deep learning
- type diabetes
- multiple sclerosis
- gene expression
- electronic health record
- adipose tissue
- single molecule
- dna methylation
- artificial intelligence
- brain injury
- quality improvement
- circulating tumor
- insulin resistance
- molecular dynamics simulations