Unifying community-wide whole-brain imaging datasets enables robust automated neuron identification and reveals determinants of neuron positioning in C. elegans .
Daniel Yutaka SpragueKevin RuschRaymond L DunnJackson M BorchardtSteven BanGreg BubnisGrace C ChiuChentao WenRyoga SuzukiShivesh ChaudharyHyun Jee LeeZikai YuBenjamin K DichterRyan LyShuichi OnamiHang LuKoutarou D KimuraEviatar I YeminiSaul KatoPublished in: bioRxiv : the preprint server for biology (2024)
We develop a data harmonization approach for C. elegans volumetric microscopy data, still or video, consisting of a standardized format, data pre-processing techniques, and a set of human-in-the-loop machine learning based analysis software tools. We unify a diverse collection of 118 whole-brain neural activity imaging datasets from 5 labs, storing these and accompanying tools in an online repository called WormID (wormid.org). We use this repository to train three existing automated cell identification algorithms to, for the first time, enable accuracy in neural identification that generalizes across labs, approaching human performance in some cases. We mine this repository to identify factors that influence the developmental positioning of neurons. To facilitate communal use of this repository, we created open-source software, code, web-based tools, and tutorials to explore and curate datasets for contribution to the scientific community. This repository provides a growing resource for experimentalists, theorists, and toolmakers to (a) study neuroanatomical organization and neural activity across diverse experimental paradigms, (b) develop and benchmark algorithms for automated neuron detection, segmentation, cell identification, tracking, and activity extraction, and (c) inform models of neurobiological development and function.
Keyphrases
- machine learning
- deep learning
- big data
- high resolution
- high throughput
- endothelial cells
- single cell
- artificial intelligence
- electronic health record
- bioinformatics analysis
- healthcare
- rna seq
- mental health
- data analysis
- white matter
- cell therapy
- resting state
- convolutional neural network
- pluripotent stem cells
- stem cells
- optical coherence tomography
- functional connectivity
- mass spectrometry
- multiple sclerosis
- transcription factor
- bone marrow
- mesenchymal stem cells
- spinal cord injury
- sensitive detection