BigNeuron: a resource to benchmark and predict performance of algorithms for automated tracing of neurons in light microscopy datasets.
Linus Manubens-GilZhi ZhouHanbo ChenArvind RamanathanXiaoxiao LiuYufeng LiuAlessandro BriaTodd GilletteZongcai RuanJian YangMiroslav RadojevićTing ZhaoLi ChengLei QuSiqi LiuKristofer E BouchardLin GuWeidong CaiShuiwang JiBadrinath RoysamChing-Wei WangHongchuan YuAmos SironiDaniel Maxim IasconeJie ZhouErhan BasEduardo Conde-SousaPaulo AguiarXiang LiYujie LiSumit NandaYuan WangLeila MuresanPascal FuaBing YeHai-Yan HeJochen F StaigerManuel PeterDaniel N CoxMichel SimonneauMarcel OberlaenderGregory S X E JefferisKei ItoPaloma Gonzalez-BellidoJinhyun KimEdwin RubelHollis T ClineHongkui ZengAljoscha NernAnn-Shyn ChiangJianhua YaoJane RoskamsRick LiveseyJanine StevensTianming LiuChinh DangYike GuoNing ZhongGeorgia D TourassiSean L HillMichael J HawrylyczChristof KochErik MeijeringGiorgio A AscoliHanchuan PengPublished in: Nature methods (2023)
BigNeuron is an open community bench-testing platform with the goal of setting open standards for accurate and fast automatic neuron tracing. We gathered a diverse set of image volumes across several species that is representative of the data obtained in many neuroscience laboratories interested in neuron tracing. Here, we report generated gold standard manual annotations for a subset of the available imaging datasets and quantified tracing quality for 35 automatic tracing algorithms. The goal of generating such a hand-curated diverse dataset is to advance the development of tracing algorithms and enable generalizable benchmarking. Together with image quality features, we pooled the data in an interactive web application that enables users and developers to perform principal component analysis, t-distributed stochastic neighbor embedding, correlation and clustering, visualization of imaging and tracing data, and benchmarking of automatic tracing algorithms in user-defined data subsets. The image quality metrics explain most of the variance in the data, followed by neuromorphological features related to neuron size. We observed that diverse algorithms can provide complementary information to obtain accurate results and developed a method to iteratively combine methods and generate consensus reconstructions. The consensus trees obtained provide estimates of the neuron structure ground truth that typically outperform single algorithms in noisy datasets. However, specific algorithms may outperform the consensus tree strategy in specific imaging conditions. Finally, to aid users in predicting the most accurate automatic tracing results without manual annotations for comparison, we used support vector machine regression to predict reconstruction quality given an image volume and a set of automatic tracings.
Keyphrases
- deep learning
- machine learning
- big data
- image quality
- high resolution
- artificial intelligence
- electronic health record
- mental health
- rna seq
- spinal cord
- magnetic resonance imaging
- high throughput
- clinical practice
- mass spectrometry
- single cell
- spinal cord injury
- magnetic resonance
- data analysis
- photodynamic therapy
- social media
- single molecule
- dual energy
- randomized controlled trial