BigNeuron: a resource to benchmark and predict performance of algorithms for automated tracing of neurons in light microscopy datasets.

Linus Manubens-GilZhi ZhouHanbo ChenArvind RamanathanXiaoxiao LiuYufeng Liu Alessandro BriaTodd GilletteZongcai RuanJian YangMiroslav RadojevićTing ZhaoLi ChengLei QuSiqi LiuKristofer E BouchardLin Gu Weidong Cai Shuiwang Ji Badrinath Roysam Ching-Wei Wang Hongchuan YuAmos SironiDaniel Maxim IasconeJie ZhouErhan BasEduardo Conde-SousaPaulo AguiarXiang LiYujie LiSumit NandaYuan Wang Leila MuresanPascal FuaBing YeHai-Yan HeJochen F StaigerManuel PeterDaniel N CoxMichel SimonneauMarcel OberlaenderGregory S X E JefferisKei ItoPaloma Gonzalez-BellidoJinhyun KimEdwin RubelHollis T Cline Hongkui Zeng Aljoscha Nern Ann-Shyn ChiangJianhua YaoJane RoskamsRick LiveseyJanine StevensTianming LiuChinh DangYike GuoNing ZhongGeorgia D Tourassi Sean L Hill Michael J Hawrylycz Christof Koch Erik Meijering Giorgio A Ascoli Hanchuan Peng

Published in: Nature methods (2023)

BigNeuron is an open community bench-testing platform with the goal of setting open standards for accurate and fast automatic neuron tracing. We gathered a diverse set of image volumes across several species that is representative of the data obtained in many neuroscience laboratories interested in neuron tracing. Here, we report generated gold standard manual annotations for a subset of the available imaging datasets and quantified tracing quality for 35 automatic tracing algorithms. The goal of generating such a hand-curated diverse dataset is to advance the development of tracing algorithms and enable generalizable benchmarking. Together with image quality features, we pooled the data in an interactive web application that enables users and developers to perform principal component analysis, t-distributed stochastic neighbor embedding, correlation and clustering, visualization of imaging and tracing data, and benchmarking of automatic tracing algorithms in user-defined data subsets. The image quality metrics explain most of the variance in the data, followed by neuromorphological features related to neuron size. We observed that diverse algorithms can provide complementary information to obtain accurate results and developed a method to iteratively combine methods and generate consensus reconstructions. The consensus trees obtained provide estimates of the neuron structure ground truth that typically outperform single algorithms in noisy datasets. However, specific algorithms may outperform the consensus tree strategy in specific imaging conditions. Finally, to aid users in predicting the most accurate automatic tracing results without manual annotations for comparison, we used support vector machine regression to predict reconstruction quality given an image volume and a set of automatic tracings.

Keyphrases