A scalable and modular automated pipeline for stitching of large electron microscopy datasets.
Gayathri MahalingamRussel M TorresDaniel KapnerEric T TrautmanTim FlissShamishtaa SeshamaniEric PerlmanRob YoungSamuel KinnJoAnn BuchananMarc M TakenoWenjing YinDaniel J BumbargerRyder P GwinnJulie NyhusEd LeinStephen J SmithR Clay ReidKhaled A KhairyStephan SaalfeldForrest C CollmanNuno Maçarico da CostaPublished in: eLife (2022)
Serial-section electron microscopy (ssEM) is the method of choice for studying macroscopic biological samples at extremely high resolution in three dimensions. In the nervous system, nanometer-scale images are necessary to reconstruct dense neural wiring diagrams in the brain, so -called connectomes . The data that can comprise of up to 10 8 individual EM images must be assembled into a volume, requiring seamless 2D registration from physical section followed by 3D alignment of the stitched sections. The high throughput of ssEM necessitates 2D stitching to be done at the pace of imaging, which currently produces tens of terabytes per day. To achieve this, we present a modular volume assembly software pipeline ASAP (Assembly Stitching and Alignment Pipeline) that is scalable to datasets containing petabytes of data and parallelized to work in a distributed computational environment. The pipeline is built on top of the Render Trautman and Saalfeld (2019) services used in the volume assembly of the brain of adult Drosophila melanogaster (Zheng et al. 2018). It achieves high throughput by operating only on image meta-data and transformations. ASAP is modular, allowing for easy incorporation of new algorithms without significant changes in the workflow. The entire software pipeline includes a complete set of tools for stitching, automated quality control, 3D section alignment, and final rendering of the assembled volume to disk. ASAP has been deployed for continuous stitching of several large-scale datasets of the mouse visual cortex and human brain samples including one cubic millimeter of mouse visual cortex (Yin et al. 2020); Microns Consortium et al. (2021) at speeds that exceed imaging. The pipeline also has multi-channel processing capabilities and can be applied to fluorescence and multi-modal datasets like array tomography.
Keyphrases
- high throughput
- electron microscopy
- deep learning
- high resolution
- electronic health record
- machine learning
- single cell
- rna seq
- quality control
- big data
- drosophila melanogaster
- data analysis
- convolutional neural network
- artificial intelligence
- mental health
- white matter
- healthcare
- primary care
- resting state
- optical coherence tomography
- multiple sclerosis
- mass spectrometry
- functional connectivity
- quantum dots
- blood brain barrier
- young adults
- decision making
- photodynamic therapy
- neural network