Automated, high-throughput quantification of EGFP-expressing neutrophils in zebrafish by machine learning and a highly-parallelized microscope.
John EfromsonGiuliano FerreroAurélien BègueThomas Jedidiah Jenks DomanClay DugoAndi BarkerVeton SaliuPaul ReameyKanghyun KimMark HarfoucheJeffrey A YoderPublished in: bioRxiv : the preprint server for biology (2023)
Normal development of the immune system is essential for overall health and disease resistance. Bony fish, such as the zebrafish ( Danio rerio ), possess all the major immune cell lineages as mammals and can be employed to model human host response to immune challenge. Zebrafish neutrophils, for example, are present in the transparent larvae as early as 48 hours post fertilization and have been examined in numerous infection and immunotoxicology reports. One significant advantage of the zebrafish model is the ability to affordably generate high numbers of individual larvae that can be arrayed in multi-well plates for high throughput genetic and chemical exposure screens. However, traditional workflows for imaging individual larvae have been limited to low-throughput studies using traditional microscopes and manual analyses. Using a newly developed, parallelized microscope, the Multi-Camera Array Microscope (MCAM™), we have optimized a rapid, high-resolution algorithmic method to count fluorescently labeled cells in zebrafish larvae in vivo . Using transgenic zebrafish larvae, in which neutrophils express EGFP, we captured 18 gigapixels of images across a full 96-well plate, in 75 seconds, and processed the resulting datastream, counting individual fluorescent neutrophils in all individual larvae in 5 minutes. This automation is facilitated by a machine learning segmentation algorithm that defines the most in-focus view of each larva in each well after which pixel intensity thresholding and blob detection are employed to locate and count fluorescent cells. We validated this method by comparing algorithmic neutrophil counts to manual counts in larvae subjected to changes in neutrophil numbers, demonstrating the utility of this approach for high-throughput genetic and chemical screens where a change in neutrophil number is an endpoint metric. Using the MCAM™ we have been able to, within minutes, acquire both enough data to create an automated algorithm and execute a biological experiment with statistical significance. Finally, we present this open-source software package which allows the user to train and evaluate a custom machine learning segmentation model and use it to localize zebrafish and analyze cell counts within the segmented region of interest. This software can be modified as needed for studies involving other zebrafish cell lineages using different transgenic reporter lines and can also be adapted for studies using other amenable model species.
Keyphrases
- high throughput
- machine learning
- deep learning
- single cell
- aedes aegypti
- high resolution
- drosophila melanogaster
- convolutional neural network
- artificial intelligence
- induced apoptosis
- healthcare
- genome wide
- big data
- zika virus
- public health
- dna methylation
- cell therapy
- mass spectrometry
- mental health
- cell cycle arrest
- emergency department
- case control
- crispr cas
- cell proliferation
- stem cells
- signaling pathway
- computed tomography
- mesenchymal stem cells
- data analysis
- pi k akt
- cell death
- bone marrow
- fluorescent probe
- high speed
- photodynamic therapy
- label free
- real time pcr