cytometree: A binary tree algorithm for automatic gating in cytometry analysis.
Daniel CommengesChariff AlkhassimRaphaël GottardoBoris HejblumRodolphe ThiébautPublished in: Cytometry. Part A : the journal of the International Society for Analytical Cytology (2018)
Flow cytometry is a powerful technology that allows the high-throughput quantification of dozens of surface and intracellular proteins at the single-cell level. It has become the most widely used technology for immunophenotyping of cells over the past three decades. Due to the increasing complexity of cytometry experiments (more cells and more markers), traditional manual flow cytometry data analysis has become untenable due to its subjectivity and time-consuming nature. We present a new unsupervised algorithm called "cytometree" to perform automated population identification (aka gating) in flow cytometry. cytometree is based on the construction of a binary tree, the nodes of which are subpopulations of cells. At each node, the marker distributions are modeled by mixtures of normal distributions. Node splitting is done according to a model selection procedure based on a normalized difference of Akaike information criteria between two competing models. Post-processing of the tree structure and derived populations allows us to complete the annotation of the populations. The algorithm is shown to perform better than the state-of-the-art unsupervised algorithms previously proposed on panels introduced by the Flow Cytometry: Critical Assessment of Population Identification Methods project. The algorithm is also applied to a T-cell panel proposed by the Human Immunology Project Consortium (HIPC) program; it also outperforms the best unsupervised open-source available algorithm while requiring the shortest computation time. © 2018 International Society for Advancement of Cytometry.
Keyphrases
- flow cytometry
- machine learning
- single cell
- deep learning
- induced apoptosis
- high throughput
- cell cycle arrest
- rna seq
- quality improvement
- data analysis
- lymph node
- endothelial cells
- endoplasmic reticulum stress
- neural network
- ionic liquid
- squamous cell carcinoma
- radiation therapy
- social media
- induced pluripotent stem cells
- pi k akt
- health information
- rectal cancer
- monte carlo