micronuclAI: Automated quantification of micronuclei for assessment of chromosomal instability.
Miguel A Ibarra-ArellanoLindsay A CaprioAroj HadaNiklas StotzemLuke L CaiShivem B ShahJohannes C MelmsFlorian WünnemanBenjamin IzarDenis SchapiroPublished in: bioRxiv : the preprint server for biology (2024)
Chromosomal instability (CIN) is a hallmark of cancer that drives metastasis, immune evasion and treatment resistance. CIN results from chromosome mis-segregation events during anaphase, as excessive chromatin is packaged in micronuclei (MN), that can be enumerated to quantify CIN. Despite recent advancements in automation through computer vision and machine learning, the assessment of CIN remains a predominantly manual and time-consuming task, thus hampering important work in the field. Here, we present micronuclAI , a novel pipeline for automated and reliable quantification of MN of varying size, morphology and location from DNA-only stained images. In micronucleAI , single-cell crops are extracted from high-resolution microscopy images with the help of segmentation masks, which are then used to train a convolutional neural network (CNN) to output the number of MN associated with each cell. The pipeline was evaluated against manual single-cell level counts by experts and against routinely used MN ratio within the complete image. The classifier was able to achieve a weighted F1 score of 0.937 on the test dataset and the complete pipeline can achieve close to human-level performance on various datasets derived from multiple human and murine cancer cell lines. The pipeline achieved a root-mean-square deviation (RMSE) value of 0.0041, an R 2 of 0.87 and a Pearson's correlation of 0.938 on images obtained at 10X magnification. We tested the approach in otherwise isogenic cell lines in which we genetically dialed up or down CIN rates, and also on a publicly available image data set (obtained at 100X) and achieved an RMSE value of 0.0159, an R 2 of 0.90, and a Pearson's correlation of 0.951. Given the increasing interest in developing therapies for CIN-driven cancers, this method provides an important, scalable, and rapid approach to quantifying CIN on routinely obtained images. We release a GUI-implementation for easy access and utilization of the pipeline.
Keyphrases
- deep learning
- convolutional neural network
- single cell
- machine learning
- artificial intelligence
- high resolution
- rna seq
- endothelial cells
- papillary thyroid
- high throughput
- room temperature
- healthcare
- copy number
- gene expression
- dna damage
- transcription factor
- primary care
- high speed
- optical coherence tomography
- squamous cell carcinoma
- magnetic resonance imaging
- peripheral blood
- childhood cancer
- bone marrow
- lymph node metastasis
- young adults
- electronic health record
- magnetic resonance
- weight gain
- ionic liquid
- quality improvement
- network analysis