BIDCell: Biologically-informed self-supervised learning for segmentation of subcellular spatial transcriptomics data.
Xiaohang FuYingxin LinDavid M LinDaniel MechtersheimerChuhan WangFarhan AmeenShila GhazanfarEllis PatrickJinman KimJean Yee Hwa YangPublished in: Nature communications (2024)
Recent advances in subcellular imaging transcriptomics platforms have enabled high-resolution spatial mapping of gene expression, while also introducing significant analytical challenges in accurately identifying cells and assigning transcripts. Existing methods grapple with cell segmentation, frequently leading to fragmented cells or oversized cells that capture contaminated expression. To this end, we present BIDCell, a self-supervised deep learning-based framework with biologically-informed loss functions that learn relationships between spatially resolved gene expression and cell morphology. BIDCell incorporates cell-type data, including single-cell transcriptomics data from public repositories, with cell morphology information. Using a comprehensive evaluation framework consisting of metrics in five complementary categories for cell segmentation performance, we demonstrate that BIDCell outperforms other state-of-the-art methods according to many metrics across a variety of tissue types and technology platforms. Our findings underscore the potential of BIDCell to significantly enhance single-cell spatial expression analyses, enabling great potential in biological discovery.
Keyphrases
- single cell
- rna seq
- deep learning
- gene expression
- high resolution
- high throughput
- induced apoptosis
- cell cycle arrest
- convolutional neural network
- machine learning
- poor prognosis
- cell therapy
- dna methylation
- electronic health record
- healthcare
- big data
- mental health
- artificial intelligence
- emergency department
- mass spectrometry
- risk assessment
- cell death
- signaling pathway
- climate change
- bone marrow
- drinking water
- photodynamic therapy
- health information
- pi k akt
- data analysis