BIDCell: Biologically-informed self-supervised learning for segmentation of subcellular spatial transcriptomics data.
Xiaohang FuYingxin LinDavid M LinDaniel MechtersheimerChuhan WangFarhan AmeenShila GhazanfarEllis PatrickJinman KimJean Yee Hwa YangPublished in: Nature communications (2024)
Recent advances in subcellular imaging transcriptomics platforms have enabled high-resolution spatial mapping of gene expression, while also introducing significant analytical challenges in accurately identifying cells and assigning transcripts. Existing methods grapple with cell segmentation, frequently leading to fragmented cells or oversized cells that capture contaminated expression. To this end, we present BIDCell, a self-supervised deep learning-based framework with biologically-informed loss functions that learn relationships between spatially resolved gene expression and cell morphology. BIDCell incorporates cell-type data, including single-cell transcriptomics data from public repositories, with cell morphology information. Using a comprehensive evaluation framework consisting of metrics in five complementary categories for cell segmentation performance, we demonstrate that BIDCell outperforms other state-of-the-art methods according to many metrics across a variety of tissue types and technology platforms. Our findings underscore the potential of BIDCell to significantly enhance single-cell spatial expression analyses, enabling great potential in biological discovery.
Keyphrases
- single cell
- rna seq
- deep learning
- gene expression
- high resolution
- high throughput
- induced apoptosis
- cell cycle arrest
- machine learning
- cell therapy
- poor prognosis
- convolutional neural network
- healthcare
- dna methylation
- emergency department
- cell proliferation
- mental health
- artificial intelligence
- endoplasmic reticulum stress
- cell death
- heavy metals
- bone marrow
- drinking water
- binding protein
- social media
- liquid chromatography
- tandem mass spectrometry