BIDCell: Biologically-informed self-supervised learning for segmentation of subcellular spatial transcriptomics data.
Xiaohang FuYingxin LinDavid M LinDaniel MechtersheimerChuhan WangFarhan AmeenShila GhazanfarEllis PatrickJinman KimJean Yee Hwa YangPublished in: Nature communications (2024)
Recent advances in subcellular imaging transcriptomics platforms have enabled high-resolution spatial mapping of gene expression, while also introducing significant analytical challenges in accurately identifying cells and assigning transcripts. Existing methods grapple with cell segmentation, frequently leading to fragmented cells or oversized cells that capture contaminated expression. To this end, we present BIDCell, a self-supervised deep learning-based framework with biologically-informed loss functions that learn relationships between spatially resolved gene expression and cell morphology. BIDCell incorporates cell-type data, including single-cell transcriptomics data from public repositories, with cell morphology information. Using a comprehensive evaluation framework consisting of metrics in five complementary categories for cell segmentation performance, we demonstrate that BIDCell outperforms other state-of-the-art methods according to many metrics across a variety of tissue types and technology platforms. Our findings underscore the potential of BIDCell to significantly enhance single-cell spatial expression analyses, enabling great potential in biological discovery.
Keyphrases
- single cell
- rna seq
- deep learning
- gene expression
- high resolution
- high throughput
- induced apoptosis
- convolutional neural network
- cell cycle arrest
- poor prognosis
- electronic health record
- cell therapy
- dna methylation
- big data
- healthcare
- mass spectrometry
- artificial intelligence
- emergency department
- risk assessment
- mental health
- climate change
- cell proliferation
- binding protein
- human health
- cell death
- long non coding rna
- oxidative stress
- health information