DELVE: feature selection for preserving biological trajectories in single-cell data.
Jolene S RanekWayne StallaertJ Justin MilnerMargaret A RedickSamuel C WolffAdriana S BeltranNatalie StanleyJeremy E PurvisPublished in: Nature communications (2024)
Single-cell technologies can measure the expression of thousands of molecular features in individual cells undergoing dynamic biological processes. While examining cells along a computationally-ordered pseudotime trajectory can reveal how changes in gene or protein expression impact cell fate, identifying such dynamic features is challenging due to the inherent noise in single-cell data. Here, we present DELVE, an unsupervised feature selection method for identifying a representative subset of molecular features which robustly recapitulate cellular trajectories. In contrast to previous work, DELVE uses a bottom-up approach to mitigate the effects of confounding sources of variation, and instead models cell states from dynamic gene or protein modules based on core regulatory complexes. Using simulations, single-cell RNA sequencing, and iterative immunofluorescence imaging data in the context of cell cycle and cellular differentiation, we demonstrate how DELVE selects features that better define cell-types and cell-type transitions. DELVE is available as an open-source python package: https://github.com/jranek/delve .
Keyphrases
- single cell
- rna seq
- cell cycle
- induced apoptosis
- high throughput
- machine learning
- electronic health record
- cell cycle arrest
- big data
- cell fate
- cell proliferation
- depressive symptoms
- copy number
- genome wide
- high resolution
- poor prognosis
- endoplasmic reticulum stress
- deep learning
- oxidative stress
- transcription factor
- single molecule
- signaling pathway
- air pollution
- binding protein
- cross sectional
- mass spectrometry
- protein protein
- genome wide identification
- bone marrow
- fluorescence imaging
- network analysis