DELVE: feature selection for preserving biological trajectories in single-cell data.
Jolene S RanekWayne StallaertJ Justin MilnerMargaret A RedickSamuel C WolffAdriana S BeltranNatalie StanleyJeremy E PurvisPublished in: Nature communications (2024)
Single-cell technologies can measure the expression of thousands of molecular features in individual cells undergoing dynamic biological processes. While examining cells along a computationally-ordered pseudotime trajectory can reveal how changes in gene or protein expression impact cell fate, identifying such dynamic features is challenging due to the inherent noise in single-cell data. Here, we present DELVE, an unsupervised feature selection method for identifying a representative subset of molecular features which robustly recapitulate cellular trajectories. In contrast to previous work, DELVE uses a bottom-up approach to mitigate the effects of confounding sources of variation, and instead models cell states from dynamic gene or protein modules based on core regulatory complexes. Using simulations, single-cell RNA sequencing, and iterative immunofluorescence imaging data in the context of cell cycle and cellular differentiation, we demonstrate how DELVE selects features that better define cell-types and cell-type transitions. DELVE is available as an open-source python package: https://github.com/jranek/delve .
Keyphrases
- single cell
- rna seq
- cell cycle
- induced apoptosis
- high throughput
- machine learning
- electronic health record
- cell cycle arrest
- cell fate
- big data
- genome wide
- copy number
- poor prognosis
- signaling pathway
- transcription factor
- endoplasmic reticulum stress
- binding protein
- single molecule
- data analysis
- air pollution
- oxidative stress
- small molecule
- magnetic resonance imaging
- gene expression
- drinking water
- cross sectional
- long non coding rna
- artificial intelligence
- fluorescence imaging
- mesenchymal stem cells
- contrast enhanced
- neural network