A novel batch-effect correction method for scRNA-seq data based on Adversarial Information Factorization.
Lily MonnierPaul-Henry CournedePublished in: PLoS computational biology (2024)
Single-cell RNA sequencing (scRNA-seq) technology produces an unprecedented resolution at the level of a unique cell, raising great hopes in medicine. Nevertheless, scRNA-seq data suffer from high variations due to the experimental conditions, called batch effects, preventing any aggregated downstream analysis. Adversarial Information Factorization provides a robust batch-effect correction method that does not rely on prior knowledge of the cell types nor a specific normalization strategy while being adapted to any downstream analysis task. It compares to and even outperforms state-of-the-art methods in several scenarios: low signal-to-noise ratio, batch-specific cell types with few cells, and a multi-batches dataset with imbalanced batches and batch-specific cell types. Moreover, it best preserves the relative gene expression between cell types, yielding superior differential expression analysis results. Finally, in a more complex setting of a Leukemia cohort, our method preserved most of the underlying biological information for each patient while aligning the batches, improving the clustering metrics in the aggregated dataset.
Keyphrases
- single cell
- rna seq
- gene expression
- high throughput
- cell therapy
- acute myeloid leukemia
- dna methylation
- climate change
- electronic health record
- healthcare
- big data
- mesenchymal stem cells
- air pollution
- induced apoptosis
- cell proliferation
- signaling pathway
- health information
- anaerobic digestion
- machine learning
- deep learning
- social media
- case report