Anti-correlated feature selection prevents false discovery of subpopulations in scRNAseq.
Scott R TylerDaniel Lozano-OjalvoErnesto GuccioneEric E SchadtPublished in: Nature communications (2024)
While sub-clustering cell-populations has become popular in single cell-omics, negative controls for this process are lacking. Popular feature-selection/clustering algorithms fail the null-dataset problem, allowing erroneous subdivisions of homogenous clusters until nearly each cell is called its own cluster. Using real and synthetic datasets, we find that anti-correlated gene selection reduces or eliminates erroneous subdivisions, increases marker-gene selection efficacy, and efficiently scales to millions of cells.