Gene count normalization in single-cell imaging-based spatially resolved transcriptomics.
Lyla AttaKalen CliftonManjari AnantJean FanPublished in: bioRxiv : the preprint server for biology (2023)
Recent advances in imaging-based spatially resolved transcriptomics (im-SRT) technologies now enable high-throughput profiling of targeted genes and their locations in fixed tissues. Normalization of gene expression data is often needed to account for technical factors that may confound underlying biological signals. Here, we investigate the potential impact of different gene count normalization methods with different targeted gene panels in the analysis and interpretation of im-SRT data. Using different simulated gene panels that overrepresent genes expressed in specific tissue anatomical regions or cell types, we find that normalization methods that use scaling factors derived from gene counts differentially impact normalized gene expression magnitudes in a region- or cell type-specific manner. We show that these normalization-induced effects may reduce the reliability of downstream differential gene expression and fold change analysis, introducing false positive and false negative results when compared to results obtained from gene panels that are more representative of the gene expression of the tissue's component cell types. These effects are not observed without normalization or when scaling factors are not derived from gene counts, such as with cell volume normalization. Overall, we caution that the choice of normalization method and gene panel may impact the biological interpretation of the im-SRT data.
Keyphrases
- single cell
- gene expression
- genome wide
- genome wide identification
- copy number
- dna methylation
- high throughput
- stem cells
- electronic health record
- cell therapy
- mesenchymal stem cells
- machine learning
- cross sectional
- transcription factor
- drug delivery
- big data
- high glucose
- artificial intelligence
- photodynamic therapy
- drug induced
- diabetic rats
- decision making
- deep learning
- bioinformatics analysis