Login / Signup

Library size confounds biology in spatial transcriptomics data.

Dharmesh D BhuvaChin Wee TanAgus SalimClaire MarceauxMarie A PickeringJinjin ChenMalvika KharbandaXinyi JinNing LiuKristen FeherGivanna PutriWayne D TilleyTheresa E HickeyMarie-Liesse Asselin-LabatBelinda PhipsonMelissa J Davis
Published in: Genome biology (2024)
Spatial molecular data has transformed the study of disease microenvironments, though, larger datasets pose an analytics challenge prompting the direct adoption of single-cell RNA-sequencing tools including normalization methods. Here, we demonstrate that library size is associated with tissue structure and that normalizing these effects out using commonly applied scRNA-seq normalization methods will negatively affect spatial domain identification. Spatial data should not be specifically corrected for library size prior to analysis, and algorithms designed for scRNA-seq data should be adopted with caution.
Keyphrases
  • single cell
  • rna seq
  • electronic health record
  • big data
  • machine learning
  • high throughput
  • genome wide
  • gene expression
  • single molecule