A Gene Correlation Measurement Method for Spatial Transcriptome Data Based on Partitioning and Distribution.
Xiaoshu ZhuLiyuan PangXiaojun DingWei LanShuang MengXiaoqing PengPublished in: Journal of computational biology : a journal of computational molecular cell biology (2023)
Spatial transcriptome (ST) technology provides both the spatial location and transcriptional profile of spots, as well as tissue images. ST data can be utilized to construct gene regulatory networks, which can help identify gene modules that facilitate the understanding of biological processes such as cell communication. Correlation measurement is the core basis for constructing a gene regulatory network. However, due to the high noise and sparsity in ST data, common correlation measurement methods such as the Pearson correlation coefficient (PCC) and Spearman correlation coefficient (SPCC) are not suitable. In this work, a new gene correlation measurement method called STgcor is proposed. STgcor defines vertexes as spots in a two-dimensional coordinate plane consisting of axes X and Y from the gene pair ( X and Y ). The joint probability density of Gaussian distribution of the gene pair ( X and Y ) is calculated to identify and eliminate outliers. To overcome sparsity, the degree, trend, and location of the distribution of vertexes are used to measure the correlation between gene pairs ( X , Y ). To validate the performance of the STgcor method, it is compared with the PCC and SPCC in a weighted coexpression network analysis method using two ST datasets of breast cancer and prostate cancer. The gene modules identified by these methods are then compared and analyzed. The results show that the STgcor method detects some special gene modules and cancer-related pathways that cannot be detected by the other two methods.