MATTE: a pipeline of transcriptome module alignment for anti-noise phenotype-gene-related analysis.
Guoxin CaiWenyi ZhaoZhan ZhouXun GuPublished in: Briefings in bioinformatics (2023)
A phenotype may be associated with multiple genes that interact with each other in the form of a gene module or network. How to identify these relationships is one important aspect of comparative transcriptomics. However, it is still a challenge to align gene modules associated with different phenotypes. Although several studies attempted to address this issue in different aspects, a general framework is still needed. In this study, we introduce Module Alignment of TranscripTomE (MATTE), a novel approach to analyze transcriptomics data and identify differences in a modular manner. MATTE assumes that gene interactions modulate a phenotype and models phenotype differences as gene location changes. Specifically, we first represented genes by a relative differential expression to reduce the influence of noise in omics data. Meanwhile, clustering and aligning are combined to depict gene differences in a modular way robustly. The results show that MATTE outperformed state-of-the-art methods in identifying differentially expressed genes under noise in gene expression. In particular, MATTE could also deal with single-cell ribonucleic acid-seq data to extract the best cell-type marker genes compared to other methods. Additionally, we demonstrate how MATTE supports the discovery of biologically significant genes and modules, and facilitates downstream analyses to gain insight into breast cancer. The source code of MATTE and case analysis are available at https://github.com/zjupgx/MATTE.