Computational Characterization of Undifferentially Expressed Genes with Altered Transcription Regulation in Lung Cancer.
Ruihao XinQian ChengXiaohang ChiXin FengHang ZhangYueying WangMeiyu DuanTunyang XieXiaonan SongQiong YuYusi FanLan HuangFengfeng ZhouPublished in: Genes (2023)
A transcriptome profiles the expression levels of genes in cells and has accumulated a huge amount of public data. Most of the existing biomarker-related studies investigated the differential expression of individual transcriptomic features under the assumption of inter-feature independence. Many transcriptomic features without differential expression were ignored from the biomarker lists. This study proposed a computational analysis protocol (mqTrans) to analyze transcriptomes from the view of high-dimensional inter-feature correlations. The mqTrans protocol trained a regression model to predict the expression of an mRNA feature from those of the transcription factors (TFs). The difference between the predicted and real expression of an mRNA feature in a query sample was defined as the mqTrans feature. The new mqTrans view facilitated the detection of thirteen transcriptomic features with differentially expressed mqTrans features, but without differential expression in the original transcriptomic values in three independent datasets of lung cancer. These features were called dark biomarkers because they would have been ignored in a conventional differential analysis. The detailed discussion of one dark biomarker, GBP5, and additional validation experiments suggested that the overlapping long non-coding RNAs might have contributed to this interesting phenomenon. In summary, this study aimed to find undifferentially expressed genes with significantly changed mqTrans values in lung cancer. These genes were usually ignored in most biomarker detection studies of undifferential expression. However, their differentially expressed mqTrans values in three independent datasets suggested their strong associations with lung cancer.
Keyphrases
- poor prognosis
- single cell
- rna seq
- long non coding rna
- genome wide
- machine learning
- deep learning
- binding protein
- transcription factor
- genome wide identification
- randomized controlled trial
- bioinformatics analysis
- emergency department
- healthcare
- dna methylation
- induced apoptosis
- mental health
- cell death
- gene expression
- loop mediated isothermal amplification
- electronic health record
- label free
- artificial intelligence
- oxidative stress
- body composition
- cell proliferation
- pi k akt
- case control
- signaling pathway
- data analysis
- cell cycle arrest