Integrative analysis of DNA methylation and gene expression identified cervical cancer-specific diagnostic biomarkers.
Wanxue XuMengyao XuLonglong WangWei ZhouRong XiangYi ShiYunshan ZhangYongjun PiaoPublished in: Signal transduction and targeted therapy (2019)
Cervical cancer is the leading cause of death among women with cancer worldwide. Here, we performed an integrative analysis of Illumina HumanMethylation450K and RNA-seq data from TCGA to identify cervical cancer-specific DNA methylation markers. We first identified differentially methylated and expressed genes and examined the correlation between DNA methylation and gene expression. The DNA methylation profiles of 12 types of cancers, including cervical cancer, were used to generate a candidate set, and machine-learning techniques were adopted to define the final cervical cancer-specific markers in the candidate set. Then, we assessed the protein levels of marker genes by immunohistochemistry by using tissue arrays containing 93 human cervical squamous cell carcinoma samples and cancer-adjacent normal tissues. Promoter methylation was negatively correlated with the local regulation of gene expression. In the distant regulation of gene expression, the methylation of hypermethylated genes was more likely to be negatively correlated with gene expression, while the methylation of hypomethylated genes was more likely to be positively correlated with gene expression. Moreover, we identified four cervical cancer-specific methylation markers, cg07211381 (RAB3C), cg12205729 (GABRA2), cg20708961 (ZNF257), and cg26490054 (SLC5A8), with 96.2% sensitivity and 95.2% specificity by using the tenfold cross-validation of TCGA data. The four markers could distinguish tumors from normal tissues with a 94.2, 100, 100, and 100% AUC in four independent validation sets from the GEO database. Overall, our study demonstrates the potential use of methylation markers in cervical cancer diagnosis and may boost the development of new epigenetic therapies.
Keyphrases
- dna methylation
- gene expression
- genome wide
- rna seq
- squamous cell carcinoma
- copy number
- machine learning
- papillary thyroid
- big data
- electronic health record
- lymph node
- artificial intelligence
- childhood cancer
- genome wide analysis
- squamous cell
- climate change
- amino acid
- genome wide identification
- risk assessment
- locally advanced
- rectal cancer