Deep Learning for Human Disease Detection, Subtype Classification, and Treatment Response Prediction Using Epigenomic Data.
Thi Mai NguyenNackhyoung KimDa Hae KimHoang Long LeMd Jalil PiranSoo-Jong UmJin Hee KimPublished in: Biomedicines (2021)
Deep learning (DL) is a distinct class of machine learning that has achieved first-class performance in many fields of study. For epigenomics, the application of DL to assist physicians and scientists in human disease-relevant prediction tasks has been relatively unexplored until very recently. In this article, we critically review published studies that employed DL models to predict disease detection, subtype classification, and treatment responses, using epigenomic data. A comprehensive search on PubMed, Scopus, Web of Science, Google Scholar, and arXiv.org was performed following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines. Among 1140 initially identified publications, we included 22 articles in our review. DNA methylation and RNA-sequencing data are most frequently used to train the predictive models. The reviewed models achieved a high accuracy ranged from 88.3% to 100.0% for disease detection tasks, from 69.5% to 97.8% for subtype classification tasks, and from 80.0% to 93.0% for treatment response prediction tasks. We generated a workflow to develop a predictive model that encompasses all steps from first defining human disease-related tasks to finally evaluating model performance. DL holds promise for transforming epigenomic big data into valuable knowledge that will enhance the development of translational epigenomics.
Keyphrases
- big data
- machine learning
- deep learning
- artificial intelligence
- working memory
- endothelial cells
- dna methylation
- systematic review
- meta analyses
- electronic health record
- healthcare
- primary care
- induced pluripotent stem cells
- public health
- randomized controlled trial
- gene expression
- convolutional neural network
- single cell
- high speed