Convergent learning-based model for leukemia classification from gene expression.
Pradeep Kumar MallickSaumendra Kumar MohapatraGyoo-Soo ChaeMihir Narayan MohantyPublished in: Personal and ubiquitous computing (2020)
Microarray data analysis is a major challenging field of research in recent days. Machine learning-based automated gene data classification is an essential aspect for diagnosis of gene related any malfunctions and diseases. As the size of the data is very large, it is essential to design a suitable classifier that can process huge amount of data. Deep learning is one of the advanced machine learning techniques to mitigate these types of problems. Due the presence of more number of hidden layers, it can easily handle the big amount of data. We have presented a method of classification to understand the convergence of training deep neural network (DNN). The assumptions are taken as the inputs do not degenerate and the network is over-parameterized. Also the number of hidden neurons is sufficiently large. Authors in this piece of work have used DNN for classifying the gene expressions data. The dataset used in the work contains the bone marrow expressions of 72 leukemia patients. A five-layer DNN classifier is designed for classifying acute lymphocyte (ALL) and acute myelocytic (AML) samples. The network is trained with 80% data and rest 20% data is considered for validation purpose. Proposed DNN classifier is providing a satisfactory result as compared to other classifiers. Two types of leukemia are classified with 98.2% accuracy, 96.59% sensitivity, and 97.9% specificity. The different types of computer-aided analyses of genes can be helpful to genetic and virology researchers as well in future generation.
Keyphrases
- machine learning
- big data
- deep learning
- data analysis
- bone marrow
- electronic health record
- gene expression
- genome wide
- artificial intelligence
- copy number
- end stage renal disease
- liver failure
- chronic kidney disease
- dna methylation
- newly diagnosed
- mesenchymal stem cells
- ejection fraction
- spinal cord
- hepatitis b virus
- intensive care unit
- genome wide identification
- mental health
- peritoneal dialysis
- high throughput
- prognostic factors
- spinal cord injury
- transcription factor
- extracorporeal membrane oxygenation
- allogeneic hematopoietic stem cell transplantation
- acute lymphoblastic leukemia
- resistance training