Gene Identification in Inflammatory Bowel Disease via a Machine Learning Approach.
Gerardo Alfonso PerezRaquel CastilloPublished in: Medicina (Kaunas, Lithuania) (2023)
Inflammatory bowel disease (IBD) is an illness with increasing prevalence, particularly in emerging countries, which can have a substantial impact on the quality of life of the patient. The illness is rather heterogeneous with different evolution among patients. A machine learning approach is followed in this paper to identify potential genes that are related to IBD. This is done by following a Monte Carlo simulation approach. In total, 23 different machine learning techniques were tested (in addition to a base level obtained using artificial neural networks). The best model identified 74 genes selected by the algorithm as being potentially involved in IBD. IBD seems to be a polygenic illness, in which environmental factors might play an important role. Following a machine learning approach, it was possible to obtain a classification accuracy of 84.2% differentiating between patients with IBD and control cases in a large cohort of 2490 total cases. The sensitivity and specificity of the model were 82.6% and 84.4%, respectively. It was also possible to distinguish between the two main types of IBD: (1) Crohn's disease and (2) ulcerative colitis.
Keyphrases
- machine learning
- ulcerative colitis
- artificial intelligence
- big data
- neural network
- deep learning
- genome wide
- monte carlo
- genome wide identification
- bioinformatics analysis
- magnetic resonance imaging
- risk factors
- dna methylation
- gene expression
- risk assessment
- magnetic resonance
- computed tomography
- copy number
- transcription factor
- human health