Artificial Intelligence Analysis of Celiac Disease Using an Autoimmune Discovery Transcriptomic Panel Highlighted Pathogenic Genes including BTLA.
Joaquim CarrerasPublished in: Healthcare (Basel, Switzerland) (2022)
Celiac disease is a common immune-related inflammatory disease of the small intestine caused by gluten in genetically predisposed individuals. This research is a proof-of-concept exercise focused on using Artificial Intelligence (AI) and an autoimmune discovery gene panel to predict and model celiac disease. Conventional bioinformatics, gene set enrichment analysis (GSEA), and several machine learning and neural network techniques were used on a publicly available dataset (GSE164883). Machine learning and deep learning included C5, logistic regression, Bayesian network, discriminant analysis, KNN algorithm, LSVM, random trees, SVM, Tree-AS, XGBoost linear, XGBoost tree, CHAID, Quest, C&R tree, random forest, and neural network (multilayer perceptron). As a result, the gene panel predicted celiac disease with high accuracy (95-100%). Several pathogenic genes were identified, some of the immune checkpoint and immuno-oncology pathways. They included CASP3 , CD86 , CTLA4 , FASLG , GZMB , IFNG , IL15RA , ITGAX , LAG3 , MMP3 , MUC1 , MYD88 , PRDM1 , RGS1 , etc. Among them, B and T lymphocyte associated (BTLA, CD272) was highlighted and validated at the protein level by immunohistochemistry in an independent series of cases. Celiac disease was characterized by high BTLA, expressed by inflammatory cells of the lamina propria. In conclusion, artificial intelligence predicted celiac disease using an autoimmune discovery gene panel.
Keyphrases
- celiac disease
- artificial intelligence
- machine learning
- neural network
- deep learning
- big data
- genome wide
- genome wide identification
- copy number
- small molecule
- multiple sclerosis
- dna methylation
- genome wide analysis
- convolutional neural network
- high throughput
- oxidative stress
- induced apoptosis
- transcription factor
- drug induced
- immune response
- gene expression
- toll like receptor
- rna seq
- climate change
- cell death
- signaling pathway
- protein protein
- endoplasmic reticulum stress
- systemic sclerosis
- network analysis
- idiopathic pulmonary fibrosis