Development of a Machine Learning Model to Predict Non-Durable Response to Anti-TNF Therapy in Crohn's Disease Using Transcriptome Imputed from Genotypes.
Soo Kyung ParkYea Bean KimSangsoo KimChil-Woo LeeChang Hwan ChoiSang Bum KangTae Oh KimKi Bae BangJae-Young ChunJae Myung ChaJong-Pil ImMin-Suk KimKwang Sung AhnSeon-Young KimDong Il ParkPublished in: Journal of personalized medicine (2022)
Almost half of patients show no primary or secondary response to monoclonal anti-tumor necrosis factor α (anti-TNF) antibody treatment for inflammatory bowel disease (IBD). Thus, the exact mechanisms of a non-durable response (NDR) remain inadequately defined. We used our genome-wide genotype data to impute expression values as features in training machine learning models to predict a NDR. Blood samples from various IBD cohorts were used for genotyping with the Korea Biobank Array. A total of 234 patients with Crohn's disease (CD) who received their first anti-TNF therapy were enrolled. The expression profiles of 6294 genes in whole-blood tissue imputed from the genotype data were combined with clinical parameters to train a logistic model to predict the NDR. The top two and three most significant features were genetic features ( DPY19L3 , GSTT1 , and NUCB1 ), not clinical features. The logistic regression of the NDR vs. DR status in our cohort by the imputed expression levels showed that the β coefficients were positive for DPY19L3 and GSTT1 , and negative for NUCB1 , concordant with the known eQTL information. Machine learning models using imputed gene expression features effectively predicted NDR to anti-TNF agents in patients with CD.
Keyphrases
- genome wide
- machine learning
- rheumatoid arthritis
- gene expression
- dna methylation
- big data
- poor prognosis
- artificial intelligence
- high throughput
- newly diagnosed
- copy number
- deep learning
- prognostic factors
- social media
- single cell
- smoking cessation
- molecular dynamics
- genetic diversity
- virtual reality
- bioinformatics analysis