Mining Bovine Milk Proteins for DPP-4 Inhibitory Peptides Using Machine Learning and Virtual Proteolysis.
Yiyun ZhangYiqing ZhuXin BaoZijian DaiQun ShenLiyang WangYong XuePublished in: Research (Washington, D.C.) (2024)
Dipeptidyl peptidase-IV (DPP-4) enzyme inhibitors are a promising category of diabetes medications. Bioactive peptides, particularly those derived from bovine milk proteins, play crucial roles in inhibiting the DPP-4 enzyme. This study describes a comprehensive strategy for DPP-4 inhibitory peptide discovery and validation that combines machine learning and virtual proteolysis techniques. Five machine learning models, including GBDT, XGBoost, LightGBM, CatBoost, and RF, were trained. Notably, LightGBM demonstrated superior performance with an AUC value of 0.92 ± 0.01. Subsequently, LightGBM was employed to forecast the DPP-4 inhibitory potential of peptides generated through virtual proteolysis of milk proteins. Through a series of in silico screening process and in vitro experiments, GPVRGPF and HPHPHL were found to exhibit good DPP-4 inhibitory activity. Molecular docking and molecular dynamics simulations further confirmed the inhibitory mechanisms of these peptides. Through retracing the virtual proteolysis steps, it was found that GPVRGPF can be obtained from β-casein through enzymatic hydrolysis by chymotrypsin, while HPHPHL can be obtained from κ-casein through enzymatic hydrolysis by stem bromelain or papain. In summary, the integration of machine learning and virtual proteolysis techniques can aid in the preliminary determination of key hydrolysis parameters and facilitate the efficient screening of bioactive peptides.
Keyphrases
- molecular docking
- machine learning
- molecular dynamics simulations
- amino acid
- artificial intelligence
- type diabetes
- cardiovascular disease
- big data
- anaerobic digestion
- signaling pathway
- small molecule
- deep learning
- high throughput
- nitric oxide
- metabolic syndrome
- high resolution
- molecularly imprinted
- skeletal muscle
- mass spectrometry
- single cell
- solid phase extraction