Login / Signup

A First Computational Frame for Recognizing Heparin-Binding Protein.

Wen ZhuShi-Shi YuanJian LiCheng-Bing HuangHao LinBo Liao
Published in: Diagnostics (Basel, Switzerland) (2023)
Heparin-binding protein (HBP) is a cationic antibacterial protein derived from multinuclear neutrophils and an important biomarker of infectious diseases. The correct identification of HBP is of great significance to the study of infectious diseases. This work provides the first HBP recognition framework based on machine learning to accurately identify HBP. By using four sequence descriptors, HBP and non-HBP samples were represented by discrete numbers. By inputting these features into a support vector machine (SVM) and random forest (RF) algorithm and comparing the prediction performances of these methods on training data and independent test data, it is found that the SVM-based classifier has the greatest potential to identify HBP. The model could produce an auROC of 0.981 ± 0.028 on training data using 10-fold cross-validation and an overall accuracy of 95.0% on independent test data. As the first model for HBP recognition, it will provide some help for infectious diseases and stimulate further research in related fields.
Keyphrases
  • infectious diseases
  • binding protein
  • machine learning
  • big data
  • electronic health record
  • deep learning
  • artificial intelligence
  • growth factor
  • virtual reality
  • data analysis
  • bioinformatics analysis