Login / Signup

DNAPred_Prot: Identification of DNA-Binding Proteins Using Composition- and Position-Based Features.

Omar BarukabYaser Daanial KhanSher Afzal KhanKuo-Chen Chou
Published in: Applied bionics and biomechanics (2022)
In the domain of genome annotation, the identification of DNA-binding protein is one of the crucial challenges. DNA is considered a blueprint for the cell. It contained all necessary information for building and maintaining the trait of an organism. It is DNA, which makes a living thing, a living thing. Protein interaction with DNA performs an essential role in regulating DNA functions such as DNA repair, transcription, and regulation. Identification of these proteins is a crucial task for understanding the regulation of genes. Several methods have been developed to identify the binding sites of DNA and protein depending upon the structures and sequences, but they were costly and time-consuming. Therefore, we propose a methodology named "DNAPred_Prot", which uses various position and frequency-dependent features from protein sequences for efficient and effective prediction of DNA-binding proteins. Using testing techniques like 10-fold cross-validation and jackknife testing an accuracy of 94.95% and 95.11% was yielded, respectively. The results of SVM and ANN were also compared with those of a random forest classifier. The robustness of the proposed model was evaluated by using the independent dataset PDB186, and an accuracy of 91.47% was achieved by it. From these results, it can be predicted that the suggested methodology performs better than other extant methods for the identification of DNA-binding proteins.
Keyphrases
  • circulating tumor
  • cell free
  • single molecule
  • dna repair
  • nucleic acid
  • healthcare
  • stem cells
  • genome wide
  • climate change
  • social media
  • dna methylation
  • oxidative stress
  • single cell
  • neural network