Login / Signup

Performance improvement for a 2D convolutional neural network by using SSC encoding on protein-protein interaction tasks.

Yang WangZhanchao LiYanfei ZhangYingjun MaQixing HuangXingyu ChenZong DaiXiaoyong Zou
Published in: BMC bioinformatics (2021)
The proposed protein sequence encoding method is efficient at improving the capability of the CNN model on protein sequence-related tasks and may also be effective at enhancing the capability of other machine learning or deep learning methods. Prediction accuracy and molecular docking validation showed considerable improvement compared to the existing hot encoding method, indicating that the SSC encoding method may be useful for analyzing protein sequence-related tasks. The source code of the proposed methods is freely available for academic research at https://github.com/wangy496/SSC-format/ .
Keyphrases
  • protein protein
  • convolutional neural network
  • deep learning
  • molecular docking
  • small molecule
  • machine learning
  • amino acid
  • working memory
  • artificial intelligence
  • molecular dynamics simulations