Performance improvement for a 2D convolutional neural network by using SSC encoding on protein-protein interaction tasks.
Yang WangZhanchao LiYanfei ZhangYingjun MaQixing HuangXingyu ChenZong DaiXiaoyong ZouPublished in: BMC bioinformatics (2021)
The proposed protein sequence encoding method is efficient at improving the capability of the CNN model on protein sequence-related tasks and may also be effective at enhancing the capability of other machine learning or deep learning methods. Prediction accuracy and molecular docking validation showed considerable improvement compared to the existing hot encoding method, indicating that the SSC encoding method may be useful for analyzing protein sequence-related tasks. The source code of the proposed methods is freely available for academic research at https://github.com/wangy496/SSC-format/ .