SiameseCPP: a sequence-based Siamese network to predict cell-penetrating peptides by contrastive learning.
Xin ZhangLesong WeiXiucai YeKai ZhangSaisai TengZhongshen LiJunru JinMin Jae KimTetsuya SakuraiLizhen CuiBalachandran ManavalanLe-Yi WeiPublished in: Briefings in bioinformatics (2023)
In this study, we present SiameseCPP, a novel deep learning framework for automated CPPs prediction. SiameseCPP learns discriminative representations of CPPs based on a well-pretrained model and a Siamese neural network consisting of a transformer and gated recurrent units. Contrastive learning is used for the first time to build a CPP predictive model. Comprehensive experiments demonstrate that our proposed SiameseCPP is superior to existing baseline models for predicting CPPs. Moreover, SiameseCPP also achieves good performance on other functional peptide datasets, exhibiting satisfactory generalization ability.