MARPPI: boosting prediction of protein-protein interactions with multi-scale architecture residual network.
Xue Li, Peifu Han, Wenqi Chen, Changnan Gao, Shuang Wang, Tao Song, Muyuan Niu, Alfonso Rodriguez-Patón
Published in: Briefings in Bioinformatics (2023)
Protein-protein interactions (PPIs) are a major component of the cellular biochemical reaction network. Abundant sequence data and machine learning techniques reduce the reliance of PPI discovery on wet-lab experiments, which are costly and time-consuming. This paper proposes a PPI prediction model, the multi-scale architecture residual network for PPIs (MARPPI), built on a dual-channel, multi-feature design. The multi-feature component uses Res2vec to capture association information between residues, and uses pseudo amino acid composition, autocorrelation descriptors and multivariate mutual information to capture amino acid composition and order information, physicochemical properties and information entropy, respectively. The dual-channel component uses a ResNet improved with a multi-scale architecture to extract protein sequence features while reducing feature loss. In comparisons with other advanced methods, MARPPI achieves 96.03%, 99.01% and 91.80% accuracy on the intraspecific datasets of Saccharomyces cerevisiae, Human and Helicobacter pylori, respectively. On the two interspecific datasets, Human-Bacillus anthracis and Human-Yersinia pestis, the accuracy is 97.29% and 95.30%, respectively. In addition, results on disease-specific datasets (neurodegenerative and metabolic disorders) demonstrate its ability to detect hidden interactions. Evaluations on independent datasets and PPI networks further suggest that MARPPI can be used to predict cross-species interactions. Together, these results indicate that MARPPI is a concise, efficient and accurate tool for PPI prediction.
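To make the multi-feature, dual-channel idea more concrete, the following is a minimal sketch of how two of the sequence descriptors mentioned above might be computed and kept separate per protein. It is not the authors' implementation. Several things here are assumptions made for illustration: plain amino acid composition stands in for the pseudo amino acid composition, the autocorrelation descriptor is a simple auto-covariance over a single property (the Kyte-Doolittle hydropathy index), and the function names (`amino_acid_composition`, `auto_covariance`, `encode_protein`, `encode_pair`) and the `max_lag` parameter are hypothetical. Res2vec, multivariate mutual information and the residual network itself are omitted.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Kyte-Doolittle hydropathy index, used here as one example physicochemical
# property (an assumption; the abstract does not name the properties used).
KD_HYDROPATHY = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def amino_acid_composition(seq):
    """Fraction of each of the 20 standard residues in seq (20 values)."""
    counts = Counter(seq)
    return [counts[a] / len(seq) for a in AMINO_ACIDS]


def auto_covariance(seq, max_lag=3, scale=KD_HYDROPATHY):
    """Auto-covariance of one residue property at lags 1..max_lag.

    Assumes seq contains only the 20 standard one-letter residue codes.
    """
    if len(seq) <= max_lag:
        raise ValueError("sequence must be longer than max_lag")
    values = [scale[r] for r in seq]
    n = len(values)
    mean = sum(values) / n
    centered = [v - mean for v in values]
    return [
        sum(centered[i] * centered[i + lag] for i in range(n - lag)) / (n - lag)
        for lag in range(1, max_lag + 1)
    ]


def encode_protein(seq, max_lag=3):
    """Concatenate the descriptors into one feature vector for a protein."""
    return amino_acid_composition(seq) + auto_covariance(seq, max_lag)


def encode_pair(seq_a, seq_b, max_lag=3):
    """Encode each protein separately, one vector per channel."""
    return encode_protein(seq_a, max_lag), encode_protein(seq_b, max_lag)
```

In a dual-channel setup, the two vectors returned by `encode_pair` would each be fed to their own feature extractor before the outputs are combined for the interaction prediction; how MARPPI does this in detail is described in the paper, not in this sketch.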
Keyphrases
- protein protein
- amino acid
- machine learning
- helicobacter pylori
- endothelial cells
- small molecule
- rna seq
- health information
- saccharomyces cerevisiae
- induced pluripotent stem cells
- deep learning
- pluripotent stem cells
- artificial intelligence
- healthcare
- high resolution
- helicobacter pylori infection
- social media
- big data
- binding protein