Structure-aware deep learning model for peptide toxicity prediction.
Hossein EbrahimikondoriDarcy SutherlandAnat YanaiAmelia RichterAli SalehiChenkai LiLauren CoombeMonica KotkoffRené L WarrenInanc BirolPublished in: Protein science : a publication of the Protein Society (2024)
Antimicrobial resistance is a critical public health concern, necessitating the exploration of alternative treatments. While antimicrobial peptides (AMPs) show promise, assessing their toxicity using traditional wet lab methods is both time-consuming and costly. We introduce tAMPer, a novel multi-modal deep learning model designed to predict peptide toxicity by integrating the underlying amino acid sequence composition and the three-dimensional structure of peptides. tAMPer adopts a graph-based representation for peptides, encoding ColabFold-predicted structures, where nodes represent amino acids and edges represent spatial interactions. Structural features are extracted using graph neural networks, and recurrent neural networks capture sequential dependencies. tAMPer's performance was assessed on a publicly available protein toxicity benchmark and an AMP hemolysis data we generated. On the latter, tAMPer achieves an F1-score of 68.7%, outperforming the second-best method by 23.4%. On the protein benchmark, tAMPer exhibited an improvement of over 3.0% in the F1-score compared to current state-of-the-art methods. We anticipate tAMPer to accelerate AMP discovery and development by reducing the reliance on laborious toxicity screening experiments.
Keyphrases
- amino acid
- neural network
- deep learning
- oxidative stress
- public health
- antimicrobial resistance
- convolutional neural network
- small molecule
- machine learning
- artificial intelligence
- protein kinase
- protein protein
- squamous cell carcinoma
- high resolution
- electronic health record
- mass spectrometry
- binding protein
- single cell
- rectal cancer