Login / Signup

PortPred: Exploiting deep learning embeddings of amino acid sequences for the identification of transporter proteins and their substrates.

Marco AnteghiniVitor Ap Martins Dos SantosEdoardo Saccenti
Published in: Journal of cellular biochemistry (2023)
The physiology of every living cell is regulated at some level by transporter proteins which constitute a relevant portion of membrane-bound proteins and are involved in the movement of ions, small and macromolecules across bio-membranes. The importance of transporter proteins is unquestionable. The prediction and study of previously unknown transporters can lead to the discovery of new biological pathways, drugs and treatments. Here we present PortPred, a tool to accurately identify transporter proteins and their substrate starting from the protein amino acid sequence. PortPred successfully combines pre-trained deep learning-based protein embeddings and machine learning classification approaches and outperforms other state-of-the-art methods. In addition, we present a comparison of the most promising protein sequence embeddings (Unirep, SeqVec, ProteinBERT, ESM-1b) and their performances for this specific task.
Keyphrases
  • amino acid
  • deep learning
  • machine learning
  • protein protein
  • small molecule
  • single cell
  • stem cells
  • convolutional neural network
  • transcription factor
  • cell therapy
  • resistance training
  • high throughput
  • high intensity