Accelerating bioactive peptide discovery via mutual information-based meta-learning.

Wenjia He, Yi Jiang, Junru Jin, Zhongshen Li, Jiaojiao Zhao, Balachandran Manavalan, Ran Su, Xin Gao, Le-Yi Wei
Published in: Briefings in Bioinformatics (2021)
Recently, machine learning methods have been developed to identify various peptide bioactivities. However, because experimentally validated peptides are scarce, these methods often cannot be trained sufficiently, which easily results in poor generalizability. Furthermore, there is no generic computational framework for predicting the bioactivities of different peptides. Thus, a natural question is whether limited samples can be used to build an effective predictive model for different kinds of peptides. To address this question, we propose Mutual Information Maximization Meta-Learning (MIMML), a novel meta-learning-based predictive model for bioactive peptide discovery. Using few samples from various functional peptides, MIMML can learn the discriminative information among the functions and characterize their differences. Experimental results show excellent performance of MIMML despite using far fewer training samples than state-of-the-art methods. We also decipher the latent relationships among different kinds of functions to understand what the meta-model learned to improve a specific task. In summary, this study is a pioneering work in the field of functional peptide mining and provides a first-of-its-kind solution for few-sample learning problems in biological sequence analysis, accelerating the discovery of new functional peptides. The source code and datasets are available at https://github.com/TearsWaiting/MIMML.
Keyphrases
  • machine learning
  • small molecule
  • high throughput
  • amino acid
  • mental health
  • artificial intelligence
  • big data
  • social media
  • virtual reality
  • data analysis