Login / Signup

SubLocEP: a novel ensemble predictor of subcellular localization of eukaryotic mRNA based on machine learning.

Jing LiLichao ZhangShida HeFei GuoGuishen Wang
Published in: Briefings in bioinformatics (2021)
In this paper, SubLocEP is proposed as a two-layer integrated prediction model for accurate prediction of the location of sequence samples. Unlike the existing models based on limited features, SubLocEP comprehensively considers additional feature attributes and is combined with LightGBM to generated single feature classifiers. The initial integration model (single-layer model) is generated according to the categories of a feature. Subsequently, two single-layer integration models are weighted (sequence-based: physicochemical properties = 3:2) to produce the final two-layer model. The performance of SubLocEP on independent datasets is sufficient to indicate that SubLocEP is an accurate and stable prediction model with strong generalization ability. Additionally, an online tool has been developed that contains experimental data and can maximize the user convenience for estimation of subcellular localization of eukaryotic mRNA.
Keyphrases
  • machine learning
  • deep learning
  • big data
  • artificial intelligence
  • magnetic resonance
  • magnetic resonance imaging
  • mass spectrometry
  • electronic health record