DeepPRMS: advanced deep learning model to predict protein arginine methylation sites.
Monika KhandelwalRanjeet Kumar RoutPublished in: Briefings in functional genomics (2024)
Protein methylation is a form of post-translational modifications of protein, which is crucial for various cellular processes, including transcription activity and DNA repair. Correctly predicting protein methylation sites is fundamental for research and drug discovery. Some experimental techniques, such as methyl-specific antibodies, chromatin immune precipitation and mass spectrometry, exist for predicting protein methylation sites, but these techniques are time-consuming and costly. The ability to predict methylation sites using in silico techniques may help researchers identify potential candidate sites for future examination and make it easier to carry out site-specific investigations and downstream characterizations. In this research, we proposed a novel deep learning-based predictor, named DeepPRMS, to identify protein methylation sites in primary sequences. The DeepPRMS utilizes the gated recurrent unit (GRU) and convolutional neural network (CNN) algorithms to extract the sequential and spatial information from the primary sequences. GRU is used to extract sequential information, while CNN is used for spatial information. We combined the latent representation of GRU and CNN models to have a better interaction among them. Based on the independent test data set, DeepPRMS obtained an accuracy of 85.32%, a specificity of 84.94%, Matthew's correlation coefficient of 0.71 and a sensitivity of 85.80%. The results indicate that DeepPRMS can predict protein methylation sites with high accuracy and outperform the state-of-the-art models. The DeepPRMS is expected to effectively guide future research experiments for identifying potential methylated protein sites. The web server is available at http://deepprms.nitsri.ac.in/.
Keyphrases
- deep learning
- convolutional neural network
- genome wide
- protein protein
- dna methylation
- mass spectrometry
- amino acid
- dna repair
- oxidative stress
- binding protein
- drug discovery
- machine learning
- magnetic resonance imaging
- artificial intelligence
- healthcare
- small molecule
- transcription factor
- high resolution
- computed tomography
- risk assessment
- current status
- social media
- molecular dynamics simulations
- electronic health record
- data analysis
- simultaneous determination
- diffusion weighted imaging
- capillary electrophoresis