miRPreM and tiRPreM: Improved methodologies for the prediction of miRNAs and tRNA-induced small non-coding RNAs for model and non-model organisms.
Hukam Chand RawalShakir AliTapan Kumar MondalPublished in: Briefings in bioinformatics (2021)
In recent years, microRNAs (miRNAs) and tRNA-derived RNA fragments (tRFs) have been reported extensively following different approaches of identification and analysis. Comprehensively analyzing the present approaches to overcome the existing variations, we developed a benchmarking methodology each for the identification of miRNAs and tRFs, termed as miRNA Prediction Methodology (miRPreM) and tRNA-induced small non-coding RNA Prediction Methodology (tiRPreM), respectively. We emphasized the use of respective genome of organism under study for mapping reads, sample data with at least two biological replicates, normalized read count support and novel miRNA prediction by two standard tools with multiple runs. The performance of these methodologies was evaluated by using Oryza coarctata, a wild rice species as a case study for model and non-model organisms. With organism-specific reference genome approach, 98 miRNAs and 60 tRFs were exclusively found. We observed high accuracy (13 out of 15) when tested these genome-specific miRNAs in support of analyzing the data with respective organism. Such a strong impact of miRPreM, we have predicted more than double number of miRNAs (186) as compared with the traditional approaches (79) and with tiRPreM, we have predicted all known classes of tRFs within the same small RNA data. Moreover, the methodologies presented here are in standard form in order to extend its applicability to different organisms rather than restricting to plants. Hence, miRPreM and tiRPreM can fulfill the need of a comprehensive methodology for miRNA prediction and tRF identification, respectively, for model and non-model organisms.