tRForest: a novel random forest-based algorithm for tRNA-derived fragment target prediction.

Rohan Parikh, Briana Wilson, Laine Marrah, Zhangli Su, Shekhar Saha, Pankaj Kumar, Fenix Huang, Anindya Dutta
Published in: NAR genomics and bioinformatics (2022)
tRNA fragments (tRFs) are small RNAs comparable in size and function to miRNAs. tRFs are generally Dicer-independent, are found associated with Ago, and can repress gene expression post-transcriptionally. Given that this expands the repertoire of small RNAs capable of post-transcriptional regulation of gene expression, it is important to predict tRF targets with confidence. Some attempts have been made to predict tRF targets, but they are limited either in the scope of tRF classes used in prediction or in feature selection. We hypothesized that established miRNA target prediction features, applied to tRFs through a random forest machine learning algorithm, would markedly improve tRF target prediction. Using this approach, we show significant improvements in tRF target prediction for all classes of tRFs and validate our predictions in two independent cell lines. Finally, Gene Ontology analysis suggests that, among the tRFs conserved between mice and humans, the predicted targets are significantly enriched in neuronal function, and we show this specifically for tRF-3009a. These improvements to tRF target prediction further our understanding of tRF function broadly across species and provide avenues for testing novel roles for tRFs in biology. We have created a publicly available website for the targets of tRFs predicted by tRForest.
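To make the general idea concrete, the following is a minimal sketch of a random forest classifier applied to candidate tRF-target pairs. It is not the tRForest implementation: the abstract does not list the features or training data used, so the three feature names (seed match length, binding energy, AU content), the toy values, and every function name below are assumptions chosen only for illustration. The sketch uses the Python standard library alone; in practice an established implementation such as scikit-learn's `RandomForestClassifier` would be the more usual choice.

```python
import random
from collections import Counter

def gini(labels):
    """Gini impurity of a non-empty list of 0/1 labels."""
    p = sum(labels) / len(labels)
    return 2 * p * (1 - p)

def best_split(rows, labels, feature_ids):
    """Return the (feature, threshold) with the lowest weighted Gini, or None."""
    best, best_score = None, gini(labels)
    n = len(labels)
    for f in feature_ids:
        for t in sorted({row[f] for row in rows}):
            left = [y for row, y in zip(rows, labels) if row[f] <= t]
            right = [y for row, y in zip(rows, labels) if row[f] > t]
            if not left or not right:
                continue
            score = (len(left) * gini(left) + len(right) * gini(right)) / n
            if score < best_score:
                best, best_score = (f, t), score
    return best

def build_tree(rows, labels, n_sub, rng, depth=0, max_depth=4):
    """Grow one tree; each node considers a random subset of n_sub features."""
    majority = Counter(labels).most_common(1)[0][0]
    if depth >= max_depth or len(set(labels)) == 1:
        return majority  # leaf: a class label
    split = best_split(rows, labels, rng.sample(range(len(rows[0])), n_sub))
    if split is None:
        return majority
    f, t = split
    left = [(r, y) for r, y in zip(rows, labels) if r[f] <= t]
    right = [(r, y) for r, y in zip(rows, labels) if r[f] > t]
    return (f, t,
            build_tree([r for r, _ in left], [y for _, y in left],
                       n_sub, rng, depth + 1, max_depth),
            build_tree([r for r, _ in right], [y for _, y in right],
                       n_sub, rng, depth + 1, max_depth))

def predict_tree(node, row):
    """Walk from the root to a leaf and return its class label."""
    while isinstance(node, tuple):
        f, t, left, right = node
        node = left if row[f] <= t else right
    return node

def train_forest(rows, labels, n_trees=25, seed=0):
    """Train n_trees trees, each on a bootstrap sample of the data."""
    rng = random.Random(seed)
    n = len(rows)
    n_sub = max(1, int(len(rows[0]) ** 0.5))  # about sqrt(number of features)
    forest = []
    for _ in range(n_trees):
        idx = [rng.randrange(n) for _ in range(n)]  # sample with replacement
        forest.append(build_tree([rows[i] for i in idx],
                                 [labels[i] for i in idx], n_sub, rng))
    return forest

def predict_forest(forest, row):
    """Majority vote over all trees."""
    votes = Counter(predict_tree(tree, row) for tree in forest)
    return votes.most_common(1)[0][0]

# Invented toy data, NOT from the paper. Hypothetical columns:
# [seed match length (nt), binding energy (kcal/mol), AU content (fraction)]
X = [
    [8, -24.0, 0.70], [7, -22.5, 0.65], [8, -25.0, 0.75],
    [7, -21.0, 0.60], [8, -23.0, 0.72], [7, -20.0, 0.68],
    [3, -8.0, 0.40], [4, -10.0, 0.35], [2, -6.0, 0.30],
    [3, -9.0, 0.42], [4, -7.5, 0.38], [2, -5.0, 0.33],
]
y = [1] * 6 + [0] * 6  # 1 = target, 0 = non-target

forest = train_forest(X, y)
```

Calling `predict_forest(forest, [8, -26.0, 0.80])` then classifies a new, equally hypothetical candidate pair by majority vote. Each tree sees a different bootstrap sample and a random feature subset at every split, which is what distinguishes a random forest from a single decision tree.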
Keyphrases
  • machine learning
  • gene expression
  • deep learning
  • genome wide
  • transcription factor
  • dna methylation
  • type diabetes
  • blood brain barrier
  • subarachnoid hemorrhage
  • copy number
  • data analysis
  • high throughput sequencing