Direct identification of A-to-I editing sites with nanopore native RNA sequencing.

Tram Anh Nguyen, Jia Wei Joel Heng, Pornchai Kaewsapsak, Eng Piew Louis Kok, Dominik Stanojević, Hao Liu, Angelysia Cardilla, Albert Praditya, Zirong Yi, Mingwan Lin, Jong Ghut Ashley Aw, Yin Ying Ho, Esther Kai Lay Peh, Yuanming Wang, Qixing Zhong, Jacki E Heraud-Farlow, Shifeng Xue, Bruno Reversade, Carl R Walkley, Ying Swan Ho, Mile Šikić, Yue Wan, Meng How Tan

Published in: Nature Methods (2022)

Inosine is a prevalent RNA modification in animals and is formed when an adenosine is deaminated by the ADAR family of enzymes. Traditionally, inosines are identified indirectly as variants from Illumina RNA-sequencing data because they are interpreted as guanosines by cellular machineries. However, this indirect method performs poorly in protein-coding regions where exons are typically short, in non-model organisms with sparsely annotated single-nucleotide polymorphisms, or in disease contexts where unknown DNA mutations are pervasive. Here, we show that Oxford Nanopore direct RNA sequencing can be used to identify inosine-containing sites in native transcriptomes with high accuracy. We trained convolutional neural network models to distinguish inosine from adenosine and guanosine, and to estimate the modification rate at each editing site. Furthermore, we demonstrated their utility on the transcriptomes of human, mouse and Xenopus. Our approach expands the toolkit for studying adenosine-to-inosine editing and can be further extended to investigate other RNA modifications.
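To make the indirect approach mentioned above concrete, the following is a minimal sketch of how an editing level could be estimated from short-read data at a single reference-adenosine position: the fraction of reads that show G instead of A. It is an illustration only, not the authors' pipeline or their neural network models; the function name `a_to_g_fraction` and the example pileup are hypothetical.

```python
from collections import Counter

def a_to_g_fraction(bases_at_site):
    """Return the fraction of A/G reads that show G at a reference-A site.

    bases_at_site: iterable of single-letter base calls, one per read
    covering the position. Calls other than A or G are ignored.
    Returns None when no read shows A or G.
    """
    counts = Counter(base.upper() for base in bases_at_site)
    covered = counts["A"] + counts["G"]
    if covered == 0:
        return None
    return counts["G"] / covered

# Hypothetical pileup at one site: 7 reads call A, 3 reads call G.
pileup = list("AAAGGAAGAA")
rate = a_to_g_fraction(pileup)
```

A heterozygous genomic A/G polymorphism would produce the same kind of signal as genuine editing, which is why this indirect estimate depends on good single-nucleotide polymorphism annotation and why direct detection on native RNA is attractive.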
