Direct identification of A-to-I editing sites with nanopore native RNA sequencing.
Tram Anh NguyenJia Wei Joel HengPornchai KaewsapsakEng Piew Louis KokDominik StanojevićHao LiuAngelysia CardillaAlbert PradityaZirong YiMingwan LinJong Ghut Ashley AwYin Ying HoEsther Kai Lay PehYuanming WangQixing ZhongJacki E Heraud-FarlowShifeng XueBruno ReversadeCarl R WalkleyYing Swan HoMile ŠikićYue WanMeng How TanPublished in: Nature methods (2022)
Inosine is a prevalent RNA modification in animals and is formed when an adenosine is deaminated by the ADAR family of enzymes. Traditionally, inosines are identified indirectly as variants from Illumina RNA-sequencing data because they are interpreted as guanosines by cellular machineries. However, this indirect method performs poorly in protein-coding regions where exons are typically short, in non-model organisms with sparsely annotated single-nucleotide polymorphisms, or in disease contexts where unknown DNA mutations are pervasive. Here, we show that Oxford Nanopore direct RNA sequencing can be used to identify inosine-containing sites in native transcriptomes with high accuracy. We trained convolutional neural network models to distinguish inosine from adenosine and guanosine, and to estimate the modification rate at each editing site. Furthermore, we demonstrated their utility on the transcriptomes of human, mouse and Xenopus. Our approach expands the toolkit for studying adenosine-to-inosine editing and can be further extended to investigate other RNA modifications.