Oligonucleotide Sequence Mapping of Large Therapeutic mRNAs via Parallel Ribonuclease Digestions and LC-MS/MS.
Tao JiangNingxi YuJaeah KimJohn-Ross MurgoMildred KissaiKanchana R RavichandranEdward J MiraccoVladimir PresnyakSerenus HuaPublished in: Analytical chemistry (2019)
Characterization of mRNA sequences is a critical aspect of mRNA drug development and regulatory filing. Herein, we developed a novel bottom-up oligonucleotide sequence mapping workflow combining multiple endonucleases that cleave mRNA at different frequencies. RNase T1, colicin E5, and mazF were applied in parallel to provide complementary sequence coverage for large mRNAs. Combined use of multiple endonucleases resulted in significantly improved sequence coverage: greater than 70% sequence coverage was achieved on mRNAs near 3000 nucleotides long. Oligonucleotide mapping simulations with large human RNA databases demonstrate that the proposed workflow can positively identify a single correct sequence from hundreds of similarly sized sequences. In addition, the workflow is sensitive and specific enough to detect minor sequence impurities such as single nucleotide polymorphisms (SNPs) with a sensitivity of less than 1%. LC-MS/MS-based oligonucleotide sequence mapping can serve as an orthogonal sequence characterization method to techniques such as Sanger sequencing or next-generation sequencing (NGS), providing high-throughput sequence identification and sensitive impurity detection.