Algorithm for Optimized mRNA Design Improves Stability and Immunogenicity.
He ZhangLiang ZhangAng LinCongcong XuZiyu LiKaibo LiuBoxiang LiuXiaopin MaFanfan ZhaoHuiling JiangChunxiu ChenHaifa ShenHangwen LiDavid H MathewsYujian ZhangLiang HuangPublished in: Nature (2023)
Messenger RNA (mRNA) vaccines are being used to contain COVID-19 (1, 2, 3), but still suffer from the critical limitation of mRNA instability and degradation, which is a major obstacle in the storage, distribution, and efficacy of the vaccine products (4). Previous work showed that increasing secondary structure lengthens mRNA half-life, which, together with optimal codons, improves protein expression (5). Therefore, a principled mRNA design algorithm must optimize both structural stability and codon usage. However, due to synonymous codons, the mRNA design space is prohibitively large (e.g., ~10 632 candidates for the SARS-CoV-2 Spike protein), which poses insurmountable computational challenges. Here we provide a simple and unexpected solution using a classical concept in computational linguistics, where finding the optimal mRNA sequence is akin to identifying the most likely sentence among similar sounding alternatives (6). Our algorithm LinearDesign takes only 11 minutes for the Spike protein, and can jointly optimize stability and codon usage. On both COVID-19 and varicella-zoster virus mRNA vaccines, LinearDesign substantially improves mRNA half-life and protein expression, and dramatically increases antibody titer by up to 128× in vivo, compared to the codon-optimization benchmark. This surprising result reveals the great potential of principled mRNA design, and enables the exploration of previously unreachable but highly stable and efficient designs. Our work is a timely tool not only for vaccines but also for mRNA medicine encoding all therapeutic proteins (e.g., monoclonal antibodies and anti-cancer drugs (7, 8)).