Learning the language of viral evolution and escape.
Brian L HieEllen D ZhongBonnie BergerBryan D BrysonPublished in: Science (New York, N.Y.) (2021)
The ability for viruses to mutate and evade the human immune system and cause infection, called viral escape, remains an obstacle to antiviral and vaccine development. Understanding the complex rules that govern escape could inform therapeutic design. We modeled viral escape with machine learning algorithms originally developed for human natural language. We identified escape mutations as those that preserve viral infectivity but cause a virus to look different to the immune system, akin to word changes that preserve a sentence's grammaticality but change its meaning. With this approach, language models of influenza hemagglutinin, HIV-1 envelope glycoprotein (HIV Env), and severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) Spike viral proteins can accurately predict structural escape patterns using sequence data alone. Our study represents a promising conceptual bridge between natural language and viral evolution.
Keyphrases
- sars cov
- respiratory syndrome coronavirus
- machine learning
- autism spectrum disorder
- endothelial cells
- antiretroviral therapy
- hepatitis c virus
- hiv infected
- hiv positive
- human immunodeficiency virus
- artificial intelligence
- induced pluripotent stem cells
- deep learning
- big data
- coronavirus disease
- palliative care
- south africa
- pluripotent stem cells
- electronic health record
- advanced cancer