Login / Signup

Mutual information networks reveal evolutionary relationships within the influenza A virus polymerase.

Sarah ArcosAlvin X HanAartjan J W Te VelthuisColin A RussellAdam S Lauring
Published in: bioRxiv : the preprint server for biology (2023)
The influenza A (IAV) RNA polymerase is an essential driver of IAV evolution. Mutations that the polymerase introduces into viral genome segments during replication are the ultimate source of genetic variation, including within the three subunits of the IAV polymerase (PB2, PB1, and PA). Evolutionary analysis of the IAV polymerase is complicated, because changes in mutation rate, replication speed, and drug resistance involve epistatic interactions among its subunits. In order to study the evolution of the human seasonal H3N2 polymerase since the 1968 pandemic, we identified pairwise evolutionary relationships among ∼7000 H3N2 polymerase sequences using mutual information (MI), which measures the information gained about the identity of one residue when a second residue is known. To account for uneven sampling of viral sequences over time, we developed a weighted MI metric (wMI) and demonstrate that wMI outperforms raw MI through simulations using a well-sampled SARS-CoV-2 dataset. We then constructed wMI networks of the H3N2 polymerase to extend the inherently pairwise wMI statistic to encompass relationships among larger groups of residues. We included HA in the wMI network to distinguish between functional wMI relationships within the polymerase and those potentially due to hitchhiking on antigenic changes in HA. The wMI networks reveal coevolutionary relationships among residues with roles in replication and encapsidation. Inclusion of HA highlighted polymerase-only subgraphs containing residues with roles in the enzymatic functions of the polymerase and host adaptability. This work provides insight into the factors that drive and constrain the rapid evolution of influenza viruses.
Keyphrases
  • sars cov
  • structural basis
  • genome wide
  • endothelial cells
  • gene expression
  • computed tomography
  • heavy metals
  • healthcare
  • dna methylation
  • single cell
  • sensitive detection
  • pluripotent stem cells