Login / Signup

Pandemic-scale phylogenomics reveals the SARS-CoV-2 recombination landscape.

Yatish TurakhiaBryan ThornlowAngie S HinrichsJakob McBroomeNicolas AyalaCheng YeKyle SmithNicola De MaioDavid HausslerRobert LanfearRussell B Corbett-Detig
Published in: Nature (2022)
Accurate and timely detection of recombinant lineages is crucial for interpreting genetic variation, reconstructing epidemic spread, identifying selection and variants of interest, and accurately performing phylogenetic analyses 1-4 . During the SARS-CoV-2 pandemic, genomic data generation has exceeded the capacities of existing analysis platforms, thereby crippling real-time analysis of viral evolution 5 . Here, we use a new phylogenomic method to search a nearly comprehensive SARS-CoV-2 phylogeny for recombinant lineages. In a 1.6 million sample tree from May 2021, we identify 589 recombination events, which indicate that around 2.7% of sequenced SARS-CoV-2 genomes have detectable recombinant ancestry. Recombination breakpoints are inferred to occur disproportionately in the 3' portion of the genome that contains the spike protein. Our results highlight the need for timely analyses of recombination for pinpointing the emergence of recombinant lineages with the potential to increase transmissibility or virulence of the virus. We anticipate that this approach will empower comprehensive real-time tracking of viral recombination during the SARS-CoV-2 pandemic and beyond.
Keyphrases
  • sars cov
  • dna repair
  • dna damage
  • respiratory syndrome coronavirus
  • copy number
  • gene expression
  • pseudomonas aeruginosa
  • single cell
  • high resolution
  • dna methylation
  • small molecule
  • big data