Login / Signup

Optimized Quantification of Intrahost Viral Diversity in SARS-CoV-2 and Influenza Virus Sequence Data.

A E RoderKee JohnsonM KnollM KhalfanB WangStacey Schultz-CherryS BanakisA KreitmanC MederosJ-H YounR MercadoW WangDenis RuchnewitzM I SamanovicM J MulliganM LassigM ÅukszaS DasD GreshamElodie Ghedin
Published in: bioRxiv : the preprint server for biology (2022)
When viruses replicate inside a host, the virus replication machinery makes mistakes. Over time, these mistakes create mutations that result in a diverse population of viruses inside the host. Mutations that are neither lethal to the virus, nor strongly beneficial, can lead to minority variants that are minor members of the virus population. However, preparing samples for sequencing can also introduce errors that resemble minority variants, resulting in inclusion of false positive data if not filtered correctly. In this study, we aimed to determine the best methods for identification and quantification of these minority variants by testing the performance of seven commonly used variant calling tools. We used simulated and synthetic data to test their performance against a true set of variants, and then used these studies to inform variant identification in data from clinical SARS-CoV-2 clinical specimens. Together, analyses of our data provide extensive guidance for future studies of viral diversity and evolution.
Keyphrases