StrainGE: a toolkit to track and characterize low-abundance strains in complex microbial communities.
Lucas R van Dijk, Bruce J Walker, Timothy J Straub, Colin J Worby, Alexandra Grote, Henry L Schreiber, Christine Anyansi, Amy J Pickering, Scott J Hultgren, Abigail L Manson, Thomas Abeel, Ashlee M Earl
Published in: Genome Biology (2022)
Human-associated microbial communities comprise not only complex mixtures of bacterial species but also mixtures of conspecific strains. The implications of this strain-level diversity are mostly unknown, because strain-level dynamics are difficult to study and remain underexplored. We introduce the Strain Genome Explorer (StrainGE) toolkit, which deconvolves strain mixtures and characterizes component strains at the nucleotide level from short-read metagenomic sequencing data, with higher sensitivity and resolution than other tools. StrainGE can identify strains at 0.1x coverage and detect variants for multiple conspecific strains within a sample at coverages as low as 0.5x.
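To give a sense of scale for the coverage figures quoted in the abstract, the following is a minimal sketch of the standard mean-coverage arithmetic (total sequenced bases divided by genome length). It is not part of StrainGE. The function names, the 5 Mb genome length, and the 150 bp read length are illustrative assumptions, not values taken from the paper.

```python
import math


def mean_coverage(num_reads: int, read_length: int, genome_length: int) -> float:
    """Mean depth of coverage: total sequenced bases divided by genome length."""
    return num_reads * read_length / genome_length


def reads_for_coverage(coverage: float, read_length: int, genome_length: int) -> int:
    """Smallest number of reads whose total bases reach the requested mean coverage."""
    return math.ceil(coverage * genome_length / read_length)


# Assumed, illustrative values (not from the paper):
GENOME_LENGTH = 5_000_000  # a hypothetical 5 Mb bacterial genome
READ_LENGTH = 150          # a hypothetical short-read length in bp

for target in (0.1, 0.5):
    n = reads_for_coverage(target, READ_LENGTH, GENOME_LENGTH)
    achieved = mean_coverage(n, READ_LENGTH, GENOME_LENGTH)
    print(f"{target}x coverage needs about {n} reads (achieved {achieved:.4f}x)")
```

Under these assumptions, a strain at 0.1x coverage is represented by only a few thousand reads from that genome, which illustrates why detection at such depths is demanding.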