Strain tracking in complex microbiomes using synteny analysis reveals per-species modes of evolution.
Hagay EnavInbal PazRuth E LeyPublished in: Nature biotechnology (2024)
Microbial species diversify into strains through single-nucleotide mutations and structural changes, such as recombination, insertions and deletions. Most strain-comparison methods quantify differences in single-nucleotide polymorphisms (SNPs) and are insensitive to structural changes. However, recombination is an important driver of phenotypic diversification in many species, including human pathogens. We introduce SynTracker, a tool that compares microbial strains using genome synteny-the order of sequence blocks in homologous genomic regions-in pairs of metagenomic assemblies or genomes. Genome synteny is a rich source of genomic information untapped by current strain-comparison tools. SynTracker has low sensitivity to SNPs, has no database requirement and is robust to sequencing errors. It outperforms existing tools when tracking strains in metagenomic data and is particularly suited for phages, plasmids and other low-data contexts. Applied to single-species datasets and human gut metagenomes, SynTracker, combined with an SNP-based tool, detects strains enriched in either point mutations or structural changes, providing insights into microbial evolution in situ.
Keyphrases
- escherichia coli
- genome wide
- microbial community
- endothelial cells
- dna damage
- dna repair
- genetic diversity
- electronic health record
- copy number
- induced pluripotent stem cells
- big data
- antibiotic resistance genes
- healthcare
- pluripotent stem cells
- dna methylation
- oxidative stress
- gene expression
- machine learning
- deep learning
- social media
- artificial intelligence
- patient safety
- high density
- drug induced