STRONG: metagenomics strain resolution on assembly graphs.

Christopher Quince, Sergey Nurk, Sebastien Raguideau, Robert James, Orkun S Soyer, J Kimberly Summers, Antoine Limasset, A Murat Eren, Rayan Chikhi, Aaron E Darling
Published in: Genome biology (2021)
We introduce STrain Resolution ON assembly Graphs (STRONG), which identifies strains de novo from multiple metagenome samples. STRONG performs coassembly and binning into metagenome-assembled genomes (MAGs), and stores the coassembly graph prior to variant simplification. This allows the subgraphs for individual single-copy core genes (SCGs) in each MAG to be extracted, together with the per-sample coverages of their unitigs. A Bayesian algorithm, BayesPaths, then determines the number of strains present, their haplotypes (sequences on the SCGs), and their abundances. STRONG is validated using synthetic communities and, for a real anaerobic digestor time series, generates haplotypes that match those observed from long Nanopore reads.
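To make the relationship between unitig coverages, haplotypes, and abundances concrete, the following is a minimal sketch of a much simpler problem than the one BayesPaths solves. It assumes the strain haplotypes are already known as 0/1 paths over the unitigs of an SCG subgraph, and recovers only the per-sample abundances by ordinary least squares. This is not the BayesPaths algorithm, which infers the number of strains, their paths, and their abundances jointly; the function name `estimate_strain_abundances` and all numbers below are hypothetical and chosen purely for illustration.

```python
import numpy as np


def estimate_strain_abundances(unitig_coverage, strain_paths):
    """Least-squares abundances for known strain paths (illustrative only).

    unitig_coverage: array of shape (n_unitigs, n_samples), per-sample coverage.
    strain_paths:    array of shape (n_unitigs, n_strains), 1 if the strain's
                     haplotype traverses the unitig, else 0.
    Returns an array of shape (n_strains, n_samples), clipped at zero.
    """
    abundances, *_ = np.linalg.lstsq(strain_paths, unitig_coverage, rcond=None)
    return np.clip(abundances, 0.0, None)


# Toy SCG subgraph: 4 unitigs, 2 strains, 3 samples (all values made up).
strain_paths = np.array(
    [
        [1, 1],  # unitig shared by both strains
        [1, 0],  # variant unitig carried only by strain 0
        [0, 1],  # variant unitig carried only by strain 1
        [1, 1],  # unitig shared by both strains
    ],
    dtype=float,
)
true_abundances = np.array(
    [
        [10.0, 2.0, 5.0],  # strain 0 coverage depth in samples 0..2
        [3.0, 8.0, 5.0],   # strain 1 coverage depth in samples 0..2
    ]
)

# Noise-free coverage: each unitig's depth is the sum over strains crossing it.
unitig_coverage = strain_paths @ true_abundances

estimated = estimate_strain_abundances(unitig_coverage, strain_paths)
print(estimated)
```

The sketch shows why multiple samples help: strains whose relative abundances vary across samples leave distinct coverage signatures on the variant unitigs, and these signatures are what allow the paths to be separated.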
Keyphrases
  • single molecule
  • escherichia coli
  • genome wide
  • microbial community
  • machine learning
  • wastewater treatment
  • neural network
  • solid state
  • bioinformatics analysis