BugSplit enables genome-resolved metagenomics through highly accurate taxonomic binning of metagenomic assemblies.
Induja ChandrakumarNick P G GauthierCassidy NelsonMichael B BonsallKerstin LocherMarthe CharlesClayton MacDonaldMel KrajdenAmee R MangesSamuel D ChorltonPublished in: Communications biology (2022)
A large gap remains between sequencing a microbial community and characterizing all of the organisms inside of it. Here we develop a novel method to taxonomically bin metagenomic assemblies through alignment of contigs against a reference database. We show that this workflow, BugSplit, bins metagenome-assembled contigs to species with a 33% absolute improvement in F1-score when compared to alternative tools. We perform nanopore mNGS on patients with COVID-19, and using a reference database predating COVID-19, demonstrate that BugSplit's taxonomic binning enables sensitive and specific detection of a novel coronavirus not possible with other approaches. When applied to nanopore mNGS data from cases of Klebsiella pneumoniae and Neisseria gonorrhoeae infection, BugSplit's taxonomic binning accurately separates pathogen sequences from those of the host and microbiota, and unlocks the possibility of sequence typing, in silico serotyping, and antimicrobial resistance prediction of each organism within a sample. BugSplit is available at https://bugseq.com/academic .
Keyphrases
- microbial community
- antibiotic resistance genes
- antimicrobial resistance
- klebsiella pneumoniae
- multidrug resistant
- single molecule
- escherichia coli
- electronic health record
- coronavirus disease
- sars cov
- genetic diversity
- adverse drug
- wastewater treatment
- solid state
- gram negative
- molecular docking
- high resolution
- single cell
- anaerobic digestion
- big data
- candida albicans
- emergency department
- medical students
- machine learning
- artificial intelligence
- quantum dots