Pangenome databases improve host removal and mycobacteria classification from clinical metagenomic data.
Michael B Hall, Lachlan J M Coin. Published in: GigaScience (2024)
Compared with standard databases, customized pangenome databases provide the best balance of accuracy and computational efficiency for human read removal and M. tuberculosis read classification from metagenomic samples. Such databases can be run on a laptop without sacrificing accuracy, an especially important consideration in low-resource settings. We make all customized databases and pipelines freely available.