Greengenes2 unifies microbial data in a single reference tree.
Daniel McDonald, Yueyu Jiang, Metin Balaban, Kalen Cantrell, Qiyun Zhu, Antonio Gonzalez, James T Morton, Giorgia Nicolaou, Donovan H Parks, Søren M Karst, Mads Albertsen, Philip Hugenholtz, Todd DeSantis, Se Jin Song, Andrew Bartko, Aki Samuli Havulinna, Pekka Jousilahti, Susan Cheng, Michael Inouye, Teemu Niiranen, Mohit Jain, Veikko V Salomaa, Leo Lahti, Siavash Mirarab, Rob Knight
Published in: Nature Biotechnology (2023)
Studies using 16S rRNA and shotgun metagenomics typically yield different results, a discrepancy usually attributed to PCR amplification biases. We introduce Greengenes2, a reference tree that unifies genomic and 16S rRNA databases in a consistent, integrated resource. By inserting sequences into a whole-genome phylogeny, we show that 16S rRNA and shotgun metagenomic data generated from the same samples agree in principal coordinates space, taxonomy and phenotype effect size when analyzed with the same tree.