Systematic evaluation of supervised machine learning for sample origin prediction using metagenomic sequencing data.
Julie Chih-Yu ChenAndrea D TylerPublished in: Biology direct (2020)
Herein, we highlight the capacity of predicting sample origin accurately with pre-trained origins and the challenge of predicting new origins through both regression and classification models. Overall, this work provides a summary of the impact of sequencing technique, protocol, taxonomic analytical approaches, and machine learning approaches on the use of metagenomics for prediction of sample origin.