Simple and accurate genomic classification model for distinguishing between human and pig Staphylococcus aureus.
Huiliu ZhouWenyin DuDejia OuyangYuehe LiYajie GongZhenjiang YaoMinghao ZhongXinguang ZhongZhenjiang YaoPublished in: Communications biology (2024)
Staphylococcus aureus (S. aureus) can cause various infections in humans and animals, contributing to high morbidity and mortality. To prevent and control cross-species transmission of S. aureus, it is necessary to understand the host-associated genetic variants. We performed a two-stage genome-wide association study (GWAS) including initial screening and further validation to compare genomic differences between human and pig S. aureus, aiming to identify host-associated determinants. Our multiple GWAS analyses found six consensus significant k-mers associated with host species, providing novel genetic evidence for distinguishing human from pig S. aureus. The best k-mer predictor achieved a high classification accuracy of 98.12% on its own and had extremely high resolution similar to the SNPs-based phylogeny, offering a very simple target for predicting the cross-species transmission risk of S. aureus. The final k-mer model revealed that 90% of S. aureus isolates from farm workers were predicted as livestock origin, suggesting a high risk of cross-species transmission. Bayesian inference revealed different cross-species transmission directions, with the human-to-pig transmission for ST5 and the pig-to-human transmission for ST398. Our findings provide a simple and accurate k-mer model for identifying and predicting the cross-species transmission risk of S. aureus.
Keyphrases
- endothelial cells
- staphylococcus aureus
- high resolution
- induced pluripotent stem cells
- machine learning
- genome wide association study
- single cell
- deep learning
- mass spectrometry
- sars cov
- gene expression
- escherichia coli
- candida albicans
- pseudomonas aeruginosa
- respiratory syndrome coronavirus
- genetic diversity
- tandem mass spectrometry
- clinical evaluation