Phenotype-Based Threat Assessment.
Jing YangMohammed EslamiYi-Pei ChenMayukh DasDongmei ZhangShaorong ChenAlexandria-Jade RobertsMark WestonAngelina VolkovaKasra FaghihiRobbie K MooreRobert C AlanizAlice R WattamAllan DickermanClark CucinellJarred KendziorskiSean CoburnHolly PatersonOsahon ObanorJason MaplesStephanie ServetasJennifer DootzQing-Ming QinJames E SamuelArum HanErin J van SchaikPaul de FigueiredoPublished in: Proceedings of the National Academy of Sciences of the United States of America (2022)
Bacterial pathogen identification, which is critical for human health, has historically relied on culturing organisms from clinical specimens. More recently, the application of machine learning (ML) to whole-genome sequences (WGSs) has facilitated pathogen identification. However, relying solely on genetic information to identify emerging or new pathogens is fundamentally constrained, especially if novel virulence factors exist. In addition, even WGSs with ML pipelines are unable to discern phenotypes associated with cryptic genetic loci linked to virulence. Here, we set out to determine if ML using phenotypic hallmarks of pathogenesis could assess potential pathogenic threat without using any sequence-based analysis. This approach successfully classified potential pathogenetic threat associated with previously machine-observed and unobserved bacteria with 99% and 85% accuracy, respectively. This work establishes a phenotype-based pipeline for potential pathogenic threat assessment, which we term PathEngine, and offers strategies for the identification of bacterial pathogens.
Keyphrases
- human health
- risk assessment
- antimicrobial resistance
- machine learning
- escherichia coli
- genome wide
- gram negative
- pseudomonas aeruginosa
- staphylococcus aureus
- climate change
- candida albicans
- biofilm formation
- deep learning
- preterm infants
- healthcare
- multidrug resistant
- artificial intelligence
- gestational age
- amino acid
- ultrasound guided