Login / Signup

Building a genome-based understanding of bacterial pH preferences.

Josep RamonedaElias Stallard-OliveraMichael HoffertClaire C WinfreyMasumi StadlerJuan Pablo Niño-GarcíaNoah Fierer
Published in: Science advances (2023)
The environmental preferences of many microbes remain undetermined. This is the case for bacterial pH preferences, which can be difficult to predict a priori despite the importance of pH as a factor structuring bacterial communities in many systems. We compiled data on bacterial distributions from five datasets spanning pH gradients in soil and freshwater systems (1470 samples), quantified the pH preferences of bacterial taxa across these datasets, and compiled genomic data from representative bacterial taxa. While taxonomic and phylogenetic information were generally poor predictors of bacterial pH preferences, we identified genes consistently associated with pH preference across environments. We then developed and validated a machine learning model to estimate bacterial pH preferences from genomic information alone, a model that could aid in the selection of microbial inoculants, improve species distribution models, or help design effective cultivation strategies. More generally, we demonstrate the value of combining biogeographic and genomic data to infer and predict the environmental preferences of diverse bacterial taxa.
Keyphrases
  • machine learning
  • decision making
  • big data
  • electronic health record
  • copy number
  • microbial community
  • healthcare
  • climate change
  • health information
  • human health