Login / Signup

Saccharomycotina yeasts defy long-standing macroecological patterns.

Kyle T DavidMarie-Claire HarrisonDana A OpulenteAbigail Leavitt LaBellaJohn F WoltersXiao-Fan ZhouXing-Xing ShenMarizeth GroenewaldMatthew W PennellChris Todd HittingerAntonis Rokas
Published in: Proceedings of the National Academy of Sciences of the United States of America (2024)
The Saccharomycotina yeasts ("yeasts" hereafter) are a fungal clade of scientific, economic, and medical significance. Yeasts are highly ecologically diverse, found across a broad range of environments in every biome and continent on earth; however, little is known about what rules govern the macroecology of yeast species and their range limits in the wild. Here, we trained machine learning models on 12,816 terrestrial occurrence records and 96 environmental variables to infer global distribution maps at ~1 km 2 resolution for 186 yeast species (~15% of described species from 75% of orders) and to test environmental drivers of yeast biogeography and macroecology. We found that predicted yeast diversity hotspots occur in mixed montane forests in temperate climates. Diversity in vegetation type and topography were some of the greatest predictors of yeast species richness, suggesting that microhabitats and environmental clines are key to yeast diversity. We further found that range limits in yeasts are significantly influenced by carbon niche breadth and range overlap with other yeast species, with carbon specialists and species in high-diversity environments exhibiting reduced geographic ranges. Finally, yeasts contravene many long-standing macroecological principles, including the latitudinal diversity gradient, temperature-dependent species richness, and a positive relationship between latitude and range size (Rapoport's rule). These results unveil how the environment governs the global diversity and distribution of species in the yeast subphylum. These high-resolution models of yeast species distributions will facilitate the prediction of economically relevant and emerging pathogenic species under current and future climate scenarios.
Keyphrases
  • saccharomyces cerevisiae
  • machine learning
  • climate change
  • cell wall
  • high resolution
  • genetic diversity
  • risk assessment
  • artificial intelligence
  • human health
  • tandem mass spectrometry