Login / Signup

GDPF: a data resource for the distribution of prokaryotic protein families across the global biosphere.

Zhuo PanDan-Dan LiPeng LiYu GengYiru JiangYa LiuYue-Zhong LiZheng Zhang
Published in: Nucleic acids research (2023)
Microorganisms encode most of the functions of life on Earth. However, conventional research has primarily focused on specific environments such as humans, soil and oceans, leaving the distribution of functional families throughout the global biosphere poorly comprehended. Here, we present the database of the global distribution of prokaryotic protein families (GDPF, http://bioinfo.qd.sdu.edu.cn/GDPF/), a data resource on the distribution of functional families across the global biosphere. GDPF provides global distribution information for 36 334 protein families, 19 734 superfamilies and 12 089 KEGG (Kyoto Encyclopedia of Genes and Genomes) orthologs from multiple source databases, covering typical environments such as soil, oceans, animals, plants and sediments. Users can browse, search and download the distribution data of each entry in 10 000 global microbial communities, as well as conduct comparative analysis of distribution disparities among multiple entries across various environments. The GDPF data resource contributes to uncovering the geographical distribution patterns, key influencing factors and macroecological principles of microbial functions at a global level, thereby promoting research in Earth ecology and human health.
Keyphrases
  • electronic health record
  • big data
  • human health
  • healthcare
  • heavy metals
  • binding protein
  • deep learning
  • dna methylation
  • plant growth
  • genome wide identification