AniProtDB: A Collection of Consistently Generated Metazoan Proteomes for Comparative Genomics Studies.

Sofia N Barreira, Anh-Dao Nguyen, Mark T Fredriksen, Tyra G Wolfsberg, R Travis Moreland, Andreas D Baxevanis
Published in: Molecular Biology and Evolution (2022)
To address the void in the availability of high-quality proteomic data traversing the animal tree, we have implemented a pipeline for generating de novo assemblies based on publicly available data from the NCBI Sequence Read Archive, yielding a comprehensive collection of proteomes from 100 species spanning 21 animal phyla. We have also created the Animal Proteome Database (AniProtDB), a resource providing open access to this collection of high-quality metazoan proteomes, along with information on predicted proteins and protein domains for each taxonomic classification and the ability to perform sequence similarity searches against all proteomes generated using this pipeline. This solution vastly increases the utility of these data by removing the barrier to access for research groups who do not have the expertise or resources to generate these data themselves and enables the use of data from nontraditional research organisms that have the potential to address key questions in biomedicine.