ProtaBank: A repository for protein design and engineering data.

Connie Y Wang, Paul M Chang, Marie L Ary, Benjamin D Allen, Roberto A Chica, Stephen L Mayo, Barry D Olafson
Published in: Protein science : a publication of the Protein Society (2018)
We present ProtaBank, a repository for storing, querying, analyzing, and sharing protein design and engineering data in an actively maintained and updated database. ProtaBank provides a format to describe and compare all types of protein mutational data, spanning a wide range of properties and techniques. It features a user-friendly web interface and programming layer that streamlines data deposition and allows for batch input and queries. The database schema design incorporates a standard format for reporting protein sequences and experimental data that facilitates comparison of results across different data sets. A suite of analysis and visualization tools is provided to facilitate discovery, to guide future designs, and to benchmark and train new predictive tools and algorithms. ProtaBank will provide a valuable resource to the protein engineering community by storing and safeguarding newly generated data, allowing for fast searching and identification of relevant data from the existing literature, and exploring correlations between disparate data sets. ProtaBank invites researchers to contribute data to the database to make it accessible for search and analysis. ProtaBank is available at https://protabank.org.