Login / Signup

CGRdb2.0: A Python Database Management System for Molecules, Reactions, and Chemical Data.

Timur R GimadievRamil I NugmanovAigul KhakimovaAdeliya FatykhovaTimur I MadzhidovPavel SidorovAlexandre Varnek
Published in: Journal of chemical information and modeling (2021)
This work introduces CGRdb2.0─an open-source database management system for molecules, reactions, and chemical data. CGRdb2.0 is a Python package connecting to a PostgreSQL database that enables native searches for molecules and reactions without complicated SQL syntax. The library provides out-of-the-box implementations for similarity and substructure searches for molecules, as well as similarity and substructure searches for reactions in two ways─based on reaction components and based on the Condensed Graph of Reaction approach, the latter significantly accelerating the performance. In benchmarking studies with the RDKit database cartridge, we demonstrate that CGRdb2.0 performs searches faster for smaller data sets, while allowing for interactive access to the retrieved data.
Keyphrases
  • electronic health record
  • big data
  • adverse drug
  • emergency department
  • transcription factor
  • deep learning
  • convolutional neural network
  • binding protein
  • neural network