Login / Signup

Reaction Data Curation I: Chemical Structures and Transformations Standardization.

Timur R GimadievArkadii LinValentina A AfoninaDinar BatyrshinRamil I NugmanovTagir AkhmetshinPavel SidorovNatalia DuybankovaJonas VerhoevenJoerg WegnerHugo CeulemansAndrey GedichTimur I MadzhidovAlexandre Varnek
Published in: Molecular informatics (2021)
The quality of experimental data for chemical reactions is a critical consideration for any reaction-driven study. However, the curation of reaction data has not been extensively discussed in the literature so far. Here, we suggest a 4 steps protocol that includes the curation of individual structures (reactants and products), chemical transformations, reaction conditions and endpoints. Its implementation in Python3 using CGRTools toolkit has been used to clean three popular reaction databases Reaxys, USPTO and Pistachio. The curated USPTO database is available in the GitHub repository (Laboratoire-de-Chemoinformatique/Reaction_Data_Cleaning).
Keyphrases
  • electronic health record
  • big data
  • randomized controlled trial
  • systematic review
  • primary care
  • high resolution
  • multidrug resistant
  • artificial intelligence
  • adverse drug