Login / Signup

Gram matrix: an efficient representation of molecular conformation and learning objective for molecular pretraining.

Wenkai XiangFeisheng ZhongLin NiMingyue ZhengXutong LiQian ShiDingyan Wang
Published in: Briefings in bioinformatics (2024)
Accurate prediction of molecular properties is fundamental in drug discovery and development, providing crucial guidance for effective drug design. A critical factor in achieving accurate molecular property prediction lies in the appropriate representation of molecular structures. Presently, prevalent deep learning-based molecular representations rely on 2D structure information as the primary molecular representation, often overlooking essential three-dimensional (3D) conformational information due to the inherent limitations of 2D structures in conveying atomic spatial relationships. In this study, we propose employing the Gram matrix as a condensed representation of 3D molecular structures and for efficient pretraining objectives. Subsequently, we leverage this matrix to construct a novel molecular representation model, Pre-GTM, which inherently encapsulates 3D information. The model accurately predicts the 3D structure of a molecule by estimating the Gram matrix. Our findings demonstrate that Pre-GTM model outperforms the baseline Graphormer model and other pretrained models in the QM9 and MoleculeNet quantitative property prediction task. The integration of the Gram matrix as a condensed representation of 3D molecular structure, incorporated into the Pre-GTM model, opens up promising avenues for its potential application across various domains of molecular research, including drug design, materials science, and chemical engineering.
Keyphrases
  • single molecule
  • high resolution
  • deep learning
  • healthcare
  • gram negative
  • emergency department
  • drug discovery
  • machine learning
  • adverse drug
  • neural network
  • crystal structure