Login / Signup

Automated Identification of Molecular Crystals' Packing Motifs.

Donald LovelandBhavya KailkhuraPiyush KarandeAnna M HiszpanskiThomas Yong-Jin Han
Published in: Journal of chemical information and modeling (2020)
Packing motifs-patterns in how molecules orient relative to one another in a crystal structure-are an important concept in many subdisciplines of materials science because of correlations observed between specific packing motifs and properties of interest. That said, packing motif data sets have remained small and noisy due to intensive manual labeling processes and insufficient labeling schemes. The most prominent labeling algorithms calculate relative interplanar angles of nearest neighbor molecules to determine the packing motif of a molecular crystal, but this simple approach can fail when neighbors are naively sampled isotropically around the crystal structure. To remedy this issue, we propose an optimization algorithm, which rotates the molecular crystal structure to find representative molecules that inform the packing motif. We package this algorithm into an automated framework-Autopack-which both optimally rotates the crystal structure and labels the packing motif based on the appropriate neighboring molecules. In this work, we detail the Autopack framework and its performance, which shows improvements compared to previous state-of-the-art labeling methods, providing the first quantitative point of comparison for packing motif labeling algorithms. Furthermore, using Autopack (available at https://ipo.llnl.gov/technologies/software/autopack), we perform the first large-scale study of potential relationships between chemicals' compositions and packing motifs, which shows that these relationships are more complex than previously hypothesized from studies that used only tens of polycyclic aromatic hydrocarbon molecules. Autopack's capabilities help pose next steps for crystal engineering research focusing not only on a molecule's adoption of a specific packing motif but also on new structure-property relationships.
Keyphrases
  • crystal structure
  • machine learning
  • deep learning
  • public health
  • risk assessment
  • high resolution
  • single molecule
  • big data