Rapid and Automated Ab Initio Metabolite Collisional Cross Section Prediction from SMILES Input.
Susanta DasLaleh DinpazhohKiyoto Aramis TanemuraKenneth M MerzPublished in: Journal of chemical information and modeling (2023)
We implemented an ab initio CCS prediction workflow which incrementally refines generated structures using molecular mechanics, a deep learning potential, conformational clustering, and quantum mechanics (QM). Automating intermediate steps for a high performance computing (HPC) environment allows users to input the SMILES structure of small organic molecules and obtain a Boltzmann averaged collisional cross section (CCS) value as output. The CCS of a molecular species is a metric measured by ion mobility spectrometry (IMS) which can improve annotation of untargeted metabolomics experiments. We report only a minor drop in accuracy when we expedite the CCS calculation by replacing the QM geometry refinement step with a single-point energy calculation. Even though the workflow involves stochastic steps (i.e., conformation generation and clustering), the final CCS value was highly reproducible for multiple iterations on L-carnosine. Finally, we illustrate that the gas phase ensembles modeled for the workflow are intermediate files which can be used for the prediction of other properties such as aqueous phase nuclear magnetic resonance chemical shift prediction. The software is available at the following link: https://github.com/DasSusanta/snakemake_ccs.
Keyphrases
- deep learning
- magnetic resonance
- mass spectrometry
- rna seq
- molecular dynamics
- electronic health record
- single molecule
- high resolution
- molecular dynamics simulations
- machine learning
- single cell
- computed tomography
- convolutional neural network
- risk assessment
- crystal structure
- gas chromatography
- tandem mass spectrometry
- water soluble