MolLM: a unified language model for integrating biomedical text with 2D and 3D molecular representations.
Xiangru Tang, Andrew Tran, Jeffrey Tan, Mark B Gerstein
Published in: Bioinformatics (Oxford, England) (2024)
Our code, data, pre-trained model weights, and examples of using our model are all available at https://github.com/gersteinlab/MolLM. In particular, we provide Jupyter Notebooks offering step-by-step guidance on how to use MolLM to extract embeddings for both molecules and text.
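Once molecule and text embeddings have been extracted, one common way to use them is to compare them by cosine similarity. The sketch below is not MolLM's API and does not load the model. It assumes the embeddings have already been exported as plain lists of floats with the same dimension. The function names `cosine_similarity` and `rank_texts` are hypothetical, and the vectors are made-up placeholders rather than real model output.

```python
import math
from typing import List, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two embedding vectors."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


def rank_texts(
    molecule_embedding: Sequence[float],
    text_embeddings: Sequence[Sequence[float]],
) -> List[Tuple[int, float]]:
    """Return (text index, similarity) pairs, most similar first."""
    scores = [
        (i, cosine_similarity(molecule_embedding, t))
        for i, t in enumerate(text_embeddings)
    ]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


# Placeholder vectors (assumption): real embeddings would come from the model.
molecule_embedding = [1.0, 0.0, 1.0]
text_embeddings = [
    [0.0, 1.0, 0.0],  # orthogonal to the molecule vector
    [1.0, 0.0, 1.0],  # identical direction
    [1.0, 1.0, 0.0],  # partial overlap
]

ranking = rank_texts(molecule_embedding, text_embeddings)
for index, score in ranking:
    print(f"text {index}: similarity {score:.3f}")
```

With real embeddings, the same ranking step could serve as a simple molecule-to-text retrieval check. The actual extraction calls should be taken from the notebooks in the repository.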