DockingGA: enhancing targeted molecule generation using transformer neural network and genetic algorithm with docking simulation.
Changnan GaoWenjie BaoShuang WangJianyang ZhengLulu WangYongqi RenLinfang JiaoJianmin WangXun WangPublished in: Briefings in functional genomics (2024)
Generative molecular models generate novel molecules with desired properties by searching chemical space. Traditional combinatorial optimization methods, such as genetic algorithms, have demonstrated superior performance in various molecular optimization tasks. However, these methods do not utilize docking simulation to inform the design process, and heavy dependence on the quality and quantity of available data, as well as require additional structural optimization to become candidate drugs. To address this limitation, we propose a novel model named DockingGA that combines Transformer neural networks and genetic algorithms to generate molecules with better binding affinity for specific targets. In order to generate high quality molecules, we chose the Self-referencing Chemical Structure Strings to represent the molecule and optimize the binding affinity of the molecules to different targets. Compared to other baseline models, DockingGA proves to be the optimal model in all docking results for the top 1, 10 and 100 molecules, while maintaining 100% novelty. Furthermore, the distribution of physicochemical properties demonstrates the ability of DockingGA to generate molecules with favorable and appropriate properties. This innovation creates new opportunities for the application of generative models in practical drug discovery.
Keyphrases
- neural network
- machine learning
- molecular dynamics
- drug discovery
- molecular dynamics simulations
- deep learning
- genome wide
- protein protein
- copy number
- gene expression
- dna binding
- cancer therapy
- dna methylation
- small molecule
- transcription factor
- electronic health record
- mass spectrometry
- virtual reality
- artificial intelligence
- drug induced