TargIDe: a machine-learning workflow for target identification of molecules with antibiofilm activity against Pseudomonas aeruginosa.
João Carneiro, Rita P Magalhães, Victor M de la Oliva Roque, Mariana Sousa, Diogo Pratas, Sérgio F Sousa
Published in: Journal of computer-aided molecular design (2023)
Bacterial biofilms are a source of infectious human diseases and are strongly linked to antibiotic resistance. Pseudomonas aeruginosa is a widespread multidrug-resistant bacterium implicated in several hospital-acquired infections. In recent years, developing new drugs that inhibit Pseudomonas aeruginosa by interfering with its ability to form biofilms has become a promising strategy in drug discovery. Identifying molecules able to interfere with biofilm formation is difficult, and further developing these molecules by rationally improving their activity is particularly challenging, as it requires knowledge of the specific protein target that is inhibited. This work describes the development of a machine-learning, multi-technique consensus workflow to predict the protein targets of molecules with confirmed inhibitory activity against biofilm formation by Pseudomonas aeruginosa. It uses a specialized database containing all the known targets implicated in biofilm formation by Pseudomonas aeruginosa. The experimentally confirmed inhibitors available on ChEMBL, together with chemical descriptors, were used as the input features for a combination of nine different classification models, yielding a consensus method to predict the most likely target of a ligand. The implemented algorithm is freely available at https://github.com/BioSIM-Research-Group/TargIDe under the GNU General Public Licence (GPL) version 3 and can easily be improved as more data become available.
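The consensus step described in the abstract can be illustrated with a minimal sketch. It assumes a simple majority vote over the labels predicted by the nine classifiers; the abstract does not state how TargIDe actually combines its models, so the voting rule, the function name `consensus_target`, and the placeholder labels `target_A`, `target_B`, and `target_C` are all hypothetical.

```python
from collections import Counter


def consensus_target(predictions):
    """Return (label, vote_fraction) for the most frequently predicted target.

    `predictions` holds one predicted target label per model.
    Ties are broken by the order in which labels first appear.
    Majority voting is an assumption, not the published TargIDe rule.
    """
    if not predictions:
        raise ValueError("need at least one model prediction")
    counts = Counter(predictions)
    label, votes = counts.most_common(1)[0]
    return label, votes / len(predictions)


# Hypothetical output of nine classifiers for one ligand (placeholder labels).
votes = ["target_A", "target_B", "target_A", "target_A", "target_C",
         "target_A", "target_B", "target_A", "target_A"]
label, support = consensus_target(votes)
print(label, round(support, 2))
```

Returning the vote fraction alongside the label gives a rough indication of how strongly the models agree, which may help when deciding whether a predicted target is worth following up.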
Keyphrases
- biofilm formation
- pseudomonas aeruginosa
- machine learning
- candida albicans
- cystic fibrosis
- acinetobacter baumannii
- multidrug resistant
- drug discovery
- big data
- staphylococcus aureus
- healthcare
- deep learning
- electronic health record
- artificial intelligence
- escherichia coli
- endothelial cells
- adverse drug
- protein protein
- mental health
- clinical practice
- drug resistant
- palliative care
- bioinformatics analysis
- induced pluripotent stem cells