MAIP: a web service for predicting blood-stage malaria inhibitors.
Nicolas BoscEloy FelixRicardo ArcilaDavid MendezMartin R SaundersDarren V S GreenJason OchoadaAnang A ShelatEric J MartinPreeti IyerOla EngkvistAndreas VerrasJames DuffyJeremy BurrowsJ Mark F GardnerAndrew R LeachPublished in: Journal of cheminformatics (2021)
Malaria is a disease affecting hundreds of millions of people across the world, mainly in developing countries and especially in sub-Saharan Africa. It is the cause of hundreds of thousands of deaths each year and there is an ever-present need to identify and develop effective new therapies to tackle the disease and overcome increasing drug resistance. Here, we extend a previous study in which a number of partners collaborated to develop a consensus in silico model that can be used to identify novel molecules that may have antimalarial properties. The performance of machine learning methods generally improves with the number of data points available for training. One practical challenge in building large training sets is that the data are often proprietary and cannot be straightforwardly integrated. Here, this was addressed by sharing QSAR models, each built on a private data set. We describe the development of an open-source software platform for creating such models, a comprehensive evaluation of methods to create a single consensus model and a web platform called MAIP available at https://www.ebi.ac.uk/chembl/maip/ . MAIP is freely available for the wider community to make large-scale predictions of potential malaria inhibiting compounds. This project also highlights some of the practical challenges in reproducing published computational methods and the opportunities that open-source software can offer to the community.
Keyphrases
- plasmodium falciparum
- healthcare
- mental health
- machine learning
- electronic health record
- big data
- data analysis
- molecular docking
- artificial intelligence
- clinical practice
- health insurance
- cross sectional
- randomized controlled trial
- systematic review
- signaling pathway
- deep learning
- health information
- structure activity relationship