A benchmark of optimization solvers for genome-scale metabolic modeling of organisms and communities.
Daniel MachadoPublished in: mSystems (2024)
Genome-scale metabolic modeling is a powerful framework for predicting metabolic phenotypes of any organism with an annotated genome. For two decades, this framework has been used for the rational design of microbial cell factories. In the last decade, the range of applications has exploded, and new frontiers have emerged, including the study of the gut microbiome and its health implications and the role of microbial communities in global ecosystems. However, all the critical steps in this framework, from model construction to simulation, require the use of powerful linear optimization solvers, with the choice often relying on commercial solvers for their well-known computational efficiency. In this work, I benchmark a total of six solvers (two commercial and four open source) and measure their performance to solve linear and mixed-integer linear problems of increasing complexity. Although commercial solvers are still the fastest, at least two open-source solvers show comparable performance. These results show that genome-scale metabolic modeling does not need to be hindered by commercial licensing schemes and can become a truly open science framework for solving urgent societal challenges.IMPORTANCEModeling the metabolism of organisms and communities allows for computational exploration of their metabolic capabilities and testing their response to genetic and environmental perturbations. This holds the potential to address multiple societal issues related to human health and the environment. One of the current limitations is the use of commercial optimization solvers with restrictive licenses for academic and non-academic use. This work compares the performance of several commercial and open-source solvers to solve some of the most complex problems in the field. Benchmarking results show that, although commercial solvers are indeed faster, some of the open-source options can also efficiently tackle the hardest problems, showing great promise for the development of open science applications.