Applying interpretable machine learning in computational biology-pitfalls, recommendations and opportunities for new developments.
Valerie ChenMuyu YangWenbo CuiJoon Sik KimAmeet TalwalkarJian MaPublished in: Nature methods (2024)
Recent advances in machine learning have enabled the development of next-generation predictive models for complex computational biology problems, thereby spurring the use of interpretable machine learning (IML) to unveil biological insights. However, guidelines for using IML in computational biology are generally underdeveloped. We provide an overview of IML methods and evaluation techniques and discuss common pitfalls encountered when applying IML methods to computational biology problems. We also highlight open questions, especially in the era of large language models, and call for collaboration between IML and computational biology researchers.