Embedding covariate adjustments in tree-based automated machine learning for biomedical big data analyses.
Elisabetta ManduchiWeixuan FuJoseph D RomanoStefano RubertoJason H MoorePublished in: BMC bioinformatics (2020)
In this work, we address an important need in the context of AutoML, which is particularly crucial for applications to bioinformatics and medical informatics, namely covariate adjustments. To this end we present a substantial extension of TPOT, a genetic programming based AutoML approach. We show the utility of this extension by applications to large toxicogenomics and differential gene expression data. The method is generally applicable in many other scenarios from the biomedical field.