Robust biomarker discovery for hepatocellular carcinoma from high-throughput data by multiple feature selection methods.
Zishuang ZhangZhi-Ping LiuPublished in: BMC medical genomics (2021)
It is found that overlaps among gene subsets contain different quantitative features selected by the RFE-CV of 6 classifiers. The AIC values in the model selection provide a theoretical foundation for the feature selection process of biomarker discovery via machine learning. What's more, genes containing in more optimally selected subsets make better biological sense and implication. The quality of feature selection is improved by the intersections of biomarkers selected from different classifiers. This is a general method suitable for screening biomarkers of complex diseases from high-throughput data.