Subtyping and grading of lower-grade gliomas using integrated feature selection and support vector machine.

Sana Munquad, Tapas Si, Saurav Mallik, Aimin Li, Asim Bikas Das
Published in: Briefings in functional genomics (2022)
Classifying lower-grade gliomas (LGGs) is a crucial step for accurate therapeutic intervention. The histopathological classification of LGG subtypes, including astrocytoma, oligodendroglioma and oligoastrocytoma, suffers from intraobserver and interobserver variability, leading to inaccurate classification and greater risk to patient health. We designed an efficient machine learning-based classification framework to diagnose LGG subtypes and grades using transcriptome data. First, we developed an integrated feature selection method based on correlation and support vector machine (SVM) recursive feature elimination. Then, an SVM classifier trained on the selected features achieved superior accuracy compared with other machine learning frameworks. Most importantly, we found that the accuracy of subtype classification is consistently high (>90%) within a specific grade but lower (~80%) in mixed-grade cancer. Differential co-expression analysis revealed higher heterogeneity in mixed-grade cancer, resulting in reduced prediction accuracy. Our findings suggest that it is necessary to identify cancer grades and subtypes to attain a higher classification accuracy. Our six-class classification model efficiently predicts the grades and subtypes with an average accuracy of 91% (±0.02). Furthermore, we identify several predictive biomarkers using co-expression, gene set enrichment and survival analysis, indicating that our framework is biologically interpretable and can potentially support the clinician.
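The two-stage feature selection followed by an SVM classifier can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify how the correlation step works, so the sketch assumes a simple redundancy filter that drops any feature highly correlated with one already kept. The class name CorrelationFilter, the 0.9 threshold, the choice of 20 final features, and the synthetic six-class data (standing in for transcriptome profiles) are all hypothetical choices made for illustration, using scikit-learn.

```python
import numpy as np
from sklearn.base import BaseEstimator, TransformerMixin
from sklearn.datasets import make_classification
from sklearn.feature_selection import RFE
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC


class CorrelationFilter(BaseEstimator, TransformerMixin):
    """Hypothetical redundancy filter (an assumption, not the paper's method):
    keep a feature only if its absolute Pearson correlation with every
    previously kept feature is below `threshold`."""

    def __init__(self, threshold=0.9):
        self.threshold = threshold

    def fit(self, X, y=None):
        X = np.asarray(X, dtype=float)
        corr = np.abs(np.corrcoef(X, rowvar=False))
        keep = []
        for j in range(corr.shape[0]):
            if all(corr[j, k] < self.threshold for k in keep):
                keep.append(j)
        self.keep_ = np.array(keep, dtype=int)
        return self

    def transform(self, X):
        return np.asarray(X, dtype=float)[:, self.keep_]


def build_pipeline(n_features=20, corr_threshold=0.9):
    """Correlation filter -> scaling -> SVM-RFE -> linear SVM classifier."""
    return Pipeline([
        ("corr", CorrelationFilter(threshold=corr_threshold)),
        ("scale", StandardScaler()),
        # SVM-RFE: repeatedly fit a linear SVM and drop the lowest-weight
        # features (10% of the remaining input features per iteration).
        ("rfe", RFE(SVC(kernel="linear", C=1.0),
                    n_features_to_select=n_features, step=0.1)),
        ("svm", SVC(kernel="linear", C=1.0)),
    ])


# Synthetic stand-in for expression data: 300 samples, 200 "genes",
# six classes (e.g. three subtypes in each of two grades).
X, y = make_classification(
    n_samples=300, n_features=200, n_informative=20, n_redundant=40,
    n_classes=6, n_clusters_per_class=1, random_state=0,
)

pipeline = build_pipeline()
# Feature selection sits inside the pipeline, so it is refit on each
# training fold and the held-out fold never influences which features are kept.
scores = cross_val_score(pipeline, X, y, cv=5)
print(f"mean accuracy: {scores.mean():.3f} (std {scores.std():.3f})")
```

Keeping both selection steps inside the pipeline is a deliberate design choice: selecting features on the full dataset before cross-validation would leak label information and inflate the reported accuracy.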