BCD-WERT: a novel approach for breast cancer detection using whale optimization based efficient features and extremely randomized tree algorithm.
Shafaq AbbasZunera JalilAbdul Rehman JavedIqra BatoolMohammad Zubair KhanAbdulfattah NoorwaliThippa Reddy GadekalluAqsa AkbarPublished in: PeerJ. Computer science (2021)
Breast cancer is one of the leading causes of death in the current age. It often results in subpar living conditions for a patient as they have to go through expensive and painful treatments to fight this cancer. One in eight women all over the world is affected by this disease. Almost half a million women annually do not survive this fight and die from this disease. Machine learning algorithms have proven to outperform all existing solutions for the prediction of breast cancer using models built on the previously available data. In this paper, a novel approach named BCD-WERT is proposed that utilizes the Extremely Randomized Tree and Whale Optimization Algorithm (WOA) for efficient feature selection and classification. WOA reduces the dimensionality of the dataset and extracts the relevant features for accurate classification. Experimental results on state-of-the-art comprehensive dataset demonstrated improved performance in comparison with eight other machine learning algorithms: Support Vector Machine (SVM), Random Forest, Kernel Support Vector Machine, Decision Tree, Logistic Regression, Stochastic Gradient Descent, Gaussian Naive Bayes and k-Nearest Neighbor. BCD-WERT outperformed all with the highest accuracy rate of 99.30% followed by SVM achieving 98.60% accuracy. Experimental results also reveal the effectiveness of feature selection techniques in improving prediction accuracy.
Keyphrases
- machine learning
- deep learning
- big data
- artificial intelligence
- breast cancer risk
- double blind
- polycystic ovary syndrome
- open label
- placebo controlled
- randomized controlled trial
- phase iii
- systematic review
- papillary thyroid
- case report
- squamous cell carcinoma
- clinical trial
- type diabetes
- pregnancy outcomes
- electronic health record
- genome wide
- adipose tissue
- pregnant women
- mass spectrometry
- squamous cell
- cervical cancer screening
- insulin resistance
- single cell
- lymph node metastasis
- young adults
- loop mediated isothermal amplification
- label free