Login / Signup

Ensemble of a subset of kNN classifiers.

Asma GulAris PerperoglouZardad KhanOsama MahmoudMiftahuddin MiftahuddinWerner AdlerBerthold Lausen
Published in: Advances in data analysis and classification (2016)
Combining multiple classifiers, known as ensemble methods, can give substantial improvement in prediction performance of learning algorithms especially in the presence of non-informative features in the data sets. We propose an ensemble of subset of kNN classifiers, ESkNN, for classification task in two steps. Firstly, we choose classifiers based upon their individual performance using the out-of-sample accuracy. The selected classifiers are then combined sequentially starting from the best model and assessed for collective performance on a validation data set. We use bench mark data sets with their original and some added non-informative features for the evaluation of our method. The results are compared with usual kNN, bagged kNN, random kNN, multiple feature subset method, random forest and support vector machines. Our experimental comparisons on benchmark classification problems and simulated data sets reveal that the proposed ensemble gives better classification performance than the usual kNN and its ensembles, and performs comparable to random forest and support vector machines.
Keyphrases
  • machine learning
  • deep learning
  • electronic health record
  • neural network
  • big data
  • convolutional neural network
  • climate change
  • artificial intelligence
  • single cell
  • data analysis