Login / Signup

Feature Selection Method Based on Partial Least Squares and Analysis of Traditional Chinese Medicine Data.

Canyi HuangJianqiang DuBin NieRiyue YuWangping XiongQingxia Zeng
Published in: Computational and mathematical methods in medicine (2019)
The partial least squares method has many advantages in multivariable linear regression, but it does not include the function of feature selection. This method cannot screen for the best feature subset (referred to in this study as the "Gold Standard") or optimize the model, although contrarily using the L1 norm can achieve the sparse representation of parameters, leading to feature selection. In this study, a feature selection method based on partial least squares is proposed. In the new method, exploiting partial least squares allows extraction of the latent variables required for performing multivariable linear regression, and this method applies the L1 regular term constraint to the sum of the absolute values of the regression coefficients. This technique is then combined with the coordinate descent method to perform multiple iterations to select a better feature subset. Analyzing traditional Chinese medicine data and University of California, Irvine (UCI), datasets with the model, the experimental results show that the feature selection method based on partial least squares exhibits preferable adaptability for traditional Chinese medicine data and UCI datasets.
Keyphrases
  • machine learning
  • deep learning
  • electronic health record
  • high throughput
  • artificial intelligence