Recursive weighted partial least squares (rPLS): an efficient variable selection method using PLS

Åsmund Rinnan, Martin Andersson, Carsten Ridder, Søren Balling Engelsen

50 Citationer (Scopus)

Abstract

Variable selection is important in fine tuning partial least squares (PLS) regression models. This study introduces a novel variable weighting method for PLS regression where the univariate response variable y is used to guide the variable weighting in a recursive manner-the method is called recursive weighted PLS or just rPLS. The method iteratively reweights the variables using the regression coefficients calculated by PLS. The use of the regression vector to make up the weights is a reasonable idea from the fact that the weights in the regression vector ideally reflect the importance of the variables. In contrast to many other variable selection methods, the rPLS method has the advantage that only one parameter needs to be estimated: the number of latent factors used in the PLS model. The rPLS model has the fascinating output that it, under normal conditions, converges to a very limited number of variables (useful for interpretation), but it will exhibit optimal regression performance before convergence, normally including covarying neighbor variables. This study examines the properties of rPLS by application to a near-infrared spectroscopy dataset of feed samples predicting the protein content and to a metabolomics dataset modeling a reference metabolic parameter (creatinine) from nuclear magnetic resonance spectra of human urine.

OriginalsprogEngelsk
TidsskriftJournal of Chemometrics
Vol/bind28
Udgave nummer5
Sider (fra-til)439–447
Antal sider9
ISSN0886-9383
DOI
StatusUdgivet - maj 2014

Fingeraftryk

Dyk ned i forskningsemnerne om 'Recursive weighted partial least squares (rPLS): an efficient variable selection method using PLS'. Sammen danner de et unikt fingeraftryk.

Citationsformater