Bootstrap based confidence limits in principal component analysis: a case study

Hamid Babamoradi, Franciscus Winfried J van der Berg, Åsmund Rinnan

45 Citations (Scopus)

Abstract

Principal component analysis (PCA) is widely used as a tool in (exploratory) data investigations for many different research areas such as analytical chemistry, food- and pharmaceutical-research, and multivariate statistical process control. Despite its popularity, not many results have been reported thus far on how to calculate reliable confidence interval limits in PCA estimates. And, like all other data analysis tasks, results of PCA are not complete without reasonable expectations for the parameter uncertainties, especially in the case of predictive model objectives. In this paper we will present a case study on how to calculate confidence limits based on bootstrap re-sampling. Two NIR datasets are used to build bootstrap confidence limits. The first dataset shows the effect of outliers on bootstrap confidence limits, while the second shows the bootstrap confidence limits when the data has a bimodal distribution. The different steps and choices which have to be made for the algorithm to perform correctly will be presented. The bootstrap based confidence limits will be compared with the corresponding asymptotic confidence limits. We will thereby conclude that the confidence limits based on the bootstrap method give more meaningful answers and are to be preferred over its asymptotic counterparts.

Original languageEnglish
JournalChemometrics and Intelligent Laboratory Systems
Volume120
Pages (from-to)97-105
Number of pages9
ISSN0169-7439
DOIs
Publication statusPublished - 15 Jan 2013

Fingerprint

Dive into the research topics of 'Bootstrap based confidence limits in principal component analysis: a case study'. Together they form a unique fingerprint.

Cite this