Alternatives to Cross-Validatory Estimation of the Number of Factors in Multivariate Calibration
- 1 November 1990
- journal article
- research article
- Published by SAGE Publications in Applied Spectroscopy
- Vol. 44 (9), 1464-1470
- https://doi.org/10.1366/0003702904417788
Abstract
Overcoming the collinearity problem in regression by data compression techniques [i.e., principal component regression (PCR) and partial least-squares (PLS)] requires estimation of the number of factors (principal component) to use for the model. The most common approach is to use cross-validation for this purpose. Unfortunately, cross-validation is time consuming to carry out. Accordingly, we have searched for time-saving methods to estimate the number of factors. Two approaches were considered. The first uses the estimated standard error of the model and the second is an approximation to a cross-validation leave-one-out method. Both alternatives have been tested on spectroscopic data. It has been found that, when the number of wavelengths is limited, both methods give results similar to those obtained by full cross-validation both for PCR and PLS. However, when the number of wavelengths is large, the tested methods are reliable only for PCR and not for PLS.Keywords
This publication has 11 references indexed in Scilit:
- Prediction of gasoline octane numbers from near-infrared spectral features in the range 660-1215 nmAnalytical Chemistry, 1989
- A Note on the Use of the Partial Least-Squares Method for Multivariate CalibrationApplied Spectroscopy, 1988
- Principal component regression in NIR analysis: Viewpoints, background details and selection of componentsJournal of Chemometrics, 1988
- A theoretical foundation for the PLS algorithmJournal of Chemometrics, 1987
- Multivariate Calibration When the Error Covariance Matrix Is StructuredTechnometrics, 1985
- The Collinearity Problem in Linear Regression. The Partial Least Squares (PLS) Approach to Generalized InversesSIAM Journal on Scientific and Statistical Computing, 1984
- On Deriving the Inverse of a Sum of MatricesSIAM Review, 1981
- Regression DiagnosticsWiley Series in Probability and Statistics, 1980
- Cross-Validatory Estimation of the Number of Components in Factor and Principal Components ModelsTechnometrics, 1978
- Updating the singular value decompositionNumerische Mathematik, 1978