Estimation of prediction error for multivariate calibration
- 1 April 1988
- journal article
- research article
- Published by Wiley in Journal of Chemometrics
- Vol. 2 (2), 93-109
- https://doi.org/10.1002/cem.1180020203
Abstract
When arrays of non‐selective sensors or overlapping spectra are used for chemical analysis, multivariate calibration must be used to relate the instrument responses to individual analytes. Using a set of carefully selected calibration samples, a multivariate mathematical model is constructed for one or more analytes. If this step is successful, the model can be used to predict the concentrations of these analytes in prospective samples. Previously, the equations required to estimate the errors in the predicted concentrations, and from these the confidence intervals, were not available because the three sources of error (measured responses from calibration data, concentrations of the analytes in the calibration set and measured responses from the unknown sample) propagated in a non‐linear manner not amenable to statistical analysis. A new theory for error propagation is developed. The theory developed herein does not require estimates of the actual three sources of errors mentioned above and therefore is easy to implement. Data from near‐infrared reflectance spectrometry of wheat samples were used to test the equations derived from the theory. Complete agreement between the true prediction errors and those estimated from the theory is demonstrated.This publication has 17 references indexed in Scilit:
- A theoretical foundation for the PLS algorithmJournal of Chemometrics, 1987
- Influential Observations, High Leverage Points, and Outliers in Linear RegressionStatistical Science, 1986
- Error propagation and figures of merit for quantification by solving matrix equationsAnalytical Chemistry, 1986
- Multivariate calibration. I. Concepts and distinctionsTrAC Trends in Analytical Chemistry, 1984
- A Misuse of Ridge Regression in the Calibration of a Near Infrared Reflectance InstrumentJournal of the Royal Statistical Society Series C: Applied Statistics, 1983
- Matrix representations and criteria for selecting analytical wavelengths for multicomponent spectroscopic analysisAnalytical Chemistry, 1982
- Error propagation and optimal performance in multicomponent analysisAnalytical Chemistry, 1981
- Meaningful Multicollinearity MeasuresTechnometrics, 1978
- Cross-Validatory Estimation of the Number of Components in Factor and Principal Components ModelsTechnometrics, 1978
- Generalized Inverses, Ridge Regression, Biased Linear Estimation, and Nonlinear EstimationTechnometrics, 1970