Abstract
Cross-validation is a statistical procedure that produces an estimate of forecast skill which is less biased than the usual hindcast skill estimates. The cross-validation method systematically deletes one or more cases from a dataset, derives a forecast model from the remaining cases, and tests it on the deleted case or cases. The procedure is nonparametric and can be applied to any automated model-building technique. It can also provide important diagnostic information about influential cases in the dataset and the stability of the model. Two experiments were conducted using cross-validation to estimate forecast skill in different predictive models of North Pacific sea surface temperatures (SSTs). The results indicate that bias, or artificial predictability (defined here as the difference between the usual hindcast skill and the forecast skill estimated by cross-validation), increases with each decision drawn from the data, whether screening potential predictors or fixing the value of a coefficient.
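A minimal sketch of the leave-one-out procedure described above, assuming a simple least-squares regression as the forecast model and correlation-based skill scores; the variable names and synthetic data are illustrative, not drawn from the SST experiments:

```python
import numpy as np

def hindcast_skill(x, y):
    """Fit the model on all cases and score it on the same cases (the usual hindcast skill)."""
    coeffs = np.polyfit(x, y, deg=1)        # least-squares fit: y ~ a*x + b
    y_hat = np.polyval(coeffs, x)
    return np.corrcoef(y, y_hat)[0, 1]      # correlation of fitted vs. observed values

def cross_validated_skill(x, y):
    """Leave-one-out cross-validation: delete one case, refit, forecast the deleted case."""
    n = len(y)
    forecasts = np.empty(n)
    for i in range(n):
        keep = np.arange(n) != i            # all cases except case i
        coeffs = np.polyfit(x[keep], y[keep], deg=1)
        forecasts[i] = np.polyval(coeffs, x[i])
    return np.corrcoef(y, forecasts)[0, 1]  # skill of the independent forecasts

# Illustrative synthetic predictor/predictand series (hypothetical, not the paper's data)
rng = np.random.default_rng(0)
x = rng.standard_normal(40)
y = 0.5 * x + rng.standard_normal(40)

r_hindcast = hindcast_skill(x, y)
r_forecast = cross_validated_skill(x, y)
print(f"hindcast skill:            {r_hindcast:.3f}")
print(f"cross-validated skill:     {r_forecast:.3f}")
print(f"artificial predictability: {r_hindcast - r_forecast:.3f}")
```

The difference printed on the last line corresponds to the bias, or artificial predictability, defined in the abstract: the gap between skill measured on the fitting data and skill measured on cases withheld from the fit.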