An Introduction to Ensemble Methods for Data Analysis
- 1 February 2006
- journal article
- research article
- Published by SAGE Publications in Sociological Methods & Research
- Vol. 34 (3), 263-295
- https://doi.org/10.1177/0049124105283119
Abstract
This article provides an introduction to ensemble statistical procedures as a special case of algorithmic methods. The discussion begins with classification and regression trees (CART) as a didactic device to introduce many of the key issues. Following the material on CART is a consideration of cross-validation, bagging, random forests, and boosting. Major points are illustrated with analyses of real data.Keywords
This publication has 17 references indexed in Scilit:
- Developing a Practical Forecasting Screener for Domestic Violence IncidentsEvaluation Review, 2005
- Maximum Likelihood Regression TreesJournal of Computational and Graphical Statistics, 2004
- Bagging Equalizes InfluenceMachine Learning, 2004
- Analyzing baggingThe Annals of Statistics, 2002
- Stochastic gradient boostingComputational Statistics & Data Analysis, 2002
- Statistical Modeling: The Two Cultures (with comments and a rejoinder by the author)Statistical Science, 2001
- Additive logistic regression: a statistical view of boosting (With discussion and a rejoinder by the authors)The Annals of Statistics, 2000
- Combining Classifiers via DiscretizationJournal of the American Statistical Association, 1999
- A consistent combined classification ruleStatistics & Probability Letters, 1997
- Bagging predictorsMachine Learning, 1996