Bayesian CART: Prior Specification and Posterior Simulation
- 1 March 2007
- journal article
- Published by Taylor & Francis in Journal of Computational and Graphical Statistics
- Vol. 16 (1), 44-66
- https://doi.org/10.1198/106186007x180426
Abstract
We present advances in Bayesian modeling and computation for CART (classification and regression tree) models. The modeling innovations include a formal prior distributional structure for tree generation—the pinball prior—that allows for the combination of an explicit specification of a distribution for both the tree size and the tree shape. The core computational innovations involve a novel Metropolis–Hastings method that can dramatically improve the convergence and mixing properties of MCMC methods of Bayesian CART analysis. Earlier MCMC methods have simulated Bayesian CART models using very local MCMC moves, proposing only small changes to a “current” CART model. Our new Metropolis–Hastings move makes large changes in the CART tree, but is at the same time local in that it leaves unchanged the partition of observations into terminal nodes. We evaluate the effectiveness of the proposed algorithm in two examples, one with a constructed data set and one concerning analysis of a published breast cancer dataset.Keywords
This publication has 12 references indexed in Scilit:
- Bayesian analysis of binary prediction tree models for retrospectively sampled outcomesBiostatistics, 2004
- Integrated modeling of clinical and gene expression information for personalized prediction of disease outcomesProceedings of the National Academy of Sciences, 2004
- Gene expression predictors of breast cancer outcomesThe Lancet, 2003
- The Multiple-Try Method and Local Optimization in Metropolis SamplingJournal of the American Statistical Association, 2000
- Hierarchical priors for Bayesian CART shrinkageStatistics and Computing, 2000
- Bayesian CART Model SearchJournal of the American Statistical Association, 1998
- A Bayesian CART algorithmBiometrika, 1998
- Convergence assessment techniques for Markov chain Monte CarloStatistics and Computing, 1998
- Markov Chain Monte Carlo Convergence Diagnostics: A Comparative ReviewJournal of the American Statistical Association, 1996
- Reversible jump Markov chain Monte Carlo computation and Bayesian model determinationBiometrika, 1995