Peeking Inside the Black Box: Visualizing Statistical Learning With Plots of Individual Conditional Expectation
Top Cited Papers
- 2 January 2015
- journal article
- graphics and-visualization
- Published by Taylor & Francis in Journal of Computational and Graphical Statistics
- Vol. 24 (1), 44-65
- https://doi.org/10.1080/10618600.2014.907095
Abstract
This article presents individual conditional expectation (ICE) plots, a tool for visualizing the model estimated by any supervised learning algorithm. Classical partial dependence plots (PDPs) help visualize the average partial relationship between the predicted response and one or more features. In the presence of substantial interaction effects, the partial response relationship can be heterogeneous. Thus, an average curve, such as the PDP, can obfuscate the complexity of the modeled relationship. Accordingly, ICE plots refine the PDP by graphing the functional relationship between the predicted response and the feature for individual observations. Specifically, ICE plots highlight the variation in the fitted values across the range of a covariate, suggesting where and to what extent heterogeneities might exist. In addition to providing a plotting suite for exploratory analysis, we include a visual test for additive structure in the data-generating model. Through simulated examples and real datasets, we demonstrate how ICE plots can shed light on estimated models in ways PDPs cannot. Procedures outlined are available in the R package ICEbox.Keywords
All Related Versions
This publication has 16 references indexed in Scilit:
- The Personalized Advantage Index: Translating Research on Prediction into Individualized Treatment Recommendations. A DemonstrationPLOS ONE, 2014
- Statistical Procedures for Forecasting Criminal BehaviorCriminology & Public Policy, 2013
- BART: Bayesian additive regression treesThe Annals of Applied Statistics, 2010
- Statistical inference for exploratory data analysis and model diagnosticsPhilosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, 2009
- A working guide to boosted regression treesJournal of Animal Ecology, 2008
- Stochastic gradient boostingComputational Statistics & Data Analysis, 2002
- Random ForestsMachine Learning, 2001
- Generalized Additive ModelsStatistical Science, 1986
- Estimating Optimal Transformations for Multiple Regression and CorrelationJournal of the American Statistical Association, 1985
- A Variable Span SmootherPublished by Defense Technical Information Center (DTIC) ,1984