Regularized rank-based estimation of high-dimensional nonparanormal graphical models
Open Access
- 1 October 2012
- journal article
- Published by Institute of Mathematical Statistics in The Annals of Statistics
- Vol. 40 (5)
- https://doi.org/10.1214/12-aos1041
Abstract
A sparse precision matrix can be directly translated into a sparse Gaussian graphical model under the assumption that the data follow a joint normal distribution. This neat property makes high-dimensional precision matrix estimation very appealing in many applications. However, in practice we often face nonnormal data, and variable transformation is often used to achieve normality. In this paper we consider the nonparanormal model that assumes that the variables follow a joint normal distribution after a set of unknown monotone transformations. The nonparanormal model is much more flexible than the normal model while retaining the good interpretability of the latter in that each zero entry in the sparse precision matrix of the nonparanormal model corresponds to a pair of conditionally independent variables. In this paper we show that the nonparanormal graphical model can be efficiently estimated by using a rank-based estimation scheme which does not require estimating these unknown transformation functions. In particular, we study the rank-based graphical lasso, the rank-based neighborhood Dantzig selector and the rank-based CLIME. We establish their theoretical properties in the setting where the dimension is nearly exponentially large relative to the sample size. It is shown that the proposed rank-based estimators work as well as their oracle counterparts defined with the oracle data. Furthermore, the theory motivates us to consider the adaptive version of the rank-based neighborhood Dantzig selector and the rank-based CLIME that are shown to enjoy graphical model selection consistency without assuming the irrepresentable condition for the oracle and rank-based graphical lasso. Simulated and real data are used to demonstrate the finite performance of the rank-based estimators.Comment: Published in at http://dx.doi.org/10.1214/12-AOS1041 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.orgKeywords
This publication has 33 references indexed in Scilit:
- High-dimensional semiparametric Gaussian copula graphical modelsThe Annals of Statistics, 2012
- High-dimensional covariance estimation by minimizing ℓ1-penalized log-determinant divergenceElectronic Journal of Statistics, 2011
- The adaptive and the thresholded Lasso for potentially misspecified models (and a lower bound for the Lasso)Electronic Journal of Statistics, 2011
- Sparsistency and rates of convergence in large covariance matrix estimationThe Annals of Statistics, 2009
- Partial Correlation Estimation by Joint Sparse Regression ModelsJournal of the American Statistical Association, 2009
- Sparse permutation invariant covariance estimationElectronic Journal of Statistics, 2008
- Sparse inverse covariance estimation with the graphical lassoBiostatistics, 2007
- Estimation of copula-based semiparametric time series modelsJournal of Econometrics, 2005
- Robust estimation and outlier detection with correlation coefficientsBiometrika, 1975
- Ordinal Measures of AssociationJournal of the American Statistical Association, 1958