Abstract
We are concerned with robust estimation procedures to estimate the parameters in partially linear models with large-dimensional covariates. To enhance the interpretability, we suggest implementing a nonconcave regularization method in the robust estimation procedure to select important covariates from the linear component. We establish the consistency for both the linear and the nonlinear components when the covariate dimension diverges at the rate of \(o\left( {\sqrt n } \right)\), where n is the sample size. We show that the robust estimate of linear component performs asymptotically as well as its oracle counterpart which assumes the baseline function and the unimportant covariates were known a priori. With a consistent estimator of the linear component, we estimate the nonparametric component by a robust local linear regression. It is proved that the robust estimate of nonlinear component performs asymptotically as well as if the linear component were known in advance. Comprehensive simulation studies are carried out and an application is presented to examine the finite-sample performance of the proposed procedures.
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
References
Bai Z D, Rao C R, Wu Y. M-estimation of multivariate linear regression parameters under a convex discrepancy function. Statist Sinica, 1992, 2: 237–254
Bai Z D, Wu Y. Limit behavior of M-estimators or regression coefficients in high dimensional linear models I: Scaledependent case. J Multivariate Anal, 1994, 51: 211–239
Boente G, He X M, Zhou J H. Robust estimates in generalized partially linear models. Ann Statist, 2006, 34: 2856–2878
Carroll R J, Fan J, Gijbels I, et al. Generalized partially linear single-index models. J Amer Statist Assoc, 1997, 92: 477–489
Chen H. Convergence rates for parametric components in a partly linear model. Ann Statist, 1988, 16: 136–146
Engle R F, Granger C W J, Rice J, et al. Semiparametric estimates of the relation between weather and electricity sales. J Amer Statist Assoc, 1986, 81: 310–320
Fan J, Gijbels I. Local Polynomial Modeling and its Applications. New York: Chapman and Hall, 1996.
Fan J, Hu T C, Truong Y K. Robust nonparametric function estimation. Scandinavian J Statist, 1994, 21: 433–446
Fan J, Li R. Variable selection via nonconcave penalized likelihood and it oracle properties. J Amer Statist Assoc, 2001, 96: 1348–1360
Fan J, Li R. New estimation and model selection procedures for semi-parametric modeling in longitudinal data analysis. J Amer Statist Assoc, 2004, 99: 710–723
Fan J, Peng H. Nonconcave penalized likelihood with a diverging number of parameters. Ann Statist, 2004, 32: 928–961
Hamilton S A, Truong Y K. Local linear estimation in partly linear models. J Multivariate Anal, 1997, 60: 1–19
Härdle W, Liang H, Gao J T. Partial Linear Models. New York: Springer-Verlag, 2000
He X M, Fung W K, Zhu Z Y. Robust estimation in generalized partial linear models for clustered data. J Amer Statist Assoc, 2005, 100: 1176–1184
Heckman N E. Spline smoothing in a partly linear model. J Royal Statist Soc Ser B, 1986, 48: 244–248
Huang J, Xie H. Asymptotic oracle properties of SCAD-penalized least square estimators. In: Institute of Mathematical Statistics Lecture Notes Monograph Seriess vol. 55. Asymptotics: Particles, Processes and Inverse Problems. Beachwood: IMS, 2007, 149–166
Huber P J. Robust regression: Asymptotics, conjectures and Monte Carlo. Ann Statist, 1973, 1: 799–821
Hunter D, Li R. Variable selection using MM algorithms. Ann Statist, 2005, 33: 1617–1642
Johnson R W. Kiplinger’s personal finance. J Statist Education, 2003, 57: 104–123
Li G R, Peng H, Zhu L X. Nonconcave penalized M-estimation with diverging number of parameters. Statist Sinica, 2011, 21: 391–420
Liang H, Li R. Variable selection for partially linear models with measurement errors. J Amer Statist Assoc, 2009, 104: 234–248
Mammen E. Asymptotics with increasing dimension for robust regression with application to the bootstrap. Ann Statist, 1989, 17: 382–400
Qin G Y, Zhu Z Y. Robustified maximum likelihood estimation in generalized partial linear mixed model for longitudinal data. Biometrics, 2009, 65: 52–59
Rao B L S P. Nonparametric Functional Estimation. Orlando: Academic Press, 1983
Robinson P M. Root-n-consistent semiparametric regression. Econometrika, 1988, 56: 931–954
Speckman P. Kernel smoothing in partial linear models. J Royal Statist Soc Ser B, 1988, 50: 413–436
Wang H, Li B, Leng C. Shrinkage tuning parameter selection with a diverging number of parameters. J Royal Statist Soc Ser B, 2009, 71: 671–683
Wang H, Li G, Jiang G. Robust regression shrinkage and consistent variable selection via the LAD-LASSO. J Business Economics Statist, 2007, 25: 347–355
Wang H, Li R, Tsai C L. Tuning parameter selectors for the smoothly clipped absolute deviation method. Biometrika, 2007, 94: 553–568
Wang L, Li R. Weighted Wilcoxon-type smoothly clipped absolute deviation method. Biometrics, 2009, 65: 564–571
Wu W B. M-estimation of linear models with dependent errors. Ann Statist, 2007, 35: 495–521
Xie H L, Huang J. SCAD-penalized regression in high-dimensional partially linear models. Ann Statist, 2009, 37: 673–696
Zhu L X, Fang K T. Asymptotics for kernel estimation of sliced inverse regression. Ann Statist, 1996, 3: 1053–1068
Zou H, Yuan M. Composite quantile regression and the oracle model selection theory. Ann Statist, 2008, 36: 1108–1126
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Zhu, L., Li, R. & Cui, H. Robust estimation for partially linear models with large-dimensional covariates. Sci. China Math. 56, 2069–2088 (2013). https://doi.org/10.1007/s11425-013-4675-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11425-013-4675-0
Keywords
- partially linear models
- robust model selection
- smoothly clipped absolute deviation (SCAD)
- semiparametric models