Abstract
Various forms of penalized estimators with good statistical and computational properties have been proposed for variable selection that respects the grouping structure in the variables. The attractive properties of these shrinkage and selection estimators, however, depend critically on the choice of the tuning parameter. One method for choosing the tuning parameter is via information criteria, such as the Bayesian information criterion (BIC). In this paper, we consider the problem of consistent tuning parameter selection in high-dimensional generalized linear regression with grouping structures. We extend the results of the extended regularized information criterion (ERIC) to group selection methods involving concave penalties and then investigate selection consistency when the number of variables in each group diverges. Moreover, we show that the ERIC-type selector enables consistent identification of the true model and that the resulting estimator possesses the oracle property even when the number of groups is much larger than the sample size. Simulations show that the ERIC-type selector can significantly outperform the BIC and cross-validation selectors in selecting the true grouped variables, and an empirical example is given to illustrate its use.
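To illustrate the workflow the abstract describes, the following is a minimal, self-contained sketch of tuning-parameter selection for a linear group lasso via an ERIC-type information criterion. It is not the paper's method: the criterion form `n*log(RSS/n) + 2*nu*df*log(n/lam)` with `df` taken as the number of nonzero coefficients, the constant `nu`, the λ grid, and the simulated data are all illustrative assumptions, and the fit uses a generic proximal-gradient (ISTA) solver rather than the group-descent algorithms cited in the references.

```python
import numpy as np

rng = np.random.default_rng(0)

# Simulated grouped design: 5 groups of 3 predictors; groups 0 and 1 are active.
n, group_size, n_groups = 200, 3, 5
p = group_size * n_groups
groups = [np.arange(g * group_size, (g + 1) * group_size) for g in range(n_groups)]
X = rng.standard_normal((n, p))
beta_true = np.zeros(p)
beta_true[groups[0]] = 2.0
beta_true[groups[1]] = -1.5
y = X @ beta_true + 0.5 * rng.standard_normal(n)

def group_lasso_fit(X, y, lam, groups, n_iter=1000):
    """Proximal gradient (ISTA) for the linear group lasso:
       (1/2n)||y - Xb||^2 + lam * sum_g sqrt(p_g) * ||b_g||_2."""
    n_obs = len(y)
    step = 1.0 / np.linalg.eigvalsh(X.T @ X / n_obs)[-1]  # 1 / Lipschitz const
    b = np.zeros(X.shape[1])
    for _ in range(n_iter):
        b = b - step * X.T @ (X @ b - y) / n_obs          # gradient step
        for g in groups:                                   # block soft-threshold
            norm = np.linalg.norm(b[g])
            thresh = step * lam * np.sqrt(len(g))
            b[g] = 0.0 if norm <= thresh else (1 - thresh / norm) * b[g]
    return b

def eric_type(X, y, b, lam, nu=0.5):
    """ERIC-type criterion (illustrative form, not the paper's exact definition):
       n*log(RSS/n) + 2*nu*df*log(n/lam), df = number of nonzero coefficients."""
    n_obs = len(y)
    rss = np.sum((y - X @ b) ** 2)
    df = np.count_nonzero(b)
    return n_obs * np.log(rss / n_obs) + 2.0 * nu * df * np.log(n_obs / lam)

# Fit over a grid of tuning parameters and pick the criterion minimizer.
lams = np.logspace(-2, 0.5, 25)
fits = [group_lasso_fit(X, y, lam, groups) for lam in lams]
crit = [eric_type(X, y, b, lam) for b, lam in zip(fits, lams)]
beta_hat = fits[int(np.argmin(crit))]
selected = [g_idx for g_idx, g in enumerate(groups)
            if np.linalg.norm(beta_hat[g]) > 1e-8]
print("selected groups:", selected)
```

With a strong signal like this, the criterion minimizer typically retains the two active groups while excluding the noise groups, which is the selection-consistency behavior the paper establishes theoretically; a BIC-style selector would replace the `log(n/lam)` factor with `log(n)`.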
References
Breheny P, Huang J. Group descent algorithms for nonconvex penalized linear and logistic regression models with grouped predictors. Stat Comput, 2015, 25: 173–187
Chen J, Chen Z. Extended Bayesian information criteria for model selection with large model spaces. Biometrika, 2008, 95: 759–771
Cortez P, Silva A M G. Using data mining to predict secondary school student performance. In: Proceedings of 5th Annual Future Business Technology Conference. http://hdl.handle.net/1822/8024, 2008
Fan J, Li R. Variable selection via nonconcave penalized likelihood and its oracle properties. J Amer Statist Assoc, 2001, 96: 1348–1360
Fan J, Peng H. Nonconcave penalized likelihood with a diverging number of parameters. Ann Statist, 2004, 32: 928–961
Fan J, Song R. Sure independence screening in generalized linear models with NP-dimensionality. Ann Statist, 2010, 38: 3567–3604
Fan Y, Tang C Y. Tuning parameter selection in high dimensional penalized likelihood. J R Stat Soc Ser B Stat Methodol, 2013, 75: 531–552
Friedman J, Hastie T, Tibshirani R. A note on the group LASSO and a sparse group LASSO. arXiv:1001.0736, 2010
Gao X, Carroll R J. Data integration with high dimensionality. Biometrika, 2017, 104: 251–272
Huang J, Breheny P, Ma S. A selective review of group selection in high-dimensional models. Statist Sci, 2012, 27: 481–499
Huang J, Ma S, Zhang C H. Adaptive LASSO for sparse high-dimensional regression models. Statist Sinica, 2008, 18: 1603–1618
Hui F K C, Warton D I, Foster S D. Tuning parameter selection for the adaptive LASSO using ERIC. J Amer Statist Assoc, 2015, 110: 262–269
Kim Y, Kwon S, Choi H. Consistent model selection criteria on high dimensions. J Mach Learn Res, 2012, 13: 1037–1057
McCullagh P, Nelder J A. Generalized Linear Models. Boca Raton: CRC Press, 1989
Meier L, Van De Geer S, Bühlmann P. The group LASSO for logistic regression. J R Stat Soc Ser B Stat Methodol, 2008, 70: 53–71
Wang H, Leng C. A note on adaptive group LASSO. Comput Statist Data Anal, 2008, 52: 5277–5286
Wang H, Li B, Leng C. Shrinkage tuning parameter selection with a diverging number of parameters. J R Stat Soc Ser B Stat Methodol, 2009, 71: 671–683
Wang L, Chen G, Li H. Group SCAD regression analysis for microarray time course gene expression data. Bioinformatics, 2007, 23: 1486–1494
Wei F, Huang J. Consistent group selection in high-dimensional linear regression. Bernoulli, 2010, 16: 1369–1384
Yuan M, Lin Y. Model selection and estimation in regression with grouped variables. J R Stat Soc Ser B Stat Methodol, 2006, 68: 49–67
Zhang Y, Li R, Tsai C L. Regularization parameter selections via generalized information criterion. J Amer Statist Assoc, 2010, 105: 312–323
Zhang Y, Shen X. Model selection procedure for high-dimensional data. Stat Anal Data Min, 2010, 3: 350–358
Zou H. The adaptive Lasso and its oracle properties. J Amer Statist Assoc, 2006, 101: 1418–1429
Zou H, Li R. One-step sparse estimates in nonconcave penalized likelihood models. Ann Statist, 2008, 36: 1509–1533
Zou H, Zhang H H. On the adaptive elastic-net with a diverging number of parameters. Ann Statist, 2009, 37: 1733–1751
Acknowledgements
This work was supported by the National Natural Science Foundation of China (Grant Nos. 11571337 and 71631006) and the Fundamental Research Funds for the Central Universities (Grant No. WK2040160028).
Cite this article
Li, Y., Wu, Y. & Jin, B. Consistent tuning parameter selection in high-dimensional group-penalized regression. Sci. China Math. 62, 751–770 (2019). https://doi.org/10.1007/s11425-017-9189-9
Keywords
- Bayesian information criterion
- group selection
- penalized likelihood
- regularization parameter
- ultra-high dimensionality