TY - JOUR
T1 - Spline estimator for simultaneous variable selection and constant coefficient identification in high-dimensional generalized varying-coefficient models
AU - Lian, Heng
AU - Meng, Jie
AU - Zhao, Kaifeng
PY - 2015/10
Y1 - 2015/10
N2 - In this paper, we are concerned with two common and related problems for generalized varying-coefficient models, variable selection and constant coefficient identification. Starting with a specification of generalized varying-coefficient models assuming possible nonlinear interactions between the index variable and all other predictors, we propose a polynomial-spline based procedure that simultaneously eliminates irrelevant predictors and identifies predictors that do not interact with the index variable. Our approach is based on a double-penalization strategy where two penalty functions are used for these two related purposes respectively, in a single functional. In a “large p, small n” setting, we demonstrate the convergence rates of the estimator under suitable regularity assumptions. Based on its previous success on parametric models, we use the extended Bayesian information criterion (eBIC) to automatically choose the regularization parameters. Finally, post-penalization estimator is proposed to further reduce the bias of the resulting estimator. Monte Carlo simulations are conducted to examine the finite sample performance of the proposed procedures and an application to a leukemia dataset is presented.
AB - In this paper, we are concerned with two common and related problems for generalized varying-coefficient models, variable selection and constant coefficient identification. Starting with a specification of generalized varying-coefficient models assuming possible nonlinear interactions between the index variable and all other predictors, we propose a polynomial-spline based procedure that simultaneously eliminates irrelevant predictors and identifies predictors that do not interact with the index variable. Our approach is based on a double-penalization strategy where two penalty functions are used for these two related purposes respectively, in a single functional. In a “large p, small n” setting, we demonstrate the convergence rates of the estimator under suitable regularity assumptions. Based on its previous success on parametric models, we use the extended Bayesian information criterion (eBIC) to automatically choose the regularization parameters. Finally, post-penalization estimator is proposed to further reduce the bias of the resulting estimator. Monte Carlo simulations are conducted to examine the finite sample performance of the proposed procedures and an application to a leukemia dataset is presented.
KW - B-spline basis
KW - Diverging parameters
KW - Group lasso
KW - Quasi-likelihood
UR - http://www.scopus.com/inward/record.url?scp=84947753651&partnerID=8YFLogxK
U2 - 10.1016/j.jmva.2015.06.011
DO - 10.1016/j.jmva.2015.06.011
M3 - Article
SN - 0047-259X
VL - 141
SP - 81
EP - 103
JO - Journal of Multivariate Analysis
JF - Journal of Multivariate Analysis
ER -