Variable selection in semiparametric regression modeling
Open Access
- 1 February 2008
- journal article
- Published by Institute of Mathematical Statistics in The Annals of Statistics
- Vol. 36 (1) , 261-286
- https://doi.org/10.1214/009053607000000604
Abstract
In this paper, we are concerned with how to select significant variables in semiparametric modeling. Variable selection for semiparametric regression models consists of two components: model selection for nonparametric components and selection of significant variables for the parametric portion. Thus, semiparametric variable selection is much more challenging than parametric variable selection (e.g., linear and generalized linear models) because traditional variable selection procedures including stepwise regression and the best subset selection now require separate model selection for the nonparametric components for each submodel. This leads to a very heavy computational burden. In this paper, we propose a class of variable selection procedures for semiparametric regression models using nonconcave penalized likelihood. We establish the rate of convergence of the resulting estimate. With proper choices of penalty functions and regularization parameters, we show the asymptotic normality of the resulting estimate and further demonstrate that the proposed procedures perform as well as an oracle procedure. A semiparametric generalized likelihood ratio test is proposed to select significant variables in the nonparametric component. We investigate the asymptotic behavior of the proposed test and demonstrate that its limiting null distribution follows a chi-square distribution which is independent of the nuisance parameters. Extensive Monte Carlo simulation studies are conducted to examine the finite sample performance of the proposed variable selection procedures.
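The abstract does not name a particular penalty function. As an illustration only, the sketch below evaluates the SCAD penalty of Fan and Li (2001), a widely used nonconcave penalty; treating it as the penalty used here is an assumption, not something the abstract states. The function name `scad_penalty` and the default `a = 3.7` (the value conventionally suggested for SCAD) are choices made for this example.

```python
import numpy as np

def scad_penalty(beta, lam, a=3.7):
    """Evaluate the SCAD penalty p_lam(|beta|) elementwise.

    Illustrative sketch: the abstract only says "nonconcave penalized
    likelihood"; SCAD is assumed here as one common example.
    Requires lam > 0 and a > 2.
    """
    theta = np.abs(np.asarray(beta, dtype=float))
    # Region 1 (|beta| <= lam): behaves like the L1 (lasso) penalty.
    linear = lam * theta
    # Region 2 (lam < |beta| <= a*lam): quadratic blend that tapers off.
    quadratic = (2 * a * lam * theta - theta**2 - lam**2) / (2 * (a - 1))
    # Region 3 (|beta| > a*lam): constant, so large coefficients are not shrunk further.
    constant = lam**2 * (a + 1) / 2
    return np.where(theta <= lam, linear,
                    np.where(theta <= a * lam, quadratic, constant))

# Penalty values for a small, a moderate, and a large coefficient.
values = scad_penalty([0.5, 2.0, 10.0], lam=1.0)
```

The flat third region is the feature usually credited for oracle-type behaviour: coefficients that are large relative to the regularization parameter incur no additional penalty, while small ones are pushed toward zero as under an L1 penalty.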