Nonparametric lack-of-fit testing and consistent variable selection

Adriano Zanin Zambom, Michael G. Akritas

Research output: Contribution to journalArticlepeer-review

13 Scopus citations

Abstract

Let X be a d-dimensional vector of covariates and Y be the response variable. Under the nonparametric model Y = m(X) + σ(X)ε we develop an ANOVA-type test for the null hypothesis that a particular coordinate of X has no influence on the regression function. The asymptotic distribution of the test statistic, using residuals based on local polynomial regression, is established under the null hypothesis and local alternatives. Simulations suggest that the test outperforms existing procedures in heteroscedastic settings. Using p-values from this test, a variable selection method based on False Discovery Rate corrections is proposed, and proved to be consistent in estimating the set of indices corresponding to the significant covariates. Simulations suggest that, under a sparse model, dimension reduction techniques can help avoid the curse of dimensionality. We also propose a backward elimination version of this procedure, called BEAMS (Backward Elimination ANOVA-type Model Selection), which performs competitively against well-established procedures in linear regression settings, and outperforms them in nonparametric settings. A data set is analyzed.

Original languageEnglish (US)
Pages (from-to)1837-1858
Number of pages22
JournalStatistica Sinica
Volume24
Issue number4
DOIs
StatePublished - Oct 2014

All Science Journal Classification (ASJC) codes

  • Statistics and Probability
  • Statistics, Probability and Uncertainty

Fingerprint Dive into the research topics of 'Nonparametric lack-of-fit testing and consistent variable selection'. Together they form a unique fingerprint.

Cite this