Skip to main content
Log in

Feature Screening for Nonparametric and Semiparametric Models with Ultrahigh-Dimensional Covariates

  • Published:
Journal of Systems Science and Complexity Aims and scope Submit manuscript

Abstract

This paper considers the feature screening and variable selection for ultrahigh dimensional covariates. The new feature screening procedure base on conditional expectation which is used to differentiate whether an explanatory variable contributes to a response variable or not, without requiring a specific parametric form of the underlying data model. The authors estimate the marginal conditional expectation by kernel regression estimator. The proposed method is showed to have sure screen property. The authors propose an iterative kernel estimator algorithm to reduce the ultrahigh dimensionality to an appropriate scale. Simulation results and real data analysis demonstrate the proposed method works well and performs better than competing methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Fan J and Lü J, Sure independence screening for ultrahigh dimensional feature space (with discussion), Journal of the Royal Statistical Society: Series B, 2008, 70: 849–911.

    Article  MathSciNet  Google Scholar 

  2. Hastie T, Tibshirani R, and Friedman J, The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd ed., Springer, New York, 2009.

    Book  MATH  Google Scholar 

  3. Buhlmann P and van de Geer S, Statistics for High-Dimensional Data: Methods, Theory and Applications, Springer, Heidelberg, 2011.

    Book  MATH  Google Scholar 

  4. Hall P and Miller H, Using generalized correlation to effect variable selection in very high dimensional problems, Journal of Computational and Graphical Statistics, 2009, 18: 533–550.

    Article  MathSciNet  Google Scholar 

  5. Fan J and Song R, Sure independence screening in generalized linear models with NPdimensionality, Annals of Statistics, 2010, 38: 3567–3604.

    Article  MathSciNet  MATH  Google Scholar 

  6. Fan J, Feng Y, and Song R, Nonparametric independence screening in sparse ultra-high-dimensional additive models, Journal of the American Statistics Association, 2011, 106: 544–557.

    Article  MathSciNet  MATH  Google Scholar 

  7. Fan J and Gijbels I, Local Polynomial Modeling and Its Applications, Chapman and Hall, New York, 1996.

    MATH  Google Scholar 

  8. Härdle W, Applied nonparametric regression, Econometric Society Monographs 19, Cambridge University Press, Cambridge, 1990.

    Book  Google Scholar 

  9. Wang H, forward regression for ultra-high dimension variable screening, Journal of the American Statistics Association, 2009, 104: 1512–1524.

    Article  MATH  Google Scholar 

  10. Fan J and Li R, Variable selection via nonconcave penalized likelihood and it oracle properties, Journal of the American Statistics Association, 2001, 96: 1348–1360.

    Article  MathSciNet  MATH  Google Scholar 

  11. Fan J, Samworth R, and Wu Y, Ultra-dimensional variable selection via independent learning: Beyond the linear model, Journal of Machine Learning Research, 2009, 10: 1829–1853.

    Google Scholar 

  12. Ruppert D, Sheather S, and Wand M, An effective bandwidth selector for local least squares regression, Journal of the American Statistics Association, 1995, 90: 1257–1270.

    Article  MathSciNet  MATH  Google Scholar 

  13. Ravikumar P, Liu H, Lafferty J, et al., Spam: Sparse additive models, Journal of the Royal Statistical Society: Series B, 2009, 71: 1009–1030.

    Article  MathSciNet  Google Scholar 

  14. Chiang A, Beck J, Yen H, et al, Homozygosity mapping with SNP arrays identifies TRIM32, an E3 ubiquitin ligase, as a bardetiedl syndrome gene (BBS11), Proceedings of the National Academy of Sciences, 2006, 103: 6287–6292.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Junying Zhang.

Additional information

The research was supported in part by the National Natural Science Foundation of China under Grant Nos. 11571112, 11501372, 11571148, 11471160, Doctoral Fund of Ministry of Education of China under Grant No. 20130076110004, Program of Shanghai Subject Chief Scientist under Grant No. 14XD1401600, and the 111 Project of China under Grant No. B14019.

This paper was recommended for publication by Editor SUN Liuquan.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhang, J., Zhang, R. & Zhang, J. Feature Screening for Nonparametric and Semiparametric Models with Ultrahigh-Dimensional Covariates. J Syst Sci Complex 31, 1350–1361 (2018). https://doi.org/10.1007/s11424-017-6310-6

Download citation

  • Received:

  • Revised:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11424-017-6310-6

Keywords

Navigation