Skip to main content

Detecting Multiple Influential Observations in High Dimensional Linear Regression

  • Conference paper
  • First Online:
Advanced Intelligent Computing Theories and Applications (ICIC 2015)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9227))

Included in the following conference series:

  • 2980 Accesses

Abstract

In this paper, we consider the detection of multiple influential observations in high dimensional regression, where the p number of covariates is much larger than sample size n. Detection of influential observations in high dimensional regression is challenging. In the case of single influential observation, Zhao et al. (2013) developed a method called High dimensional Influence Measure (HIM). However, the result of HIM is not applicable to the case of multiple influential observations, where the detection of influential observations is much more complicated than the case of single influential observation. We propose in this paper a new method based on the multiple deletion to detect the multiple influential.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Cook, R.D.: Detection of influential observation in linear regression. Technometrics 19, 15–18 (1977)

    Article  MathSciNet  Google Scholar 

  2. Behnken, D.W., Draper, N.R.: Residuals and their variance patterns. Technometrics 14, 101–111 (1972)

    Article  Google Scholar 

  3. Belsley, D.A., Kuh, E., Welsch, R.E.: The Grid: Regression Diagnostics: Identifying Influential Data and Sources of Collinearity. Wiley, New York (2005)

    MATH  Google Scholar 

  4. Chatterjee, S., Hadi, A.S.: The Grid: Sensitivity Analysis in Linear Regression. Wiley, New York (1988)

    Book  Google Scholar 

  5. Pena, D.: A new statistic for influence in linear regression. Technometrics 47(1), 1–12 (2005)

    Article  MathSciNet  Google Scholar 

  6. Pena, D.: Measures of Influence and Sensitivity in Linear Regression, pp. 523–536. Springer, London (2006). Springer Handbook of Engineering Statistics

    Google Scholar 

  7. Nurunnabi, A.A.M., Imon, A.H.M.R., Nasser, M.: A diagnostic measure for influential observations in linear regression. Commun. Stat. Theor. Methods 40, 1169–1183 (2011)

    Article  MathSciNet  MATH  Google Scholar 

  8. Nurunnabi, A.A.M., Hadi, A.S., Imon, A.H.M.R.: Procedures for the identification of multiple influential observations in linear regression. J. Appl. Stat. 41, 1315–1331 (2014)

    Article  MathSciNet  Google Scholar 

  9. Imon, A.R., Hadi, A.S.: Identification of multiple outliers in logistic regression. Commun. Stat. Theor. Methods 37, 1697–1709 (2008)

    Article  MathSciNet  MATH  Google Scholar 

  10. Imon, A.R., Hadi, A.S.: Identification of multiple high leverage points in logistic regression. J. Appl. Stat. 40, 2601–2616 (2013)

    Article  MathSciNet  Google Scholar 

  11. Zakaria, A., Howard, N.K., Nkansah, B.K.: On the detection of influential outliers in linear regression analysis. Am. J. Theor. Appl. Stat. 3, 100–106 (2014)

    Article  Google Scholar 

  12. Tibshirani, R.: Regression shrinkage and selection via the Lasso. J. Roy. Stat. Soc. B 58, 267–288 (1996)

    MathSciNet  MATH  Google Scholar 

  13. Fan, J., Lv, J.: Sure independence screening for ultrahigh dimensional feature space. J. Roy. Stat. Soc. Ser. B (Stat. Methodol.) 70, 849–911 (2008)

    Article  MathSciNet  Google Scholar 

  14. Wang, H., Li, G., Jiang, G.: Robust regression shrinkage and consistent variable selection through the LAD-Lasso. J. Bus. Econ. Stat. 25, 347–355 (2007)

    Article  MathSciNet  Google Scholar 

  15. She, Y., Owen, A.B.: Outlier detection using nonconvex penalized regression. J. Am. Stat. Assoc. 106, 626–639 (2011)

    Article  MathSciNet  MATH  Google Scholar 

  16. Rahmatullah Imon, A.H.M.: Identifying multiple influential observations in linear regression. J. Appl. Stat. 32, 929–946 (2005)

    Article  MathSciNet  MATH  Google Scholar 

  17. Pan, J.X., Fung, W.K., Fang, K.T.: Multiple outlier detection in multivariate data using projection pursuit techniques. J. Stat. Plann. Infer. 83(1), 153–167 (2000)

    Article  MathSciNet  Google Scholar 

  18. Benjamini, Y., Hochberg, Y.: Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. Roy. Stat. Soc.: Ser. B (Methodol.) 57, 289–300 (1995)

    MathSciNet  MATH  Google Scholar 

Download references

Acknowledgements

The research of Zhao was supposed by National Science Foundation of China (No. 11471030, 11101022) and the Fundamental Research Funds for the Central Universities.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Junlong Zhao .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Zhao, J., Zhang, Y., Niu, L. (2015). Detecting Multiple Influential Observations in High Dimensional Linear Regression. In: Huang, DS., Han, K. (eds) Advanced Intelligent Computing Theories and Applications. ICIC 2015. Lecture Notes in Computer Science(), vol 9227. Springer, Cham. https://doi.org/10.1007/978-3-319-22053-6_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-22053-6_6

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-22052-9

  • Online ISBN: 978-3-319-22053-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics