Skip to main content

Unsupervised Bump Hunting Using Principal Components

  • Chapter
  • First Online:

Part of the book series: Contributions to Statistics ((CONTRIB.STAT.))

Abstract

Principal Components Analysis is a widely used technique for dimension reduction and characterization of variability in multivariate populations. Our interest lies in studying when and why the rotation to principal components can be used effectively within a response-predictor set relationship in the context of mode hunting. Specifically focusing on the Patient Rule Induction Method (PRIM), we first develop a fast version of this algorithm (fastPRIM) under normality which facilitates the theoretical studies to follow. Using basic geometrical arguments, we then demonstrate how the Principal Components rotation of the predictor space alone can in fact generate improved mode estimators. Simulation results are used to illustrate our findings.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   79.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   99.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   139.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Cox, D.: Notes on some aspects of regression analysis. J. R. Stat. Soc. Ser. A. 131, 265/279 (1968)

    Google Scholar 

  2. Dazard, J.-E., Rao, J.S.: Local sparse bump hunting. J. Comput. Graph. Stat. 19 (4), 900–929 (2010)

    Article  Google Scholar 

  3. Dazard, J.-E., Rao, J., Markowitz, S.: Local sparse bump hunting reveals molecular heterogeneity of colon tumors. Stat. Med. 31 (11–12), 1203–1220 (2012)

    Article  MathSciNet  Google Scholar 

  4. Díaz-Pachón, D., Rao, J., Dazard, J.-E.: On the explanatory power of principal components (2014). arXiv:1404.4917v1

    Google Scholar 

  5. Friedman, J., Fisher, N.: Bump hunting in high-dimensional data. Stat. Comput. 9, 123–143 (1999)

    Article  Google Scholar 

  6. Hadi, S., Ling, R.: Some cautionary notes on the use of principal components regression. Am. Stat. 52 (1), 15–19 (1998)

    Google Scholar 

  7. Hirose, H., Koga, G.: A comparative study in the bump hunting between the tree-ga and the prim. Stud. Comp. Int. Dev. 443, 13–25 (2012)

    Google Scholar 

  8. Joliffe, I.: A note on the use of principal components in regression. J. R. Stat. Soc. Ser. C. Appl. Stat. 31 (3), 300–303 (1982)

    Google Scholar 

  9. Mardia, K.: Multivariate Analysis. Academic Press, New York (1976)

    Google Scholar 

  10. Polonik, W., Wang, Z.: Prim analysis. J. Multivar. Anal., 101 (3), 525–540 (2010)

    Article  MathSciNet  MATH  Google Scholar 

Download references

Acknowledgements

All authors supported in part by NIH grant NCI R01-CA160593A1. We would like to thank Rob Tibshirani, Steve Marron, and Hemant Ishwaran for helpful discussions of the work. This work made use of the High Performance Computing Resource in the Core Facility for Advanced Research Computing at Case Western Reserve University.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to J. Sunil Rao .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this chapter

Cite this chapter

Díaz-Pachón, D.A., Dazard, JE., Rao, J.S. (2017). Unsupervised Bump Hunting Using Principal Components. In: Ahmed, S. (eds) Big and Complex Data Analysis. Contributions to Statistics. Springer, Cham. https://doi.org/10.1007/978-3-319-41573-4_16

Download citation

Publish with us

Policies and ethics