Skip to main content

Simultaneous Threshold Interaction Detection in Binary Classification

  • Conference paper
  • First Online:
Data Analysis and Classification

Abstract

Classification Trunk Approach (CTA) is a method for the automatic selection of threshold interactions in generalized linear modelling (GLM). It comes out from the integration of classification trees and GLM. Interactions between predictors are expressed as “threshold interactions” instead of traditional cross-products. Unlike classification trees, CTA is based on a different splitting criterion and it is framed in a new algorithm – STIMA – that can be used to estimate threshold interactions effects in classification and regression models. This paper specifically focuses on the binary response case, and presents the results of an application on the Liver Disorders dataset to give insight into the advantages deriving from the use of CTA with respect to other model-based or decision tree-based approaches. Performances of the different methods are compared focusing on prediction accuracy and model complexity.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • Breiman, L. (1996). Bagging predictors. Machine Learning, 24, 123–140.

    MathSciNet  MATH  Google Scholar 

  • Breiman, L. (2001). Random forests. Machine Learning, 45, 5–32.

    Article  MATH  Google Scholar 

  • Breiman, L., Friedman, J. H., Olshen, R. A., & Stone, C. J. (1984). Classification and regression trees. Belmont, CA: Wadsworth.

    MATH  Google Scholar 

  • Cohen, J., Cohen, P., West, S. G., & Aiken, L. S. (2003). Applied multiple regression/correlation analysis for the behavioral sciences (3rd edition). Mahwah NJ: Lawrence Erlbaum.

    Google Scholar 

  • de Gonzalez, A. B., & Cox, D. R. (2007). Interpretation of interaction: A review. Annals of Applied Statistics, 1(2), 371–375.

    Article  MathSciNet  MATH  Google Scholar 

  • Dusseldorp, E., & Meulman, J. (2004). The regression trunk approach to discover treatment covariate interactions. Psychometrika, 69, 355–374.

    Article  MathSciNet  Google Scholar 

  • Dusseldorp, E., Spinhoven, P., Bakker, A., Van Dyck, R., & Van Balkom, A. J. L. M. (2007). Which panic disorder patients benefit from which treatment: Cognitive therapy or antidepressants? Psychotherapy and Psychosomatics, 76, 154–161.

    Article  Google Scholar 

  • Dusseldorp, E., Conversano, C., & Van Os, B. J. (2009). Combining an Additive and tree-based regression model simulatenously: STIMA, Journal of Computational and Graphical Statistics, to appear.

    Google Scholar 

  • Fahrmeir, L., & Tutz, G. (2001). Multivariate statistical modelling based on generalized linear models (2nd edition). New York: Springer.

    MATH  Google Scholar 

  • Freund, Y., & Schapire, R. (1997). A decision-theoretic generalization of on-line learning and an application to Boosting. Journal of Computer and System Sciences, 55(1), 119–139.

    Article  MathSciNet  MATH  Google Scholar 

  • Friedman, J. H. (1991). Multivariate adaptive regression splines (with discussion). Annals of Statistics, 19, 1–141.

    Article  MathSciNet  MATH  Google Scholar 

  • Hastie, T. J., & Tibshirani, R. J. (1990). Generalized additive models. London: Chapman & Hall.

    MATH  Google Scholar 

  • McCullagh, P., & Nelder, J. A. (1989). Generalized linear models (2nd edition). London: Chapman & Hall.

    MATH  Google Scholar 

  • Vapnik, V. (1998). Statistical learning theory. New York: Wiley.

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Claudio Conversano .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Conversano, C., Dusseldorp, E. (2010). Simultaneous Threshold Interaction Detection in Binary Classification. In: Palumbo, F., Lauro, C., Greenacre, M. (eds) Data Analysis and Classification. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03739-9_26

Download citation

Publish with us

Policies and ethics