Simultaneous Threshold Interaction Detection in Binary Classification
Classification Trunk Approach (CTA) is a method for the automatic selection of threshold interactions in generalized linear modelling (GLM). It comes out from the integration of classification trees and GLM. Interactions between predictors are expressed as “threshold interactions” instead of traditional cross-products. Unlike classification trees, CTA is based on a different splitting criterion and it is framed in a new algorithm – STIMA – that can be used to estimate threshold interactions effects in classification and regression models. This paper specifically focuses on the binary response case, and presents the results of an application on the Liver Disorders dataset to give insight into the advantages deriving from the use of CTA with respect to other model-based or decision tree-based approaches. Performances of the different methods are compared focusing on prediction accuracy and model complexity.
- Cohen, J., Cohen, P., West, S. G., & Aiken, L. S. (2003). Applied multiple regression/correlation analysis for the behavioral sciences (3rd edition). Mahwah NJ: Lawrence Erlbaum.Google Scholar
- Dusseldorp, E., Conversano, C., & Van Os, B. J. (2009). Combining an Additive and tree-based regression model simulatenously: STIMA, Journal of Computational and Graphical Statistics, to appear.Google Scholar