Skip to main content

Alternative strategies and CATANOVA testing in two-stage binary segmentation

  • Conference paper
New Approaches in Classification and Data Analysis

Abstract

In the framework of binary segmentation, we introduce alternative splitting criteria based on the predictability r index of Goodman and Kruskal. We use such splitting criteria in a two-stage predictive splitting procedure. Furthermore, we introduce as stopping rule a statistical test based on the CATANOVA statistic of Light and Margolin. We show an example on a real data set.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • BREIMAN, L., FRIEDMAN, J.H., OLSHEN, R.A. and STONE, C.J. (1984): Classification and Regression Trees. Wads worth International Group, Belmont, California.

    Google Scholar 

  • CIAMPI, A. and THIFFAULT, J. (1987): Recursive partition and Amalgamation (REC-PAM) for censored survival data: criteria for tree selector, Statistical Software Newsletter, n. 2, vol. 14, 78–81.

    Google Scholar 

  • ELIE, S. (1992): Les methódes d’élagages des arbres de segmentation. D.E.A. thesis in: Controle des systemes, Renault/U.T.C./INRIA.

    Google Scholar 

  • GELFAND, S.B. (1991): An iterative growing and pruning algorithm for classification tree designai data. TREE Transaction of pattern analysis and machine intelligence, n. 2, vol. 13, 163–174.

    Article  Google Scholar 

  • GOODMAN, L.A. and KRUSKAL, W.H. (1954): Measures of association for cross-classification. Journal of American Statistical Association, 48, 732–762.

    Google Scholar 

  • LECHEVALLIER, Y., (1990): Recherche d’une partition optimale sous contrainte d’ordre total. rapports de Recherche N.1247 INRIA

    Google Scholar 

  • LIGHT, R.J. and MARGOLIN, B.H. (1971): An analysis of variance for categorical data. Journal of American Statistical Association, 66, 534–544.

    Article  Google Scholar 

  • MOLA, F. and SICILIANO, R. (1992): A two-stage predictive splitting algorithm in binary segmentation. In: Y. Dodge and J. Whittaker (eds.): Computational Statistics. (Compstat ’92 Proceedings). Physica Verlag, vol. 1.

    Google Scholar 

  • MOLA, F. (1993): Aspetti metodologici e computazionali delle tecniche di segmentazione binaria. Un contributo basato su funzioni di predizione. Tesi di Dottorato in Statistica Computazionale ed Applicazioni, V Ciclo, Università di Napoli.

    Google Scholar 

  • MORGAN, J.N. and SONQUIST, J.A. (1963): Problems in the analysis of survey data and a proposals. Journal of American Statistical Association, 58, 415–434.

    Article  Google Scholar 

  • QUINLAN, J.R. (1986): Induction of decision trees. Machine learning, n. 1, 81–106.

    Google Scholar 

  • UTGOFF, P.E. (1989): Incremental induction of decision trees. Machine learning, n. 4, 161–186.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1994 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Mola, F., Siciliano, R. (1994). Alternative strategies and CATANOVA testing in two-stage binary segmentation. In: Diday, E., Lechevallier, Y., Schader, M., Bertrand, P., Burtschy, B. (eds) New Approaches in Classification and Data Analysis. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-51175-2_37

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-51175-2_37

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-58425-4

  • Online ISBN: 978-3-642-51175-2

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics