The Comparative Efficacy of Some Combinatorial Tests for Detection of Clusters and Mixtures of Probability Distributions

  • Conference paper
Classification in the Information Age

Abstract

Assume that n q-dimensional data points have been obtained and subjected to a cluster analysis algorithm. A potential concern is whether the resulting clusters have a “causal” interpretation or whether they are merely consequences of “random” fluctuation. In previous reports, the asymptotic properties of a number of potentially useful combinatorial tests based on the theory of random interval graphs were described. In the present work, comparisons of the asymptotic efficacy of a class of these tests are provided. As a particular illustration of potential applications, we discuss the detection of mixtures of probability distributions and provide some numerical illustrations.
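To make the idea of a combinatorial test on a graph built from the data concrete, here is a minimal sketch for the one-dimensional case (q = 1). It rests on assumptions that are not taken from the paper: two points are joined by an edge when they lie within a hypothetical threshold `d` of each other, the test statistic is the number of edges, and the null hypothesis is a uniform sample on [0, 1]. The function names `edge_count` and `monte_carlo_p_value` are illustrative. The null distribution is approximated by Monte Carlo simulation, not by the asymptotic results the abstract refers to.

```python
import random


def edge_count(points, d):
    """Count pairs of points at distance <= d (edges of the threshold graph)."""
    xs = sorted(points)
    count = 0
    lo = 0
    for hi in range(len(xs)):
        # Advance the left pointer until xs[lo] is within d of xs[hi].
        while xs[hi] - xs[lo] > d:
            lo += 1
        # Every point in xs[lo:hi] forms an edge with xs[hi].
        count += hi - lo
    return count


def monte_carlo_p_value(points, d, n_sim=500, seed=0):
    """Estimate how often a uniform sample on [0, 1] has at least as many edges.

    Assumption: the null model is n independent uniform points on [0, 1].
    """
    rng = random.Random(seed)
    observed = edge_count(points, d)
    n = len(points)
    hits = 0
    for _ in range(n_sim):
        sample = [rng.random() for _ in range(n)]
        if edge_count(sample, d) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly positive.
    return (hits + 1) / (n_sim + 1)


if __name__ == "__main__":
    # Two tight groups of points: many more edges than a uniform sample would give.
    clustered = [0.2 + 0.001 * i for i in range(20)] + [0.8 + 0.001 * i for i in range(20)]
    print(edge_count(clustered, 0.05))
    print(monte_carlo_p_value(clustered, 0.05))
```

A small estimated p-value says that the observed number of edges would be unusual for unstructured data, which is the sense in which a cluster might be more than a "random" fluctuation. Other graph statistics could be substituted for the edge count in the same scheme.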

References

  1. BOCK, H.H. (1974): Automatische Klassifikation. Vandenhoeck & Ruprecht, Göttingen.

  2. EBERL, W. and HAFNER, R. (1971): Die asymptotische Verteilung von Koinzidenzen. Zeitschrift für Wahrscheinlichkeitstheorie und verwandte Gebiete, 18, 322–332.

  3. FISHER, R.A. (1936): The use of multiple measurements in taxonomic problems. Annals of Eugenics, 7, 179–188.

  4. FISHER, R.A. (1938): The statistical utilization of multiple measurements. Annals of Eugenics, 8, 376–386.

  5. FISHER, R.A. (1940): The precision of discriminant functions. Annals of Eugenics, 10, 422–429.

  6. GODEHARDT, E. (1990): Graphs as structural models. Vieweg, Braunschweig.

  7. GODEHARDT, E. and HARRIS, B. (1995): Asymptotic properties of random interval graphs and their use in cluster analysis. University of Wisconsin Statistics Department Technical Report (submitted for publication).

  8. HAFNER, R. (1972): Die asymptotische Verteilung von mehrfachen Koinzidenzen. Zeitschrift für Wahrscheinlichkeitstheorie und verwandte Gebiete, 21, 96–108.

  9. HARRIS, B. and GODEHARDT, E. (1998): Probability models and limit theorems for random interval graphs with applications to cluster analysis. In: I. Balderjahn, R. Mathar, M. Schader (eds.): Classification, Data Analysis, and Data Highways. Springer, Berlin–Heidelberg–New York, 54–61.

  10. JAMMALAMADAKA, S.R. and JANSON, S. (1986): Limit theorems for a triangular scheme of U-statistics with applications to interpoint distances. Annals of Probability, 14, 1347–1358.

  11. JAMMALAMADAKA, S.R. and ZHOU, X. (1990): Some goodness of fit tests in higher dimensions based on interpoint distances. In: Proceedings of the R.C. Bose Symposium on Probability, Statistics and Design of Experiments, Delhi 1988. Wiley Eastern, New Delhi, 391–404.

  12. MAEHARA, H. (1990): On the intersection graph of random arcs on the cycle. In: M. Karonski, J. Jaworski, A. Rucinski (eds.): Random Graphs ’87. John Wiley & Sons, New York–Chichester–Brisbane, 159–173.

Copyright information

© 1999 Springer-Verlag Berlin · Heidelberg

About this paper

Cite this paper

Harris, B., Godehardt, E. (1999). The Comparative Efficacy of Some Combinatorial Tests for Detection of Clusters and Mixtures of Probability Distributions. In: Gaul, W., Locarek-Junge, H. (eds) Classification in the Information Age. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-60187-3_30

  • DOI: https://doi.org/10.1007/978-3-642-60187-3_30

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-65855-9

  • Online ISBN: 978-3-642-60187-3

  • eBook Packages: Springer Book Archive
