Abstract
Assume that n q-dimensional data points have been obtained and subjected to a cluster analysis algorithm. A potential concern is whether the resulting clusters have a “causal” interpretation or whether they are merely consequences of “random” fluctuation. In previous reports, the asymptotic properties of a number of potentially useful combinatorial tests based on the theory of random interval graphs were described. In the present work, comparisons of the asymptotic efficacy of a class of these tests are provided. As a particular illustration of potential applications, we discuss the detection of mixtures of probability distributions and provide some numerical illustrations.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
BOCK, H.H. (1974): Automatische Klassifikation. Vandenhoeck & Ruprecht, Göttingen.
EBERL, W. and HAFNER, R. (1971): Die asymptotische Verteilung von Koinzi denzen. Zeitschrift für Wahrscheinlichkeitstheorie und verwandte Gebiete, 18, 322–332.
FISHER, R.A. (1936): The use of multiple measurements in taxonomic problems. Annals of Eugenic, 7, 179–188.
FISHER, R.A. (1938): The statistical utilization of multiple measurements. Annals of Eugenic, 8, 376–386.
FISHER, R.A. (1940): The precision of discriminant functions. Annals of Eugenic, 10, 422–429.
GODEHARDT, E. (1990): Graphs as structural models. Vieweg, Braunschweig.
GODEHARDT, E. and HARRIS, B. (1995): Asymptotic properties of random interval graphs and their use in cluster analysis. University of Wisconsin Statistics Department Technical Report (submitted for publication).
HAFNER, R. (1972): Die asymptotische Verteilung von mehrfachen Koinzidenzen. Zeitschrift für Wahrscheinlichkeitstheorie und verwandte Gebiete, 21, 96108.
HARRIS, B. and GODEHARDT, E. (1998): Probability models and limit theorems for random interval graphs with applications to cluster analysis. In: I. Balderjahn, R. Mathar, M. Schader (eds.): Classification, Data Analysis, and Data Highways. Springer, Berlin–Heidelberg–New York, 54–61.
JAMMALAMADAKA, S.R. and JANSON, S. (1986): Limit theorems for a triangular scheme of U-statistics with applications to interpoint distances. Annals of Probability 14, 1347–1358.
JAMMALAMADAKA, S.R. and ZHOU, X. (1990): Some goodness of fit tests in higher dimensions based on interpoint distances. In: Proceedings of the R.C. Bose Symposium on Probability, Statistics and Design of Experiments, Delhi 1988. Wiley Eastern, New Delhi, 391–404.
SMAEHARA, H. (1990): On the intersection graph of random arcs on the cycle. In: M. Karonski, J. Jaworski, A. Rucinski (eds.): Random Graphs ‘87. John Wiley & Sons, New York–Chichester–Brisbane, 159–173.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer-Verlag Berlin · Heidelberg
About this paper
Cite this paper
Harris, B., Godehardt, E. (1999). The Comparative Efficacy of Some Combinatorial Tests for Detection of Clusters and Mixtures of Probability Distributions. In: Gaul, W., Locarek-Junge, H. (eds) Classification in the Information Age. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-60187-3_30
Download citation
DOI: https://doi.org/10.1007/978-3-642-60187-3_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65855-9
Online ISBN: 978-3-642-60187-3
eBook Packages: Springer Book Archive