Abstract
Graph-theoretic classification models provide us with probability models which can be used to study the structure of a data set. In models of random interval graphs or, generally, random coincidence graphs, points are drawn “at random” and joined by lines if their mutual distances are smaller than a threshold d. This is exactly the procedure of finding linkage clusters. We present exact and asymptotic results for properties of those random graphs, especially for the properties that the expected numbers of isolated edges and of isolated vertices remain positive finite as the total number of vertices grows. These properties can serve as test statistics for testing the homogeneity in a data set; they can be used to derive tests for goodness of fit as well.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
APPEL, M.J.B., RUSSO, R.P. (1996, submitted): The minimum vertex degree of a graph on uniform points in [0,1]. Journal of Applied Probability.
BARBOUR, A.D., HOLST, L., JANSON, S. (1992): Poisson approximations. Clarendon Press, Oxford.
BOCK, H.H. (1996): Probabilistic models in cluster analysis. Computational Statistics and Data Analysis, 23, 5–28.
ERDöS, P., RENYI, A. (1960): On the evolution of random graphs. Publications of the Mathematical Institute of the Hungarian Academy of Sciences, 5, 17–61.
GILBERT, E.N. (1959): Random graphs. Annals of Mathematical Statistics, 30, 1141–1144.
GODEHARDT, E. (1993): Probability models for random multigraphs with applications in cluster analysis. Annals of Discrete Mathematics, 55, 93–108.
GODEHARDT, E., HORSCH, A. (1995): Graph-theoretic models for testing the homogeneity of data. in: W. GAUL, D. PFEIFER (eds.), From Data to Knowledge: Theoretical and Practical Aspects of Classification, Data Analysis and Knowledge Organization (Proceedings 18th Annual Conference of the Gesellschaft für Klassifikation e. V., Oldenburg, March 9–11, 1994). Springer, Berlin–Heidelberg–New York, 167–176.
GODEHARDT, E., JAWORSKI, J. (1996): On the connectivity of a random graph. Random Structures and Algorithms, 9, 137–161.
HAFNER, R. (1972): The asymptotic distribution of random clumps. Computing, 10, 335–351.
KENNEDY, J.W. (1976): Random clumps, graphs, and polymer solutions. In: Y. ALAVI, D.R. LICK (eds.): Theory and Applications of Graphs. Springer, Berlin–Heidelberg–New York, 314–329.
SCHEINERMAN, E.R. (1990): An evolution of interval graphs. Discrete Mathematics, 82, 287–302.
SOLOMON, H. (1976): Geometric probability. Society for Industrial and Applied Mathematics, Philadelphia.
ZHOU, X., JAMMALAMADAKA, S.R. (1989): Bahadur Efficiencies of Spacings Tests for Goodness of Fit. Annals of the Institute of Statistics and Mathematics 41, 541–553.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1998 Springer-Verlag Berlin · Heidelberg
About this paper
Cite this paper
Godehardt, E., Jaworski, J., Godehardt, D. (1998). The Application of Random Coincidence Graphs for Testing the Homogeneity of Data. In: Balderjahn, I., Mathar, R., Schader, M. (eds) Classification, Data Analysis, and Data Highways. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-72087-1_4
Download citation
DOI: https://doi.org/10.1007/978-3-642-72087-1_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-63909-1
Online ISBN: 978-3-642-72087-1
eBook Packages: Springer Book Archive