Skip to main content

The Application of Random Coincidence Graphs for Testing the Homogeneity of Data

  • Conference paper
Book cover Classification, Data Analysis, and Data Highways

Abstract

Graph-theoretic classification models provide us with probability models which can be used to study the structure of a data set. In models of random interval graphs or, generally, random coincidence graphs, points are drawn “at random” and joined by lines if their mutual distances are smaller than a threshold d. This is exactly the procedure of finding linkage clusters. We present exact and asymptotic results for properties of those random graphs, especially for the properties that the expected numbers of isolated edges and of isolated vertices remain positive finite as the total number of vertices grows. These properties can serve as test statistics for testing the homogeneity in a data set; they can be used to derive tests for goodness of fit as well.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • APPEL, M.J.B., RUSSO, R.P. (1996, submitted): The minimum vertex degree of a graph on uniform points in [0,1]. Journal of Applied Probability.

    Google Scholar 

  • BARBOUR, A.D., HOLST, L., JANSON, S. (1992): Poisson approximations. Clarendon Press, Oxford.

    Google Scholar 

  • BOCK, H.H. (1996): Probabilistic models in cluster analysis. Computational Statistics and Data Analysis, 23, 5–28.

    Article  Google Scholar 

  • ERDöS, P., RENYI, A. (1960): On the evolution of random graphs. Publications of the Mathematical Institute of the Hungarian Academy of Sciences, 5, 17–61.

    Google Scholar 

  • GILBERT, E.N. (1959): Random graphs. Annals of Mathematical Statistics, 30, 1141–1144.

    Article  Google Scholar 

  • GODEHARDT, E. (1993): Probability models for random multigraphs with applications in cluster analysis. Annals of Discrete Mathematics, 55, 93–108.

    Article  Google Scholar 

  • GODEHARDT, E., HORSCH, A. (1995): Graph-theoretic models for testing the homogeneity of data. in: W. GAUL, D. PFEIFER (eds.), From Data to Knowledge: Theoretical and Practical Aspects of Classification, Data Analysis and Knowledge Organization (Proceedings 18th Annual Conference of the Gesellschaft für Klassifikation e. V., Oldenburg, March 9–11, 1994). Springer, Berlin–Heidelberg–New York, 167–176.

    Google Scholar 

  • GODEHARDT, E., JAWORSKI, J. (1996): On the connectivity of a random graph. Random Structures and Algorithms, 9, 137–161.

    Article  Google Scholar 

  • HAFNER, R. (1972): The asymptotic distribution of random clumps. Computing, 10, 335–351.

    Article  Google Scholar 

  • KENNEDY, J.W. (1976): Random clumps, graphs, and polymer solutions. In: Y. ALAVI, D.R. LICK (eds.): Theory and Applications of Graphs. Springer, Berlin–Heidelberg–New York, 314–329.

    Google Scholar 

  • SCHEINERMAN, E.R. (1990): An evolution of interval graphs. Discrete Mathematics, 82, 287–302.

    Article  Google Scholar 

  • SOLOMON, H. (1976): Geometric probability. Society for Industrial and Applied Mathematics, Philadelphia.

    Google Scholar 

  • ZHOU, X., JAMMALAMADAKA, S.R. (1989): Bahadur Efficiencies of Spacings Tests for Goodness of Fit. Annals of the Institute of Statistics and Mathematics 41, 541–553.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1998 Springer-Verlag Berlin · Heidelberg

About this paper

Cite this paper

Godehardt, E., Jaworski, J., Godehardt, D. (1998). The Application of Random Coincidence Graphs for Testing the Homogeneity of Data. In: Balderjahn, I., Mathar, R., Schader, M. (eds) Classification, Data Analysis, and Data Highways. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-72087-1_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-72087-1_4

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-63909-1

  • Online ISBN: 978-3-642-72087-1

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics