Skip to main content

Approximation of Distributions by Sets

  • Conference paper
Classification in the Information Age

Abstract

The well-known ’k-means’ clustering can be regarded as an approximation of a given distribution (which can be a sample) by a set of optimally chosen k points. However, in many cases approximative sets of different types are of interest. For example, approximation of a distribution by circles is important in allocating communication stations, the circles being interpreted as working areas of the stations. The paper covers two related topics. First we propose a heuristic algorithm to find k circles of a given radius r that fit with the planar data set. Then we analyse the problem of consistency: does a sequence of sample-based sets of optimal circles converge to the class of optimal circles for the population? The positive answer is given for arbitrary finite-dimensional normed linear spaces.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • BOCK, H.H. (1974): Automatische Klassifikation, Theoretische und Praktische Methoden zur Gruppierung und Strukturierung von Daten (Clusteranalyse). Vandenhoeck and Ruprecht, Göttingen.

    Google Scholar 

  • CUESTA, J. and MATRAN, C. (1988): The strong law of large numbers for k-means and best possible nets of Banach-valued random variables. Probability Theory and Related Fields, 78, 523–534.

    Article  Google Scholar 

  • FLURY, B.A. (1993): Principal points. Biometrika, 77, 33–42.

    Article  Google Scholar 

  • HASTIE, T. and STUETZLE, W. (1989): Principal curves. Journal of American Satistical Association, 84, 502–516.

    Article  Google Scholar 

  • LLOYD, S. P. (1982): Least squares quantization in PCM. IEEE Transactions on Information Theory, 28, 129–136.

    Article  Google Scholar 

  • MACQUEEN, J. (1967): Some methods for classification and analysis of multivariate observations. Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, I, 281–297.

    Google Scholar 

  • KIPPER, S. and PARNA, K. (1992): Optimal k-centres for a two-dimensional normal distribution. Acta et Commentationes Universitatis Tartuensis, 942, 21–27.

    Google Scholar 

  • POLLARD, D. (1981): Strong consistency of k-means clustering. Annals of Statistics, 9, 135–140.

    Article  Google Scholar 

  • SPÄTH, H. (1975): Cluster-Analyse-Algorithmen zur Objektklassifizierung und Datenreduktion. R. Oldenbourg Verlag, München - Wien.

    Google Scholar 

  • SPüTH, H. (1997): Orthonormal distance fitting by circles and ellipses with given area. Journal of Computational Statistics, 12, 343–354.

    Google Scholar 

  • VARADARAJAN, V.S. (1958): Weak convergence of measures on separable metric spaces. Sankhya, 19, 15–22.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1999 Springer-Verlag Berlin · Heidelberg

About this paper

Cite this paper

Pärna, K., Lember, J., Viiart, A. (1999). Approximation of Distributions by Sets. In: Gaul, W., Locarek-Junge, H. (eds) Classification in the Information Age. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-60187-3_21

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-60187-3_21

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-65855-9

  • Online ISBN: 978-3-642-60187-3

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics