Approximation of Distributions by Sets

Pärna, K.; Lember, J.; Viiart, A.

doi:10.1007/978-3-642-60187-3_21

K. Pärna⁶,
J. Lember⁶ &
A. Viiart⁶

Part of the book series: Studies in Classification, Data Analysis, and Knowledge Organization ((STUDIES CLASS))

592 Accesses
3 Citations

Abstract

The well-known ’k-means’ clustering can be regarded as an approximation of a given distribution (which can be a sample) by a set of optimally chosen k points. However, in many cases approximative sets of different types are of interest. For example, approximation of a distribution by circles is important in allocating communication stations, the circles being interpreted as working areas of the stations. The paper covers two related topics. First we propose a heuristic algorithm to find k circles of a given radius r that fit with the planar data set. Then we analyse the problem of consistency: does a sequence of sample-based sets of optimal circles converge to the class of optimal circles for the population? The positive answer is given for arbitrary finite-dimensional normed linear spaces.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

BOCK, H.H. (1974): Automatische Klassifikation, Theoretische und Praktische Methoden zur Gruppierung und Strukturierung von Daten (Clusteranalyse). Vandenhoeck and Ruprecht, Göttingen.
Google Scholar
CUESTA, J. and MATRAN, C. (1988): The strong law of large numbers for k-means and best possible nets of Banach-valued random variables. Probability Theory and Related Fields, 78, 523–534.
Article Google Scholar
FLURY, B.A. (1993): Principal points. Biometrika, 77, 33–42.
Article Google Scholar
HASTIE, T. and STUETZLE, W. (1989): Principal curves. Journal of American Satistical Association, 84, 502–516.
Article Google Scholar
LLOYD, S. P. (1982): Least squares quantization in PCM. IEEE Transactions on Information Theory, 28, 129–136.
Article Google Scholar
MACQUEEN, J. (1967): Some methods for classification and analysis of multivariate observations. Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, I, 281–297.
Google Scholar
KIPPER, S. and PARNA, K. (1992): Optimal k-centres for a two-dimensional normal distribution. Acta et Commentationes Universitatis Tartuensis, 942, 21–27.
Google Scholar
POLLARD, D. (1981): Strong consistency of k-means clustering. Annals of Statistics, 9, 135–140.
Article Google Scholar
SPÄTH, H. (1975): Cluster-Analyse-Algorithmen zur Objektklassifizierung und Datenreduktion. R. Oldenbourg Verlag, München - Wien.
Google Scholar
SPüTH, H. (1997): Orthonormal distance fitting by circles and ellipses with given area. Journal of Computational Statistics, 12, 343–354.
Google Scholar
VARADARAJAN, V.S. (1958): Weak convergence of measures on separable metric spaces. Sankhya, 19, 15–22.
Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Mathematical Statistics, University of Tartu, J.Liivi St.2, Tartu, 50409, Estonia
K. Pärna, J. Lember & A. Viiart

Authors

K. Pärna
View author publications
You can also search for this author in PubMed Google Scholar
J. Lember
View author publications
You can also search for this author in PubMed Google Scholar
A. Viiart
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institut für Entscheidungstheorie und Unternehmensforschung, University of Karlsruhe, Postfach 69 80, D-76128, Karlsruhe, Germany
Wolfgang Gaul
Lehrstuhl für BWL, insb. Finanzwirtschaft, University of Dresden, Mommsenstr. 13, D-01062, Dresden, Germany
Hermann Locarek-Junge

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Pärna, K., Lember, J., Viiart, A. (1999). Approximation of Distributions by Sets. In: Gaul, W., Locarek-Junge, H. (eds) Classification in the Information Age. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-60187-3_21

Download citation

DOI: https://doi.org/10.1007/978-3-642-60187-3_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65855-9
Online ISBN: 978-3-642-60187-3
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics