Abstract
The well-known ’k-means’ clustering can be regarded as an approximation of a given distribution (which can be a sample) by a set of optimally chosen k points. However, in many cases approximative sets of different types are of interest. For example, approximation of a distribution by circles is important in allocating communication stations, the circles being interpreted as working areas of the stations. The paper covers two related topics. First we propose a heuristic algorithm to find k circles of a given radius r that fit with the planar data set. Then we analyse the problem of consistency: does a sequence of sample-based sets of optimal circles converge to the class of optimal circles for the population? The positive answer is given for arbitrary finite-dimensional normed linear spaces.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
BOCK, H.H. (1974): Automatische Klassifikation, Theoretische und Praktische Methoden zur Gruppierung und Strukturierung von Daten (Clusteranalyse). Vandenhoeck and Ruprecht, Göttingen.
CUESTA, J. and MATRAN, C. (1988): The strong law of large numbers for k-means and best possible nets of Banach-valued random variables. Probability Theory and Related Fields, 78, 523–534.
FLURY, B.A. (1993): Principal points. Biometrika, 77, 33–42.
HASTIE, T. and STUETZLE, W. (1989): Principal curves. Journal of American Satistical Association, 84, 502–516.
LLOYD, S. P. (1982): Least squares quantization in PCM. IEEE Transactions on Information Theory, 28, 129–136.
MACQUEEN, J. (1967): Some methods for classification and analysis of multivariate observations. Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, I, 281–297.
KIPPER, S. and PARNA, K. (1992): Optimal k-centres for a two-dimensional normal distribution. Acta et Commentationes Universitatis Tartuensis, 942, 21–27.
POLLARD, D. (1981): Strong consistency of k-means clustering. Annals of Statistics, 9, 135–140.
SPÄTH, H. (1975): Cluster-Analyse-Algorithmen zur Objektklassifizierung und Datenreduktion. R. Oldenbourg Verlag, München - Wien.
SPüTH, H. (1997): Orthonormal distance fitting by circles and ellipses with given area. Journal of Computational Statistics, 12, 343–354.
VARADARAJAN, V.S. (1958): Weak convergence of measures on separable metric spaces. Sankhya, 19, 15–22.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer-Verlag Berlin · Heidelberg
About this paper
Cite this paper
Pärna, K., Lember, J., Viiart, A. (1999). Approximation of Distributions by Sets. In: Gaul, W., Locarek-Junge, H. (eds) Classification in the Information Age. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-60187-3_21
Download citation
DOI: https://doi.org/10.1007/978-3-642-60187-3_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65855-9
Online ISBN: 978-3-642-60187-3
eBook Packages: Springer Book Archive