Abstract
This paper describes a new approach to high-dimensional mixed-type data clustering with missing values, which combines information on common nearest neighbors with classic between-vectors distances calculated by an original technique. The results are applied to form intersecting clusters for every missing value.
Similar content being viewed by others
References
Little, R.J.A. and Rubin, D.B., Statistical Analysis with Missing Data, New Jersey: Wiley, 2002, 2nd ed.
Ayuyev, V.V., Aung, Z.Y., and Thein, Ch.M., The Domain Compensation Method for Incomplete Information in a Database, in Tr. Mosk. Gos. Tekhn. Univ., Moskow: Gos. Tekhn. Univ., 2007, vol. 2, pp. 57–64.
Fujikawa, Y. and Ho, T.B., Cluster-based Algorithms for Dealing with Missing Values, in Proc. in Advances in Knowledge Discovery and Data Mining, Berlin: Springer, 2002, pp. 549–554.
Mantaras, R.L., A Distance-based Attribute Selection Measure for Decision Tree Induction, Mach. Learn., 1991, vol. 6, pp. 81–92.
Tan, P.N., Steinbach, M., and Kumar, V., Introduction to Data Mining, New York: Addison-Wesley, 2005.
Gan, G., Ma, C., and Wu, J., Data Clustering: Theory, Algorithms, and Applications, in ASA-SIAM Series on Statistics and Applied Probability, Philadelphia: SIAM, 2007, vol. 20, p. 466.
Wishart, D., K-means Clustering with Outlier Detection, Mixed Variables and Missing Values, in Exploratory Data Analysis in Empirical Research, Schwaiger, M. and Opitz, O., Eds., New York: Springer, 2003, pp. 216–226.
Ayuyev, V.V., Thura, A., Hlaing, N.N., and Loginova, M.B., A Modification of the Static Clustering Method for Operation with Inconnected Data, in Tr. Mosk. Gos. Tekhn. Univ., Moskow: Gos. Tekhn. Univ., 2008, vol. 2, pp. 86–93.
Asuncion, A. and Newman, D.J., UCI Machine Learning Repository, Irvine: Univ. of California, School of Inf. and Computer Sci., 2008.
Ertoz, L., Steinback, M., and Kumar, V., Finding Clusters of Different Sizes, Shapes, and Density in Noisy High-dimensional Data, Second SIAM Int. Conf. on Data Mining, San Francisco: SIAM, 2003, pp. 47–58.
Schafer, J.L., Multiple Imputation: A Primer, Statist. Meth. Medical Res., 1999, vol. 8, no. 1, pp. 3–15.
Author information
Authors and Affiliations
Additional information
Original Russian Text © V.V. Ayuyev, A. Thura, N.N. Hlaing, M.B. Loginova, 2008, published in Sistemy Upravleniya i Informatsionnye Tekhnologii, 2008, No. 3, pp. 26–29.
Rights and permissions
About this article
Cite this article
Ayuyev, V.V., Thura, A., Hlaing, N.N. et al. The quick dynamic clustering method for mixed-type data. Autom Remote Control 73, 2083–2088 (2012). https://doi.org/10.1134/S0005117912120120
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1134/S0005117912120120