Clustering of Economic Data with Modified K-Mean Technique

Pham, Trung T.

doi:10.1007/978-3-030-12388-8_44

Clustering of Economic Data with Modified K-Mean Technique

Trung T. Pham⁴

Conference paper
First Online: 02 February 2019

1289 Accesses

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 69))

Abstract

This paper presents a newly modified K-Mean technique for clustering data that are not situated around a single point center. When the clusters are elongated, the traditional K-Mean technique cannot yield meaningful results. In modifying the K-Mean technique to allow a center to be a line segment, elongated clusters can be extracted for analysis. The distance function is modified to measure the distance between a point and a set (line segment). The modified technique can be easily extended to multidimensional data where the center is shaped as a hyperplane, and the clusters of data that are situated around the hyperplane can be easily extracted and modeled into a regression model. The technique is applied to economic data of Chile, where the clusters are shown to be of irregular shapes, and where it is common to find regression model representing data sets.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Aggarwal, C.C., Reddy, C.K.: Data Clustering: Algorithms and Applications. Chapman & Hall/CRC, Boca Raton (2013)
Book Google Scholar
Gan, G., Ma, C., Wu, J.: Data Clustering: Theory, Algorithms, and Applications. Society for Industrial and Applied Mathematics (SIAM), Philadelphis (2007)
Book Google Scholar
Akhiezer, N.I., Glazman, I.M.: Theory of Linear Operators in Hilbert Space. Dover Publications, New York (1993)
MATH Google Scholar
Young, N.: An Introduction to Hilbert Space. Cambridge University Press, Cambridge (1988)
Book Google Scholar
Schroeder, L.D., Sjoquist, D.L., Stephan, P.E.: Understanding Regression Analysis: An Introductory Guide. SAGE Publications, Thousand Oaks (2017)
Book Google Scholar
Treiman, D.J.: Quantitative Data Analysis: Doing Social Research to Test Ideas. Jossey-Bass, San Francisco (2009)
Google Scholar
Berkhin, P.: A survey of clustering data mining techniques. In: Grouping Multidimensional Data, pp. 25–71. Springer, Heidelberg (2006)
Google Scholar
Popat, S.K., Emmanuel, M.: Review and comparative study of clustering techniques. Int. J. Comput. Sci. Inf. Technol. 5(1), 805–812 (2014)
Google Scholar
Wu, J.: Advances in K-Means Clustering: A Data Mining Thinking. Springer-Verlag, Berlin (2012)
Book Google Scholar
Wang, J., Wang, J., Song, J., Xu, X.S., Shen, H.T., Li, S.: Optimized cartesian K-means. IEEE Trans. Knowl. Data Eng. 27(1), 180–192 (2015)
Article Google Scholar
Memon, K.H., Lee, D.H.: Generalised fuzzy c-means clustering algorithm with local information. IET Image Process. 11(1), 1–12 (2017)
Article Google Scholar
Sato, M., Sato, Y.: Fuzzy Clustering Models and Applications. Physica-Verlag, Heidelberg (2002)
MATH Google Scholar
Huang, W., Ribeiro, A.: Hierarchical clustering given confidence intervals of metric distances. IEEE Trans. Signal Process. 66(10), 2600–2615 (2018)
Article MathSciNet Google Scholar
Zhou, S., Xu, Z., Liu, F.: Method for determining the optimal number of clusters based on agglomerative hierarchical clustering. IEEE Trans. Neural Netw. Learn. Syst. 28(12), 3007–3017 (2017)
Article MathSciNet Google Scholar
Nguyen, H.D., McLachlan, G.J., Orban, P., Bellec, P., Janke, A.L.: Maximum pseudolikelihood estimation for model-based clustering of time series data. Neural Comput. 29(4), 990–1020 (2017)
Article MathSciNet Google Scholar
Chen, L., Jiang, Q., Wang, S.: Model-based method for projective clustering. IEEE Trans. Knowl. Data Eng. 24(7), 1291–1305 (2012)
Article Google Scholar
Kutner, M.H., Nachtsheim, C.K., Neter, J.: Applied Linear Regression Models. McGraw-Hill Education, New York (2004)
Google Scholar
Darlington, R.B., Hayes, A.F.: Regression Analysis and Linear Models: Concepts, Applications, and Implementation. The Guilford Press, New York (2016)
Google Scholar
Breuer, J.: Introduction to the Theory of Sets. Dover Publications, New York (2006)
MATH Google Scholar
Cunningham, D.W.: Set Theory: A First Course. Cambridge University Press, Cambridge (2016)
Book Google Scholar
Brand, L.: Vector Analysis. Dover Publications, New York (2006)
MATH Google Scholar
Alabiso, C., Weiss, I.: A Primer on Hilbert Space Theory: Linear Spaces, Topological Spaces, Metric Spaces, Normed Spaces, and Topological Groups. Springer, New York (2015)
Book Google Scholar
Barvinok, A.: A Course in Convexity. American Mathematical Society, Providence (2002)
Book Google Scholar
Berkovitz, L.D.: Convexity and Optimization in Rⁿ. Wiley-Interscience, New York (2001)
Google Scholar

Download references

Acknowledgment

Part of this study was supported by the Chilean R&D Agency CONICYT, under the research grant FONDEF IT15I10042 for the duration of 2016–2018. Economic data used in this paper were obtained from the Central Bank of Chile.

Author information

Authors and Affiliations

Universidad de Talca, Talca, 3460000, Región Del Maule, Chile
Trung T. Pham

Authors

Trung T. Pham
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Trung T. Pham .

Editor information

Editors and Affiliations

Faculty of Science and Engineering, Saga University, Saga, Japan
Kohei Arai
The Science and Information (SAI) Organization, Bradford, UK
Rahul Bhatia

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Pham, T.T. (2020). Clustering of Economic Data with Modified K-Mean Technique. In: Arai, K., Bhatia, R. (eds) Advances in Information and Communication. FICC 2019. Lecture Notes in Networks and Systems, vol 69. Springer, Cham. https://doi.org/10.1007/978-3-030-12388-8_44

Download citation

DOI: https://doi.org/10.1007/978-3-030-12388-8_44
Published: 02 February 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-12387-1
Online ISBN: 978-3-030-12388-8
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics