A Local Search Approximation Algorithm for the k-means Problem with Penalties

Zhang, Dongmei; Hao, Chunlin; Wu, Chenchen; Xu, Dachuan; Zhang, Zhenning

doi:10.1007/978-3-319-62389-4_47

Conference paper
First Online: 01 July 2017

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 10392)

Abstract

In this paper, we study the k-means problem with (nonuniform) penalties (k-MPWP), which is a natural generalization of the classic k-means problem. In the k-MPWP, we are given an n-client set \( \mathcal {D} \subset \mathbb {R}^d\), a penalty cost \(p_j>0\) for each \(j \in \mathcal {D}\), and an integer \(k \le n\). The goal is to open a center subset \(F \subset \mathbb {R}^d\) with \( |F| \le k\) and to choose a client subset \(P \subseteq \mathcal {D} \) as the penalized client set such that the total cost is minimized. The total cost is the sum of the squared distances from each client in \( \mathcal {D} \setminus P \) to its nearest open center plus the sum of the penalty costs of the clients in P. We offer a local search \(( 81+ \varepsilon )\)-approximation algorithm for the k-MPWP using a single-swap operation. We further improve this approximation ratio to \(( 25+ \varepsilon )\) using a multi-swap operation.
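
The objective and the single-swap idea described above can be illustrated with a minimal Python sketch. It is not the authors' procedure and carries no approximation guarantee: it assumes that candidate centers come from a finite list (here, the clients themselves) rather than from all of \( \mathbb {R}^d\), and the function names, the starting solution, and the improvement threshold `eps` are hypothetical choices made for illustration. The one fact it relies on is that, for a fixed center set F, the cheapest choice of P penalizes exactly those clients whose penalty is smaller than their squared distance to the nearest open center.

```python
from itertools import product


def sq_dist(a, b):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def cost(clients, penalties, centers):
    """k-MPWP cost for a fixed center set: each client pays the smaller of
    its squared distance to the nearest open center and its penalty."""
    total = 0.0
    for point, penalty in zip(clients, penalties):
        nearest = min(sq_dist(point, c) for c in centers)
        total += min(nearest, penalty)
    return total


def penalized_set(clients, penalties, centers):
    """Indices of clients that are cheaper to penalize than to serve."""
    return [
        j
        for j, (point, penalty) in enumerate(zip(clients, penalties))
        if penalty < min(sq_dist(point, c) for c in centers)
    ]


def single_swap_local_search(clients, penalties, candidates, k, eps=1e-9):
    """Illustrative single-swap local search over a finite candidate list.

    Assumption: the start is simply the first k candidates, and a swap is
    accepted only if it lowers the cost by more than a (1 - eps) factor.
    """
    centers = list(candidates[:k])
    current = cost(clients, penalties, centers)
    improved = True
    while improved:
        improved = False
        # Try replacing one open center with one closed candidate.
        for i, new in product(range(len(centers)), candidates):
            if new in centers:
                continue
            trial = centers[:i] + [new] + centers[i + 1:]
            trial_cost = cost(clients, penalties, trial)
            if trial_cost < (1 - eps) * current:
                centers, current, improved = trial, trial_cost, True
                break  # restart the scan from the improved solution
    return centers, current


# Toy instance: two tight pairs of clients and one far-away, cheap-to-drop client.
clients = [(0, 0), (0, 1), (10, 0), (10, 1), (100, 100)]
penalties = [50, 50, 50, 50, 5]
centers, total = single_swap_local_search(clients, penalties, clients, k=2)
dropped = penalized_set(clients, penalties, centers)
```

In this toy instance the far-away client has a small penalty, so the sketch ends up penalizing it instead of spending one of the two centers on it. A multi-swap variant would exchange several centers at once in place of the single replacement tried in the inner loop.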

Acknowledgements

The research of the first author is supported by the Higher Educational Science and Technology Program of Shandong Province (No. J15LN23). The second author is supported by the Ri-Xin Talents Project of Beijing University of Technology. The third author is supported by the Natural Science Foundation of China (No. 11501412). The fourth author is supported by the Natural Science Foundation of China (No. 11531014). The fifth author is supported by Beijing Excellent Talents Funding (No. 2014000020124G046).

Author information

Authors and Affiliations

School of Computer Science and Technology, Shandong Jianzhu University, Jinan, 250101, People’s Republic of China
Dongmei Zhang
Department of Information and Operations Research, College of Applied Sciences, Beijing University of Technology, Beijing, 100124, People’s Republic of China
Chunlin Hao, Dachuan Xu & Zhenning Zhang
College of Science, Tianjin University of Technology, Tianjin, 300384, People’s Republic of China
Chenchen Wu

Corresponding author

Correspondence to Dachuan Xu.

Editor information

Editors and Affiliations

Department of Computing, Hong Kong Polytechnic University, Hong Kong, China
Yixin Cao
Texas A&M University, College Station, Texas, USA
Jianer Chen

About this paper

Cite this paper

Zhang, D., Hao, C., Wu, C., Xu, D., Zhang, Z. (2017). A Local Search Approximation Algorithm for the k-means Problem with Penalties. In: Cao, Y., Chen, J. (eds) Computing and Combinatorics. COCOON 2017. Lecture Notes in Computer Science, vol 10392. Springer, Cham. https://doi.org/10.1007/978-3-319-62389-4_47

DOI: https://doi.org/10.1007/978-3-319-62389-4_47
Published: 01 July 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-62388-7
Online ISBN: 978-3-319-62389-4
eBook Packages: Computer Science, Computer Science (R0)
