Abstract
In this paper, we study the k-means problem with (nonuniform) penalties (k-MPWP) which is a natural generalization of the classic k-means problem. In the k-MPWP, we are given an n-client set \( \mathcal {D} \subset \mathbb {R}^d\), a penalty cost \(p_j>0\) for each \(j \in \mathcal {D}\), and an integer \(k \le n\). The goal is to open a center subset \(F \subset \mathbb {R}^d\) with \( |F| \le k\) and to choose a client subset \(P \subseteq \mathcal {D} \) as the penalized client set such that the total cost (including the sum of squares of distance for each client in \( \mathcal {D} \setminus P \) to the nearest open center and the sum of penalty cost for each client in P) is minimized. We offer a local search \(( 81+ \varepsilon )\)-approximation algorithm for the k-MPWP by using single-swap operation. We further improve the above approximation ratio to \(( 25+ \varepsilon )\) by using multi-swap operation.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Aloise, D., Deshpande, A., Hansen, P., Popat, P.: NP-hardness of Euclidean sum-of-squares clustering. Mach. Learn. 75, 245–249 (2009)
Arya, V., Garg, N., Khandekar, R., Meyerson, A., Munagala, K., Pandit, V.: Local search heuristics for \(k\)-median and facility location problems. SIAM J. Comput. 33, 544–562 (2004)
Bandyapadhyay, S., Varadarajan, K.: On variants of \(k\)-means clustering. In: Proceedings of SoCG, Article No. 14, pp. 14:1–14:15 (2016)
Byrka, J., Pensyl, T., Rybicki, B., Srinivasan, A., Trinh, K.: An improved approximation for \(k\)-median, and positive correlation in budgeted optimization. In: Proceedings of SODA, pp. 737–756 (2014)
Charikar, M., Guha, S.: Improved combinatorial algorithms for the facility location and \(k\)-median problems. In: Proceedings of FOCS, pp. 378–388 (1999)
Charikar, M., Guha, S., Tardos, \(\acute{\rm E}\)., Shmoys, D.B. A constant-factor approximation algorithm for the \(k\)-median problem. In: Proceedings of STOC, pp. 1–10 (1999)
Charikar, M., Khuller, S., Mount, D.M., Narasimhan, G.: Algorithms for facility location problems with outliers. In: Proceedings of SODA, pp. 642–651 (2001)
Dasgupta, S. The hardness of \(k\)-means clustering. Technical Report CS2007-0890, University of California, San Diego (2007)
Georgogiannis, A.: Robust \(k\)-means: a theoretical revisit. In: Proceedings of NIPS, pp. 2883–2891 (2016)
Kanungo, T., Mount, D.M., Netanyahu, N.S., Piatko, C.D., Silverman, R., Wu, A.Y.: A local search approximation algorithm for \(k\)-means clustering. Comput. Geom. Theory Appl. 28, 89–112 (2004)
Li, Y., Du, D., Xiu, N., Xu, D.: Improved approximation algorithms for the facility location problems with linear/submodular penalties. Algorithmica 73, 460–482 (2015)
Li, S., Svensson, O.: Approximating \(k\)-median via pseudo-approximation. In: Proceedings of STOC, pp. 901–910 (2013)
Lloyd, S.: Least squares quantization in PCM. Technical report, Bell Laboratories (1957)
Lloyd, S.: Least squares quantization in PCM. IEEE Trans. Inf. Theory 28, 129–137 (1982)
Mahajan, M., Nimbhorkar, P., Varadarajan, K.: The planar \(k\)-means problem is NP-hard. In: Proceedings of WALCOM, pp. 274–285 (2009)
Makarychev, K., Makarychev, Y., Sviridenko, M., Ward, J.: A bi-criteria approximation algorithm for \(k\)-means. In: Proceedings of APPROX/RONDOM, Article No. 14, pp. 14:1–14:20 (2016)
Tseng, G.C.: Penalized and weighted \(k\)-means for clustering with scattered objects and prior information in high-throughput biological data. Bioinformatics 23, 2247–2255 (2007)
Matoušek, J.: On approximate geometric \(k\)-clustering. Discrete Comput. Geom. 24, 61–84 (2000)
Ward, J. Private Communication (2017)
Zhang, P.: A new approximation algorithm for the \(k\)-facility location problem. Theoret. Comput. Sci. 384, 126–135 (2007)
Acknowledgements
The research of the first author is supported by Higher Educational Science and Technology Program of Shandong Province (No. J15LN23). The second author is supported by Ri-Xin Talents Project of Beijing University of Technology. The third author is supported by Natural Science Foundation of China (No. 11501412). The fourth author is supported by Natural Science Foundation of China (No. 11531014). The fifth author is supported by Beijing Excellent Talents Funding (No. 2014000020124G046).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Zhang, D., Hao, C., Wu, C., Xu, D., Zhang, Z. (2017). A Local Search Approximation Algorithm for the k-means Problem with Penalties. In: Cao, Y., Chen, J. (eds) Computing and Combinatorics. COCOON 2017. Lecture Notes in Computer Science(), vol 10392. Springer, Cham. https://doi.org/10.1007/978-3-319-62389-4_47
Download citation
DOI: https://doi.org/10.1007/978-3-319-62389-4_47
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-62388-7
Online ISBN: 978-3-319-62389-4
eBook Packages: Computer ScienceComputer Science (R0)