Skip to main content

A Local Search Approximation Algorithm for the k-means Problem with Penalties

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 10392))

Abstract

In this paper, we study the k-means problem with (nonuniform) penalties (k-MPWP) which is a natural generalization of the classic k-means problem. In the k-MPWP, we are given an n-client set \( \mathcal {D} \subset \mathbb {R}^d\), a penalty cost \(p_j>0\) for each \(j \in \mathcal {D}\), and an integer \(k \le n\). The goal is to open a center subset \(F \subset \mathbb {R}^d\) with \( |F| \le k\) and to choose a client subset \(P \subseteq \mathcal {D} \) as the penalized client set such that the total cost (including the sum of squares of distance for each client in \( \mathcal {D} \setminus P \) to the nearest open center and the sum of penalty cost for each client in P) is minimized. We offer a local search \(( 81+ \varepsilon )\)-approximation algorithm for the k-MPWP by using single-swap operation. We further improve the above approximation ratio to \(( 25+ \varepsilon )\) by using multi-swap operation.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Aloise, D., Deshpande, A., Hansen, P., Popat, P.: NP-hardness of Euclidean sum-of-squares clustering. Mach. Learn. 75, 245–249 (2009)

    Article  Google Scholar 

  2. Arya, V., Garg, N., Khandekar, R., Meyerson, A., Munagala, K., Pandit, V.: Local search heuristics for \(k\)-median and facility location problems. SIAM J. Comput. 33, 544–562 (2004)

    Article  MathSciNet  MATH  Google Scholar 

  3. Bandyapadhyay, S., Varadarajan, K.: On variants of \(k\)-means clustering. In: Proceedings of SoCG, Article No. 14, pp. 14:1–14:15 (2016)

    Google Scholar 

  4. Byrka, J., Pensyl, T., Rybicki, B., Srinivasan, A., Trinh, K.: An improved approximation for \(k\)-median, and positive correlation in budgeted optimization. In: Proceedings of SODA, pp. 737–756 (2014)

    Google Scholar 

  5. Charikar, M., Guha, S.: Improved combinatorial algorithms for the facility location and \(k\)-median problems. In: Proceedings of FOCS, pp. 378–388 (1999)

    Google Scholar 

  6. Charikar, M., Guha, S., Tardos, \(\acute{\rm E}\)., Shmoys, D.B. A constant-factor approximation algorithm for the \(k\)-median problem. In: Proceedings of STOC, pp. 1–10 (1999)

    Google Scholar 

  7. Charikar, M., Khuller, S., Mount, D.M., Narasimhan, G.: Algorithms for facility location problems with outliers. In: Proceedings of SODA, pp. 642–651 (2001)

    Google Scholar 

  8. Dasgupta, S. The hardness of \(k\)-means clustering. Technical Report CS2007-0890, University of California, San Diego (2007)

    Google Scholar 

  9. Georgogiannis, A.: Robust \(k\)-means: a theoretical revisit. In: Proceedings of NIPS, pp. 2883–2891 (2016)

    Google Scholar 

  10. Kanungo, T., Mount, D.M., Netanyahu, N.S., Piatko, C.D., Silverman, R., Wu, A.Y.: A local search approximation algorithm for \(k\)-means clustering. Comput. Geom. Theory Appl. 28, 89–112 (2004)

    Article  MathSciNet  MATH  Google Scholar 

  11. Li, Y., Du, D., Xiu, N., Xu, D.: Improved approximation algorithms for the facility location problems with linear/submodular penalties. Algorithmica 73, 460–482 (2015)

    Article  MathSciNet  MATH  Google Scholar 

  12. Li, S., Svensson, O.: Approximating \(k\)-median via pseudo-approximation. In: Proceedings of STOC, pp. 901–910 (2013)

    Google Scholar 

  13. Lloyd, S.: Least squares quantization in PCM. Technical report, Bell Laboratories (1957)

    Google Scholar 

  14. Lloyd, S.: Least squares quantization in PCM. IEEE Trans. Inf. Theory 28, 129–137 (1982)

    Article  MathSciNet  MATH  Google Scholar 

  15. Mahajan, M., Nimbhorkar, P., Varadarajan, K.: The planar \(k\)-means problem is NP-hard. In: Proceedings of WALCOM, pp. 274–285 (2009)

    Google Scholar 

  16. Makarychev, K., Makarychev, Y., Sviridenko, M., Ward, J.: A bi-criteria approximation algorithm for \(k\)-means. In: Proceedings of APPROX/RONDOM, Article No. 14, pp. 14:1–14:20 (2016)

    Google Scholar 

  17. Tseng, G.C.: Penalized and weighted \(k\)-means for clustering with scattered objects and prior information in high-throughput biological data. Bioinformatics 23, 2247–2255 (2007)

    Article  Google Scholar 

  18. Matoušek, J.: On approximate geometric \(k\)-clustering. Discrete Comput. Geom. 24, 61–84 (2000)

    Article  MathSciNet  MATH  Google Scholar 

  19. Ward, J. Private Communication (2017)

    Google Scholar 

  20. Zhang, P.: A new approximation algorithm for the \(k\)-facility location problem. Theoret. Comput. Sci. 384, 126–135 (2007)

    Article  MathSciNet  MATH  Google Scholar 

Download references

Acknowledgements

The research of the first author is supported by Higher Educational Science and Technology Program of Shandong Province (No. J15LN23). The second author is supported by Ri-Xin Talents Project of Beijing University of Technology. The third author is supported by Natural Science Foundation of China (No. 11501412). The fourth author is supported by Natural Science Foundation of China (No. 11531014). The fifth author is supported by Beijing Excellent Talents Funding (No. 2014000020124G046).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Dachuan Xu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Zhang, D., Hao, C., Wu, C., Xu, D., Zhang, Z. (2017). A Local Search Approximation Algorithm for the k-means Problem with Penalties. In: Cao, Y., Chen, J. (eds) Computing and Combinatorics. COCOON 2017. Lecture Notes in Computer Science(), vol 10392. Springer, Cham. https://doi.org/10.1007/978-3-319-62389-4_47

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-62389-4_47

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-62388-7

  • Online ISBN: 978-3-319-62389-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics