Abstract
In this paper, we study the spherical k-means problem (SKMP) which is one of the most well-studied clustering problems. In the SKMP, we are given an n-client set \( \mathcal {D}\) in d-dimensional unit sphere \(\mathbb {S}^d\), and an integer \(k \le n\). The goal is to open a center subset \(F \subset \mathbb {S}^d\) with \( |F| \le k\) that minimizes the sum of cosine dissimilarity measure for each client in \( \mathcal {D} \) to the nearest open center. We give a \((2 (4+\sqrt{7}) + \varepsilon )\)-approximation algorithm for this problem using local search scheme.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ahmadian, S., Norouzi-Fard, A., Svensson, O., Ward, J.: Better guarantees for \(k\)-means and Euclidean \(k\)-median by primal-dual algorithms. In: Proceedings of FOCS, pp. 61–72 (2017)
Aloise, D., Deshpande, A., Hansen, P., Popat, P.: NP-hardness of Euclidean sum-of-squares clustering. Mach. Learn. 75, 245–249 (2009)
Arthur, D., Vassilvitskii, S.: \(k\)-means++: the advantages of careful seeding. In: Proceedings of SODA, pp. 1027–1035 (2007)
Arya, V., Garg, N., Khandekar, R., Meyerson, A., Munagala, K., Pandit, V.: Local search heuristics for \(k\)-median and facility location problems. SIAM J. Comput. 33, 544–562 (2004)
Byrka, J., Pensyl, T., Rybicki, B., Srinivasan, A., Trinh, K.: An improved approximation for \(k\)-median, and positive correlation in budgeted optimization. ACM Trans. Algorithms 13(2) (2017). Article No. 23
Charikar, M., Guha, S.: Improved combinatorial algorithms for the facility location and \(k\)-median problems. In: Proceedings of FOCS, pp. 378–388 (1999)
Charikar, M., Guha, S., Tardos, \(\acute{\rm E}\)., Shmoys, D.B.: A constant-factor approximation algorithm for the \(k\)-median problem. J. Comput. Syst. Sci. 65, 129–149 (2002)
Cohen-Addad, V., Mathieu, C.: Effectiveness of local search for geometric optimization. In: Proceedings of SoCG, pp. 329–343 (2015)
Cohen-Addad, V., Klein, P.N., Mathieu, C.: Local search yields approximation schemes for \(k\)-means and \(k\)-median in Euclidean and minor-free metrics. In: Proceedings of FOCS, pp. 353–364 (2016)
Cohen-Addad, V., Schwiegelshohn, C.: On the local structure of stable clustering instances. In: Proceedings of FOCS, pp. 49–60 (2017)
Dasgupta, S.: The hardness of \(k\)-means clustering. Technical report CS2007-0890, University of California, San Diego (2007)
Dhillon, I.S., Modha, D.S.: Concept decompositions for large sparse text data using clustering. Mach. Learn. 42, 143–175 (2001)
Endo, Y., Miyamoto, S.: Spherical \(k\)-means\(++\) clustering. In: Proceedings of MDAI, pp. 103–114 (2015)
Friggstad, Z., Rezapour, M., Salavatipour, M.R.: Local search yields a PTAS for \(k\)-means in doubling metrics. In: Proceedings of FOCS, pp. 365–374 (2016)
Friggstad, Z., Zhang, Y.: Tight analysis of a multiple-swap heurstic for budgeted red-blue median. In: Proceedings of ICALP, pp. 75:1–75:13 (2016)
Hornik, K., Feinerer, I., Kober, M., Buchta, C.: Spherical \(k\)-means clustering. J. Stat. Softw. 50(10), 1–22 (2012)
Jain, K., Vazirani, V.V.: Approximation algorithms for metric facility location and \(k\)-median problems using the primal-dual schema and Lagrangian relaxation. J. ACM 48, 274–296 (2001)
Kanungo, T., Mount, D.M., Netanyahu, N.S., Piatko, C.D., Silverman, R., Wu, A.Y.: A local search approximation algorithm for \(k\)-means clustering. Comput. Geom. Theory Appl. 28, 89–112 (2004)
Li, M., Xu, D., Zhang, D., Zou, J.: The seeding algorithms for spherical \(k\)-means clustering. J. Glob. Optim. 1–14 (2019). https://doi.org/10.1007/s10898-019-00779-w
Li, S., Svensson, O.: Approximating \(k\)-median via pseudo-approximation. SIAM J. Comput. 45, 530–547 (2016)
Lloyd, S.: Least squares quantization in PCM. Technical report, Bell Laboratories (1957)
Lloyd, S.: Least squares quantization in PCM. IEEE Trans. Inf. Theory 28, 129–137 (1982)
Mahajan, M., Nimbhorkar, P., Varadarajan K.: The planar \(k\)-means problem is NP-hard. In: Proceedings of WALCOM, pp. 274–285 (2009)
Makarychev, K., Makarychev, Y., Sviridenko, M., Ward, J.: A bi-criteria approximation algorithm for \(k\)-means. In: Proceedings of APPROX/RONDOM, pp. 14:1–14:20 (2016). Article No. 14
Matou\(\check{\rm s}\)ek, J.: On approximate geometric \(k\)-clustering. Discrete Comput. Geom. 24, 61–84 (2000)
Zhang, P.: A new approximation algorithm for the \(k\)-facility location problem. Theor. Comput. Sci. 384, 126–135 (2007)
Acknowledgements
The first author is supported by Natural Science Foundation of China (No. 11871081). The second author is supported by Natural Science Foundation of China (No. 11871366). The third author is the Higher Educational Science and Technology Program of Shandong Province (No. J17KA171). The fourth author is supported by Natural Science Foundation of China (No. 61433012), Shenzhen Research Grant (KQJSCX2018033017 0311901, JCYJ20180305180840138 and GFW2017073114031767), and Hong Kong GRF 17210017. The fifth author is supported by Natural Science Foundation of China (No. 11531014).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Zhang, D., Cheng, Y., Li, M., Wang, Y., Xu, D. (2019). Local Search Approximation Algorithms for the Spherical k-Means Problem. In: Du, DZ., Li, L., Sun, X., Zhang, J. (eds) Algorithmic Aspects in Information and Management. AAIM 2019. Lecture Notes in Computer Science(), vol 11640. Springer, Cham. https://doi.org/10.1007/978-3-030-27195-4_31
Download citation
DOI: https://doi.org/10.1007/978-3-030-27195-4_31
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-27194-7
Online ISBN: 978-3-030-27195-4
eBook Packages: Computer ScienceComputer Science (R0)