Skip to main content

Exploiting Capacity-Constrained K-Means Clustering for Aspect-Phrase Grouping

  • Conference paper
  • First Online:
  • 2971 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9403))

Abstract

Aspect-phrase clustering is an important task for aspect finding in aspect-level sentiment analysis. Most of existing methods for this problem are based on a context model which aggregates related sentences that contains assigned aspect-phrase as context. In this paper, we explore a novel idea, capacity limitation, which states that the number of aggregated sentences in an aspect-phrase group has upper bound. And we propose a capacity constrained K-means algorithm to cluster aspect-phrases which encodes the capacity limitation as constraint. Empirical evaluation shows that the proposed method outperforms existing state-of-the-art methods.

This is a preview of subscription content, log in via an institution.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Basu, S., Banerjee, A., Mooney, R.: Semi-supervised clustering by seeding. In: Proc. of ICML (2002)

    Google Scholar 

  2. Chen, Z., Mukherjee, A., Liu, B., Hsu, M., Castellanos, M., Ghosh, R.: Exploiting domain knowledge in aspect extraction. In: Proc. of EMNLP, pp. 1655–1667. ACL (2013)

    Google Scholar 

  3. Fang, L., Huang, M., Zhu, X.: Exploring weakly supervised latent sentiment explanations for aspect-level review analysis. In: Proc. of CIKM, pp. 1057–1066. ACM (2013)

    Google Scholar 

  4. Ghosh, A.B.J.: On scaling up balanced clustering algorithms, p. 333. Society for Industrial and Applied Mathematics (2002)

    Google Scholar 

  5. Guo, H., Zhu, H., Guo, Z., Zhang, X., Su, Z.: Product feature categorization with multilevel latent semantic association. In: Proc. of CIKM, pp. 1087–1096. ACM (2009)

    Google Scholar 

  6. Hu, M., Liu, B.: Mining and summarizing customer reviews. In: Proc. of KDD, pp. 168–177. ACM (2004)

    Google Scholar 

  7. Jin, W., Ho, H.H., Srihari, R.K.: Opinionminer: A novel machine learning system for web opinion mining and extraction. In: Proc. of KDD, pp. 1195–1204. ACM, New York (2009)

    Google Scholar 

  8. Jo, Y., Oh, A.H.: Aspect and sentiment unification model for online review analysis. In: Proc. of WSDM, pp. 815–824. ACM, New York (2011)

    Google Scholar 

  9. Kim, S.M., Hovy, E.: Extracting opinions, opinion holders, and topics expressed in online news media text. In: Proc. of ACL Workshop on Sentiment and Subjectivity in Text, pp. 1–8. Association for Computational Linguistics, Sydney (2006)

    Google Scholar 

  10. Kobayashi, N., Inui, K., Matsumoto, Y.: Extracting aspect-evaluation and aspect-of relations in opinion mining. In: Proc. of EMNLP-CoNLL, pp. 1065–1074. Association for Computational Linguistics, Prague (2007)

    Google Scholar 

  11. Ku, L.W., Liang, Y.T., Chen, H.H.: Opinion extraction, summarization and tracking in news and blog corpora. In: Proc. of AAAI-CAAW, vol. 100107 (2006)

    Google Scholar 

  12. Lin, C., He, Y.: Joint sentiment/topic model for sentiment analysis. In: Proc. of CIKM, pp. 375–384. ACM, Hong Kong (2009), 1646003

    Google Scholar 

  13. Liu, B.: Sentiment Analysis And Opinion Mining. Morgan Claypool Publishers (2012)

    Google Scholar 

  14. Liu, K., Xu, L., Zhao, J.: Extracting opinion targets and opinion words from online reviews with graph co-ranking. In: Proc. of ACL, pp. 314–324. Association for Computational Linguistics (2014)

    Google Scholar 

  15. Lu, B., Ott, M., Cardie, C., Tsou, B.K.: Multi-aspect sentiment analysis with topic models. In: Proc. of ICDMW, pp. 81–88. IEEE (2011)

    Google Scholar 

  16. Mei, Q., Ling, X., Wondra, M., Su, H., Zhai, C.: Topic sentiment mixture: Modeling facets and opinions in weblogs. In: Proc. of WWW, pp. 171–180. ACM, New York (2007)

    Google Scholar 

  17. Moghaddam, S., Ester, M.: On the design of lda models for aspect-based opinion mining. In: Proc. of CIKM, pp. 803–812. ACM (2012), 2396863

    Google Scholar 

  18. Pang, B., Lee, L.: Opinion mining and sentiment analysis. Foundations and Trends in Information Retrieval 2(1–2), 1–135 (2008)

    Article  Google Scholar 

  19. Titov, I., Ryan, M.: Modeling online reviews with multi-grain topic models. In: Proc. of WWW, pp. 111–120 (2008)

    Google Scholar 

  20. Wagstaff, K., Cardie, C., Rogers, S., Schr O Dl, S.: Constrained k-means clustering with background knowledge. In: Proc. of ICML, pp. 577–584. Morgan Kaufmann Publishers Inc. (2001)

    Google Scholar 

  21. Zhai, Z., Liu, B., Xu, H., Jia, P.: Grouping product features using semi-supervised learning with soft-constraints. In: Proc. of COLING, pp. 1272–1280 (2010)

    Google Scholar 

  22. Zhai, Z., Liu, B., Xu, H., Jia, P.: Clustering product features for opinion mining. In: Proc. of WSDM, pp. 347–354. ACM (2011), 1935884

    Google Scholar 

  23. Zhai, Z., Liu, B., Xu, H., Jia, P.: Constrained lda for grouping product features in opinion mining. In: Proc. of PAKDD, pp. 448–459 (2011)

    Google Scholar 

  24. Zhao, L., Huang, M., Chen, H., Cheng, J., Zhu, X.: Clustering aspect-related phrases by leveraging sentiment distribution consistency. In: Proc. of EMNLP, pp. 1614–1623. Association for Computational Linguistics (2014)

    Google Scholar 

  25. Zhao, W.X., Jiang, J., Yan, H., Li, X.: Jointly modeling aspects and opinions with a maxent-lda hybrid. In: Proc. of EMNLP, pp. 56–65. Association for Computational Linguistics (2010), 1870664

    Google Scholar 

  26. Zhong, S., Ghosh, J.: A unified framework for model-based clustering. J. Mach. Learn. Res. 4, 1001–1037 (2003)

    MathSciNet  MATH  Google Scholar 

  27. Zhu, S., Wang, D., Li, T.: Data clustering with size constraints. Knowledge-Based Systems 23(8), 883–889 (2010)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Donghong Ji .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Xiong, S., Ji, D. (2015). Exploiting Capacity-Constrained K-Means Clustering for Aspect-Phrase Grouping. In: Zhang, S., Wirsing, M., Zhang, Z. (eds) Knowledge Science, Engineering and Management. KSEM 2015. Lecture Notes in Computer Science(), vol 9403. Springer, Cham. https://doi.org/10.1007/978-3-319-25159-2_34

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-25159-2_34

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-25158-5

  • Online ISBN: 978-3-319-25159-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics