Skip to main content

Scenic Negative Comment Clustering Based on Balance Weighted Comment Topic Model

  • Conference paper
  • First Online:
Book cover Proceedings of 2017 Chinese Intelligent Systems Conference (CISC 2017)

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 459))

Included in the following conference series:

  • 1202 Accesses

Abstract

The scenic comment information from visitors often hidden the different aspects of the recommendations and expectations of the attractions, the extraction of these key information will help the spot managers find their own shortcomings and improve themselves. In this paper, we improved the author topic model and proposed a model of clustering the negative comments of the scenic spots. There are two improvements from our proposed model. Firstly, we added the importance of the comment category to the text clustering. Secondly, in order to prevent the stop words accumulating in the sampling process, we introduced the balance weight to the proposed model. Experiments showed that the model could not only effectively cluster these data, but also could extract the rich information related to different comment categories from the clustering results, which could help the managers of the scenic spots to better manage the attractions and attract tourists.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Kavitha Karun A, Mintu P, Lubna K. Comparative analysis of similarity measures in document clustering. In: International conference on green computing, communication and conservation of energy;2013. p. 857–60.

    Google Scholar 

  2. Bharti KK, Singh PK. Hybrid dimension reduction by integrating feature selection with feature extraction method for text clustering. Expert Syst Appl Int J. 2015;42(6):3105–14.

    Article  Google Scholar 

  3. Lee S, Kim J, Myaeng S H. An extension of topic models for text classification: a term weighting approach. In: International conference on big data and smart computing. IEEE;2015. p. 217–24.

    Google Scholar 

  4. Hashimoto T, Chakraborty B, Aramvith S, et al. Affected people’s needs detection after the East Japan Great Earthquake—time series analysis using LDA. In: Asia-Pacific signal and information processing association, 2014 summit and conference. IEEE;2014. p. 1–6.

    Google Scholar 

  5. Liu G, Xu X, Zhu Y, et al. An improved latent Dirichlet allocation model for hot topic extraction. In: IEEE fourth international conference on big data and cloud computing. IEEE;2014. p. 470–6.

    Google Scholar 

  6. Ding Y, Yan S, Xiao Y, et al. Ranking algorithm based on relational topic model. In: International joint conference on neural networks. IEEE;2015. p. 1–8.

    Google Scholar 

  7. Yang Z, Kotov A, Mohan A, et al. Parametric and non-parametric user-aware sentiment topic models. In: International ACM SIGIR conference on research and development in information retrieval. ACM;2015. p. 413–22.

    Google Scholar 

  8. Pyo S, Kim E, Kim M. LDA-based unified topic modeling for similar TV user grouping and TV program recommendation. IEEE Trans Cybern. 2014;45(8):1476–90.

    Google Scholar 

  9. Li HE, Jia Y, Han W, et al. Mining user interest in microblogs with a user-topic model. Commun China. 2014;11(8):131–44.

    Article  Google Scholar 

  10. Lu Y, Okada S, Nitta K. Semi-supervised latent Dirichlet allocation for multi-label text classification. In: Recent trends in applied artificial intelligence;2013. p. 351–60.

    Google Scholar 

  11. Huynh T, Fritz M, Schiele B. Discovery of activity patterns using topic models. In: UBICOMP 2008: ubiquitous computing, international conference, UBICOMP 2008, Seoul, Korea, 21–24 Sept 2008, Proceedings. DBLP;2008. p. 10–9.

    Google Scholar 

  12. Ichise R, Fujita S, Muraki T, et al. Research mining using the relationships among authors, topics and papers. In: International conference information visualization. IEEE Computer Society;2007. p. 425–30.

    Google Scholar 

  13. Cai H, Yang Y, Li X, et al. What are popular: exploring twitter features for event detection, tracking and visualization. ACM international conference on multimedia. ACM;2015. p. 89–98.

    Google Scholar 

  14. Blei DM, Ng AY, Jordan MI. Latent Dirichlet allocation. J Mach Learn Res. 2003;3:993–1022.

    MATH  Google Scholar 

Download references

Acknowledgements

This work is supported by the National Natural Science Foundation of China (No. 61320106006, No. 61532006, No. 61502042).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Junping Du .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Lin, Z., Du, J., Li, Y., Ye, L., Luo, A. (2018). Scenic Negative Comment Clustering Based on Balance Weighted Comment Topic Model. In: Jia, Y., Du, J., Zhang, W. (eds) Proceedings of 2017 Chinese Intelligent Systems Conference. CISC 2017. Lecture Notes in Electrical Engineering, vol 459. Springer, Singapore. https://doi.org/10.1007/978-981-10-6496-8_28

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-6496-8_28

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-6495-1

  • Online ISBN: 978-981-10-6496-8

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics