Skip to main content

Learning to Filter User Explicit Intents in Online Vietnamese Social Media Texts

  • Conference paper
  • 1610 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9622))

Abstract

Today, Internet users are much more willing to express themselves on online social media channels. They commonly share their daily activities, their thoughts or feelings, and even their intention (e.g., buy a camera, rent an apartment, borrow a loan, etc.) about what they plan to do on blogs, forums, and especially online social networks. Understanding intents of online users, therefore, has become a crucial need for many enterprises operating in different business areas like production, banking, retail, e–commerce, and online advertising. In this paper, we will present a machine learning approach to analyze users’ posts and comments on online social media to filter posts or comments containing user plans or intents. Fully understanding user intent in social media texts is a complicated process including three major stages: user intent filtering, intent domain identification, and intent parsing and extraction. In the scope of this study, we will propose a solution to the first one, that is, building a binary classification model to determine whether a post or comment carries an intent or not. We carefully conducted an empirical evaluation for our model on a medium–sized collection of posts in Vietnamese and achieved promising results with an average accuracy of more than 90 %.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    Time Person of the Year (2006): You (i.e., the Internet users).

References

  1. Ashkan, A., Clarke, C.L.A., Agichtein, E., Guo, Q.: Classifying and characterizing query intent. In: Boughanem, M., Berrut, C., Mothe, J., Soule-Dupuy, C. (eds.) ECIR 2009. LNCS, vol. 5478, pp. 578–586. Springer, Heidelberg (2009)

    Chapter  Google Scholar 

  2. Berger, A., Pietra, S.A.D., Pietra, V.J.D.: A maximum entropy approach to natural language processing. Comput. Linguist. 22(1), 39–71 (1996)

    Google Scholar 

  3. Bratman, M.: Intention, plans, and practical reason. Harvard University Press, Cambridge (1987)

    Google Scholar 

  4. Chen, L.: Understanding and exploiting user intent in community question answering. Ph.D. Dissertation, Birkbeck University of London (2014)

    Google Scholar 

  5. Chen, Z., Lin, F., Liu, H., Liu, Y., Ma, W.Y., Wenyin, L.: User intention modeling in web applications using data mining. J. WWW 5(3), 181–191 (2002)

    Google Scholar 

  6. Church, K., Smyth, B.: Understanding the intent behind mobile information needs. In: The 14th IUI (2009)

    Google Scholar 

  7. Dai, H., Nie, Z., Wang, L., Wen, J.R., Zhao, L., Li, Y.: Detecting online commercial intention. In: The WWW (2006)

    Google Scholar 

  8. Hu, J., Wang, G., Lochovsky, F., Sun, J.T., Chen, Z.: Undertanding user’s query intent with wikipedia. In: The WWW (2009)

    Google Scholar 

  9. Hu, D.H., Shen, D., Sun, J.-T., Yang, Q., Chen, Z.: Context-aware online commercial intention detection. In: Zhou, Z.-H., Washio, T. (eds.) ACML 2009. LNCS, vol. 5828, pp. 135–149. Springer, Heidelberg (2009)

    Chapter  Google Scholar 

  10. Jethava, V., Liliana, C.B., Ricardo, B.Y.: Scalable multi-dimensional user intent identification using tree structured distributions. In: The ACM SIGIR (2011)

    Google Scholar 

  11. Kroll, M., Strohmaier, M.: Analyzing human intentions in natural language text. In: The K-CApP (2009)

    Google Scholar 

  12. Lee, U., Liu, Z., Cho, J.: Automatic identification of user goals in web search. In: The WWW (2005)

    Google Scholar 

  13. Li, X.: Understanding the semantic structure of noun phrase queries. In: ACL (2010)

    Google Scholar 

  14. Liu, D., Nocedal, J.: On the limited memory BFGS method for large-scale optimization. Math. Program. 45, 503–528 (1989)

    Article  MathSciNet  MATH  Google Scholar 

  15. Malouf, R.: A comparison of algorithms for maximum entropy parameter estimation. In: COLING, pp. 1–7 (2002)

    Google Scholar 

  16. Nigam, K., Lafferty, J., McCallum, A.: Using maximum entropy for text classification. In: IJCAI Workshop on Machine Learning for Information Filtering, pp. 61–69 (1999)

    Google Scholar 

  17. Rose, D.E., Levinson, D.: Understanding user goals in web search. In: WWW (2004)

    Google Scholar 

  18. Shen, Y., Yan, J., Yan, S., Ji, L., Liu, N., Chen, Z.: Sparse hidden-dynamic conditional random fields for user intent understanding. In: The WWW (2011)

    Google Scholar 

Download references

Acknowledgements

This work was supported by the project QG.15.29 from Vietnam National University, Hanoi (VNU).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Thai-Le Luong .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Luong, TL., Tran, TH., Truong, QT., Truong, TMN., Phi, TT., Phan, XH. (2016). Learning to Filter User Explicit Intents in Online Vietnamese Social Media Texts. In: Nguyen, N.T., Trawiński, B., Fujita, H., Hong, TP. (eds) Intelligent Information and Database Systems. ACIIDS 2016. Lecture Notes in Computer Science(), vol 9622. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-49390-8_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-662-49390-8_2

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-662-49389-2

  • Online ISBN: 978-3-662-49390-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics