Abstract
Today, Internet users are much more willing to express themselves on online social media channels. They commonly share their daily activities, their thoughts or feelings, and even their intention (e.g., buy a camera, rent an apartment, borrow a loan, etc.) about what they plan to do on blogs, forums, and especially online social networks. Understanding intents of online users, therefore, has become a crucial need for many enterprises operating in different business areas like production, banking, retail, e–commerce, and online advertising. In this paper, we will present a machine learning approach to analyze users’ posts and comments on online social media to filter posts or comments containing user plans or intents. Fully understanding user intent in social media texts is a complicated process including three major stages: user intent filtering, intent domain identification, and intent parsing and extraction. In the scope of this study, we will propose a solution to the first one, that is, building a binary classification model to determine whether a post or comment carries an intent or not. We carefully conducted an empirical evaluation for our model on a medium–sized collection of posts in Vietnamese and achieved promising results with an average accuracy of more than 90 %.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
Time Person of the Year (2006): You (i.e., the Internet users).
References
Ashkan, A., Clarke, C.L.A., Agichtein, E., Guo, Q.: Classifying and characterizing query intent. In: Boughanem, M., Berrut, C., Mothe, J., Soule-Dupuy, C. (eds.) ECIR 2009. LNCS, vol. 5478, pp. 578–586. Springer, Heidelberg (2009)
Berger, A., Pietra, S.A.D., Pietra, V.J.D.: A maximum entropy approach to natural language processing. Comput. Linguist. 22(1), 39–71 (1996)
Bratman, M.: Intention, plans, and practical reason. Harvard University Press, Cambridge (1987)
Chen, L.: Understanding and exploiting user intent in community question answering. Ph.D. Dissertation, Birkbeck University of London (2014)
Chen, Z., Lin, F., Liu, H., Liu, Y., Ma, W.Y., Wenyin, L.: User intention modeling in web applications using data mining. J. WWW 5(3), 181–191 (2002)
Church, K., Smyth, B.: Understanding the intent behind mobile information needs. In: The 14th IUI (2009)
Dai, H., Nie, Z., Wang, L., Wen, J.R., Zhao, L., Li, Y.: Detecting online commercial intention. In: The WWW (2006)
Hu, J., Wang, G., Lochovsky, F., Sun, J.T., Chen, Z.: Undertanding user’s query intent with wikipedia. In: The WWW (2009)
Hu, D.H., Shen, D., Sun, J.-T., Yang, Q., Chen, Z.: Context-aware online commercial intention detection. In: Zhou, Z.-H., Washio, T. (eds.) ACML 2009. LNCS, vol. 5828, pp. 135–149. Springer, Heidelberg (2009)
Jethava, V., Liliana, C.B., Ricardo, B.Y.: Scalable multi-dimensional user intent identification using tree structured distributions. In: The ACM SIGIR (2011)
Kroll, M., Strohmaier, M.: Analyzing human intentions in natural language text. In: The K-CApP (2009)
Lee, U., Liu, Z., Cho, J.: Automatic identification of user goals in web search. In: The WWW (2005)
Li, X.: Understanding the semantic structure of noun phrase queries. In: ACL (2010)
Liu, D., Nocedal, J.: On the limited memory BFGS method for large-scale optimization. Math. Program. 45, 503–528 (1989)
Malouf, R.: A comparison of algorithms for maximum entropy parameter estimation. In: COLING, pp. 1–7 (2002)
Nigam, K., Lafferty, J., McCallum, A.: Using maximum entropy for text classification. In: IJCAI Workshop on Machine Learning for Information Filtering, pp. 61–69 (1999)
Rose, D.E., Levinson, D.: Understanding user goals in web search. In: WWW (2004)
Shen, Y., Yan, J., Yan, S., Ji, L., Liu, N., Chen, Z.: Sparse hidden-dynamic conditional random fields for user intent understanding. In: The WWW (2011)
Acknowledgements
This work was supported by the project QG.15.29 from Vietnam National University, Hanoi (VNU).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Luong, TL., Tran, TH., Truong, QT., Truong, TMN., Phi, TT., Phan, XH. (2016). Learning to Filter User Explicit Intents in Online Vietnamese Social Media Texts. In: Nguyen, N.T., Trawiński, B., Fujita, H., Hong, TP. (eds) Intelligent Information and Database Systems. ACIIDS 2016. Lecture Notes in Computer Science(), vol 9622. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-49390-8_2
Download citation
DOI: https://doi.org/10.1007/978-3-662-49390-8_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-49389-2
Online ISBN: 978-3-662-49390-8
eBook Packages: Computer ScienceComputer Science (R0)