Abstract
We use Twitter posts about New Year’s resolutions as a data source for capturing users’ long-term goals. New Year’s resolutions are commitments that people make toward personal goals, and people generally intend to pursue them throughout the following year. Such tweets can therefore serve as a data source for exploring people’s likely long-term goals. Key words are extracted from each tweet for clustering. Because verb-led word pairs express people’s intentions more intuitively and clearly than separate words do, we propose a generative model that incorporates word connections into smoothed LDA to cluster the key words of long-term goals. Experiments demonstrate that the proposed model produces more intuitive word-pair clusters and clearly separates people’s long-term goals.
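To make the idea of verb-led word pairs concrete, here is a minimal sketch of how such pairs might be formed from a tweet before clustering. It is not the authors' extraction procedure, which the abstract does not describe. It assumes the tokens already carry Penn Treebank-style part-of-speech tags, and the function name `verb_led_pairs` and the `window` parameter are hypothetical choices made for illustration.

```python
from typing import List, Tuple

# (word, POS tag); Penn Treebank-style tags are an assumption of this sketch.
TaggedToken = Tuple[str, str]


def verb_led_pairs(tagged: List[TaggedToken], window: int = 3) -> List[Tuple[str, str]]:
    """Pair each verb with the nouns that follow it within `window` tokens."""
    pairs = []
    for i, (word, tag) in enumerate(tagged):
        if not tag.startswith("VB"):
            continue  # only verbs may lead a pair
        # Look ahead at most `window` tokens for nouns to attach to this verb.
        for other, other_tag in tagged[i + 1 : i + 1 + window]:
            if other_tag.startswith("NN"):
                pairs.append((word.lower(), other.lower()))
    return pairs


# A hand-tagged example tweet (illustrative data, not from the paper).
tweet = [("I", "PRP"), ("will", "MD"), ("lose", "VB"), ("weight", "NN"),
         ("and", "CC"), ("read", "VB"), ("more", "JJR"), ("books", "NNS")]
pairs = verb_led_pairs(tweet)
print(pairs)
```

Pairs such as ("lose", "weight") keep the verb and its object together, which is the kind of unit the abstract argues is clearer than the separate words "lose" and "weight" when reading a cluster.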
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhu, D., Fukazawa, Y., Karapetsas, E., Ota, J. (2012). Long-Term Goal Discovery in the Twitter Posts through the Word-Pair LDA Model. In: Isahara, H., Kanzaki, K. (eds) Advances in Natural Language Processing. JapTAL 2012. Lecture Notes in Computer Science, vol 7614. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33983-7_26
DOI: https://doi.org/10.1007/978-3-642-33983-7_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33982-0
Online ISBN: 978-3-642-33983-7
eBook Packages: Computer Science, Computer Science (R0)