Long-Term Goal Discovery in the Twitter Posts through the Word-Pair LDA Model

Zhu, Dandan; Fukazawa, Yusuke; Karapetsas, Eleftherios; Ota, Jun

doi:10.1007/978-3-642-33983-7_26

Long-Term Goal Discovery in the Twitter Posts through the Word-Pair LDA Model

Dandan Zhu²⁰,
Yusuke Fukazawa²¹,
Eleftherios Karapetsas²⁰ &
…
Jun Ota²⁰

Conference paper

1601 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 7614))

Abstract

We used the twitter posts about New Year’s resolutions as data source to capture users’ long-term goals. New Year’s resolutions are the commitments that people set for their personal goals, and generally, people plan to fulfill them for the whole following year. Therefore, we can think of such tweets as data source to explore people’s possible long-term goals. The key words in each tweet were extracted for clustering. Considering the form of word-pairs led by verbs is a more intuitive and clearer way to express people’s intentions than the one of separate words, we propose a generative model that incorporates word connections into the smoothed LDA to cluster the key words of long-term goals. The experiments demonstrate the proposed model is capable of clustering the word-pairs with better intuitive character, and clearly dividing people’s long-term goals.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Erosheva, E., Fienberg, S., Lafferty, J.: Mixed-membership models of scientific publications. Proc. of the National Academy of Sciences, 5220–5227 (2004)
Google Scholar
Hoffman, T.: Probabilistic Latent Semantic Analysis. In: Proc. of Uncertainty in Artificial Intelligence, UAI (1999)
Google Scholar
Nallapati, R., Ahmed, A., Xing, E.P., Cohen, W.W.: Joint Latent Topic Models for Text and Citations. In: Proc. of KDD, pp. 24–27 (2008)
Google Scholar
Andrzejewski, D., Zhu, X., Craven, M.: Incorporating Domain Knowledge into Topic Modeling via Dirichlet Forest Priors. In: Proc. of ICML (2009)
Google Scholar
Schmid, H.: Improvements in Part-of-Speech Tagging with an Application to German. In: Proc. of the ACL SIGDAT-Workshop, pp. 47–50 (1995)
Google Scholar

Download references

Author information

Authors and Affiliations

The University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa-shi, Chiba, 277-8568, Japan
Dandan Zhu, Eleftherios Karapetsas & Jun Ota
Services & Solution Development Dept., NTTDOCOMO, Inc., NTT DOCOMO R&D Center, 3-5 Hikarinooka, Yokosuka, Kanagawa, 239-8536, Japan
Yusuke Fukazawa

Authors

Dandan Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Yusuke Fukazawa
View author publications
You can also search for this author in PubMed Google Scholar
Eleftherios Karapetsas
View author publications
You can also search for this author in PubMed Google Scholar
Jun Ota
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Information and Media Center, Toyohashi Universtiy of Technology, 1-1 Hibarigaoka, Tenpakucho, 441-8580, Toyohashi, Japan
Hitoshi Isahara & Kyoko Kanzaki &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhu, D., Fukazawa, Y., Karapetsas, E., Ota, J. (2012). Long-Term Goal Discovery in the Twitter Posts through the Word-Pair LDA Model. In: Isahara, H., Kanzaki, K. (eds) Advances in Natural Language Processing. JapTAL 2012. Lecture Notes in Computer Science(), vol 7614. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33983-7_26

Download citation

DOI: https://doi.org/10.1007/978-3-642-33983-7_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33982-0
Online ISBN: 978-3-642-33983-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics