Abstract
User profiling is one of the key issues in personalized recommendation systems. A content curation social network is a content-centric network that encourages users to repin items from other users and other websites, and lets them arrange the pins according to their interests. It is therefore possible to estimate a user's interests from the pins. In this paper, we propose a user profiling approach that combines a topic model with pointwise mutual information (TM-PMI). We first extract each pin's description and then apply latent Dirichlet allocation (LDA), a topic modeling scheme. A three-layer hierarchical Bayesian model of user-topic-word is thus obtained. Then, a personal model is obtained by selecting a set of correlated words subject to constraints on word probability and PMI. The experimental results confirm the efficiency of the proposed approach.
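The word-selection step can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state how PMI is estimated or what the constraints are, so the sketch assumes document-level co-occurrence counts, PMI(w1, w2) = log2(p(w1, w2) / (p(w1) p(w2))), and two hypothetical thresholds, `min_prob` and `min_pmi`. The function names, the toy pin descriptions, and the word probabilities (a stand-in for what LDA would supply) are all made up for illustration.

```python
import math
from collections import Counter
from itertools import combinations


def pmi_table(docs):
    """Document-level PMI for every word pair that co-occurs at least once."""
    n_docs = len(docs)
    word_df = Counter()  # number of documents containing each word
    pair_df = Counter()  # number of documents containing both words of a pair
    for doc in docs:
        vocab = sorted(set(doc))  # sorted so each pair has one canonical order
        word_df.update(vocab)
        pair_df.update(combinations(vocab, 2))
    return {
        (w1, w2): math.log2(n_docs * count / (word_df[w1] * word_df[w2]))
        for (w1, w2), count in pair_df.items()
    }


def select_profile_words(word_prob, pmi, min_prob, min_pmi):
    """Keep probable words that are correlated with another probable word.

    min_prob and min_pmi are hypothetical thresholds, not values from the paper.
    """
    candidates = {w for w, p in word_prob.items() if p >= min_prob}
    selected = set()
    for (w1, w2), score in pmi.items():
        if w1 in candidates and w2 in candidates and score >= min_pmi:
            selected.update((w1, w2))
    return sorted(selected)


# Toy pin descriptions, already tokenized (made-up data).
pins = [
    ["vintage", "dress", "lace"],
    ["vintage", "dress", "floral"],
    ["chocolate", "cake", "recipe"],
    ["chocolate", "cake", "dress"],
]
# Stand-in for the per-user word probabilities LDA would supply (made-up values).
word_prob = {
    "vintage": 0.30, "dress": 0.35, "lace": 0.05,
    "chocolate": 0.15, "cake": 0.10, "recipe": 0.05,
}

pmi = pmi_table(pins)
profile = select_profile_words(word_prob, pmi, min_prob=0.10, min_pmi=0.4)
print(profile)
```

On this toy data the sketch keeps `cake`, `chocolate`, `dress`, and `vintage`: each is probable enough and has a sufficiently high PMI with another probable word, while the pair (`cake`, `dress`) is rejected because its PMI is negative.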
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Wu, L., Wang, D., Guo, C., Zhang, J., Chen, C.W. (2016). User Profiling by Combining Topic Modeling and Pointwise Mutual Information (TM-PMI). In: Tian, Q., Sebe, N., Qi, GJ., Huet, B., Hong, R., Liu, X. (eds) MultiMedia Modeling. MMM 2016. Lecture Notes in Computer Science, vol 9517. Springer, Cham. https://doi.org/10.1007/978-3-319-27674-8_14
DOI: https://doi.org/10.1007/978-3-319-27674-8_14
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-27673-1
Online ISBN: 978-3-319-27674-8
eBook Packages: Computer Science, Computer Science (R0)