Skip to main content

User Profiling by Combining Topic Modeling and Pointwise Mutual Information (TM-PMI)

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9517))

Abstract

User profiling is one of the key issues in personalized recommendation systems. A content curation social network is a content-centric network; it encourages users to repin items from other users and other websites. It further permits users to arrange the pins according to their interests. It is therefore possible to estimate user interest from the pins. In this paper, we propose a user profiling approach to combining topic model and pointwise mutual information(TM-PMI). We first extract a pin?s description, and then apply latent Dirichlet allocation (LDA, one of the topic modeling schemes). A three-layer hierarchical Bayesian model of user-topic-word is thus obtained. Then, a personal model is obtained by selecting a set of correlated words with constraints of word probability and PMI. The experimental results confirm the efficiency of the proposed approach.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Hall, C., Zarro, M.: Social curation on the website Pinterest.com. Proc. Am. Soc. Inf. Sci. Technol. 49(1), 1?9 (2012)

    Google Scholar 

  2. Gilbert, E., Bakhshi, S., Chang, S., Terveen, L.: ?I need to try this?? a statistical overview of pinterest. In: CHI, pp. 2427?2436. ACM (2013)

    Google Scholar 

  3. Geng, X., Zhang, H., Song, Z., Yang, Y., Luan, H., Chua, T.: One of a kind: user profiling by social curation. In: Proceedings of the ACM International Conference on Multimedia, Orlando, pp. 567?576. ACM (2014)

    Google Scholar 

  4. Bernardini, C., Silverston, T., Festor, O.: A Pin is worth a thousand words: characterization of publications in Pinterest, pp. 322?327. IEEE (2014)

    Google Scholar 

  5. Blei, D., Carin, L., Dunson, D.: Probabilistic topic models: a focus on graphical model design and applications to document and image analysis. IEEE Signal Process. Mag. 27(6), 55?65 (2010)

    Google Scholar 

  6. Hofmann, T.: Probabilistic latent semantic analysis. In: Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence, Stockholm, pp. 289?296. Morgan Kaufmann Publishers Inc., (1999)

    Google Scholar 

  7. Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3(4?5), 993?1022 (2003). doi:10.1162/jmlr.2003.3.4-5.993

    MATH  Google Scholar 

  8. Rosen-Zvi, M., Griffiths, T., Steyvers, M., Smyth, P.: The author-topic model for authors and documents. In: Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence, Banff, pp. 487?494. AUAI Press (2004)

    Google Scholar 

  9. McCallum, A., Wang, X., Corrada-Emmanuel, A.: Topic and role discovery in social networks with experiments on enron and academic email. J. Artif. Intell. Res. 30, 249?272 (2007)

    Google Scholar 

  10. Weng, J., Lim, E., Jiang, J., He, Q.: TwitterRank: finding topic-sensitive influential twitterers. In: Proceedings of the Third ACM International Conference on Web Search and Data Mining, New York, pp. 261?270. ACM (2010)

    Google Scholar 

  11. Zhang, H., Yu, H., Xiong, D., Liu, Q.: HHMM-based Chinese lexical analyzer ICTCLAS. In: Proceedings of the Second SIGHAN Workshop on Chinese Language Processing, Vol. 17, Sapporo, Association for Computational Linguistics, pp. 184?187 (2003)

    Google Scholar 

  12. Griffiths, T.L., Steyvers, M.: Finding scientific topics. Proc. Natl. Acad. Sci. USA 1011, 5228?5235 (2004). doi:10.1073/pnas.0307752101

    Article  Google Scholar 

  13. Heinrich, G.: Parameter estimation for text analysis. Technical Note Version 2.4, vsonix. Technical Report (2008)

    Google Scholar 

  14. https://en.wikipedia.org/wiki/Pointwise_mutual_information

  15. https://en.wikipedia.org/wiki/Levenshtein_distance

  16. https://en.wikipedia.org/wiki/Pareto_principle

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lifang Wu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Wu, L., Wang, D., Guo, C., Zhang, J., Chen, C.w. (2016). User Profiling by Combining Topic Modeling and Pointwise Mutual Information (TM-PMI). In: Tian, Q., Sebe, N., Qi, GJ., Huet, B., Hong, R., Liu, X. (eds) MultiMedia Modeling. MMM 2016. Lecture Notes in Computer Science(), vol 9517. Springer, Cham. https://doi.org/10.1007/978-3-319-27674-8_14

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-27674-8_14

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-27673-1

  • Online ISBN: 978-3-319-27674-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics