Advertisement

Mining Explainable User Interests from Scalable User Behavior Data

  • Li Jun
  • Zuo Xin-qiang
  • Zhou Meng-qi
  • Fan Gong-yuan
  • Li Lian-cun
Part of the Communications in Computer and Information Science book series (CCIS, volume 320)

Abstract

Capturing user interests from big user behavior data is critical for online advertising. Based on the user interests, advertisers can significantly reduce their advertising cost by delivering the most relevant ads for the user.

The state-of-the-art user Behavior Targeting (BT) models treat user behaviors as documents, and thus use topic models to extract their interests. A limitation of these methods is that user behaviors are usually described as unexplainable hidden topics, which cannot be directly used to guide online advertising. To this end, we propose in this paper a systematic User Interest Distribution Mining (UIDM for short) Framework to extract explainable user interests from big user behavior data. In the solution, we first use the Probabilistic Latent Semantic Analysis (PLSA) to discover the relationship between users and their behaviors, which can be described as hidden topics. Then, we construct a mapping matrix between the hidden topics and user interests by manually labeling a feature entity matrix. Experiments on real-world data sets demonstrate the performance of the proposed method.

Keywords

Behavior Targeting Probabilistic Latent Semantic Analysis User Category 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Beitzel, S., Jensen, E., Frieder, O., Lewis, D., Chowdhury, A., Kolcz, A.: Improving automatic query classification via semi-supervised learning. In: Fifth IEEE International Conference on Data Mining, p. 8. IEEE (2005)Google Scholar
  2. 2.
    Cao, H., Hu, D., Shen, D., Jiang, D., Sun, J., Chen, E., Yang, Q.: Context-aware query classification. In: Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 3–10. ACM (2009)Google Scholar
  3. 3.
    Cao, H., Jiang, D., Pei, J., Chen, E., Li, H.: Towards context-aware search by learning a very large variable length hidden markov model from search logs. In: Proceedings of the 18th International Conference on World Wide Web, pp. 191–200. ACM (2009)Google Scholar
  4. 4.
    Hu, J., Wang, G., Lochovsky, F., Sun, J., Chen, Z.: Understanding user’s query intent with wikipedia. In: Proceedings of the 18th International Conference on World Wide Web, pp. 471–480. ACM (2009)Google Scholar
  5. 5.
    Shen, D., Sun, J., Yang, Q., Chen, Z.: Building bridges for web query classification. In: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 131–138. ACM (2006)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Li Jun
    • 1
    • 2
    • 3
  • Zuo Xin-qiang
    • 1
  • Zhou Meng-qi
    • 1
  • Fan Gong-yuan
    • 1
  • Li Lian-cun
    • 1
  1. 1.State Grid Energy Research InstituteBeijingChina
  2. 2.School of Computer ScienceBeijing University of Post and TelecommunicationBeijingChina
  3. 3.Key Laboratory of Trustworthy Distributed Computing and Service (BUPT)Ministry of EducationBeijingChina

Personalised recommendations