A Simple Integration of Social Relationship and Text Data for Identifying Potential Customers in Microblogging

  • Guansong Pang
  • Shengyi Jiang
  • Dongyi Chen
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8346)


Identifying potential customers among a huge number of users in microblogging is a fundamental problem for microblog marketing. One challenge in potential customer detection in microblogging is how to generate an accurate characteristic description for users, i.e., user profile generation. Intuitively, the preference of a user’s friends (i.e., the person followed by the user in microblogging) is of great importance to capture the characteristic of the user. Also, a user’s self-defined tags are often concise and accurate carriers for the user’s interests. In this paper, for identifying potential customers in microblogging, we propose a method to generate user profiles via a simple integration of social relationship and text data. In particular, our proposed method constructs self-defined tag based user profiles by aggregating tags of the users and their friends. We further identify potential customers among users by using text classification techniques. Although this framework is simple, easy to implement and manipulate, it can obtain desirable potential customer detection accuracy. This is illustrated by extensive experiments on datasets derived from Sina Weibo, the most popular microblogging in China.


identifying potential customers user profiling social relationship text data text classification microblog marketing 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Pennacchiotti, M., Popescu, A.: Democrats, republicans and starbucks afficionados: user classification in twitter. In: Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 430–438 (2011)Google Scholar
  2. 2.
    Konopnicki, D., Shmueli-Scheuer, M., Cohen, D., Sznajder, B., Herzig, J., Raviv, A., Zwerling, N., Roitman, H., Mass, Y.: A statistical approach to mining customers’ conversational data from social media. IBM Journal of Research and Development 57(3/4), 11–14 (2013)Google Scholar
  3. 3.
    Burger, J.D., Henderson, J., Kim, G., Zarrella, G.: Discriminating gender on Twitter. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 1301–1309 (2011)Google Scholar
  4. 4.
    Fink, C., Kopecky, J., Morawskib, M.: Inferring Gender from the Content of Tweets: A Region Specific Example. In: Proceedings of the 6th International AAAI Conference on Weblogs and Social Media (2012)Google Scholar
  5. 5.
    Rao, D., Yarowsky, D., Shreevats, A., Gupta, M.: Classifying latent user attributes in twitter. In: Proceedings of the 2nd International Workshop on Search and Mining User-generated Contents, pp. 37–44 (2010)Google Scholar
  6. 6.
    Tinati, R., Carr, L., Hall, W., Bentwood, J.: Identifying communicator roles in twitter. In: Proceedings of the 21st International Conference Companion on World Wide Web, pp. 1161–1168 (2012)Google Scholar
  7. 7.
    Yu, S., Kak, S.: A Survey of Prediction Using Social Media. arXiv preprint:1203.1647 (2012)Google Scholar
  8. 8.
    Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. The Journal of Machine Learning Research 3, 993–1022 (2003)zbMATHGoogle Scholar
  9. 9.
    Drew, J.H., Mani, D.R., Betz, A.L., Datta, P.: Targeting customers with statistical and data-mining techniques. Journal of Service Research 3(3), 205–219 (2001)CrossRefGoogle Scholar
  10. 10.
    Kim, Y., Street, W.N.: An intelligent system for customer targeting: a data mining approach. Decision Support Systems 37(2), 215–228 (2004)CrossRefGoogle Scholar
  11. 11.
    Kim, Y., Street, W.N., Russell, G.J., Menczer, F.: Customer targeting: A neural network approach guided by genetic algorithms. Management Science 51(2), 264–276 (2005)CrossRefGoogle Scholar
  12. 12.
    Berry, M.J., Linoff, G.S.: Data mining techniques: for marketing, sales, and customer relationship management. Wiley Computer Publishing (2004)Google Scholar
  13. 13.
    Yang, Y., Liu, X.: A re-examination of text categorization methods. In: Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 42–49 (1999)Google Scholar
  14. 14.
    Aggarwal, C.C., Zhai, C.: A survey of text classification algorithms. Mining Text Data, pp.163–222. Springer US (2012)Google Scholar
  15. 15.
    Salton, G., Wong, A., Yang, C.: A vector space model for automatic indexing. Communications of the ACM 18(11), 613–620 (1975)CrossRefzbMATHGoogle Scholar
  16. 16.
    Kim, H., Howland, P., Park, H.: Dimension reduction in text classification with support vector machines. The Journal of Machine Learning Research 6, 37–53 (2005)zbMATHMathSciNetGoogle Scholar
  17. 17.
    Guo, G., Wang, H., Bell, D., Bi, Y., Greer, K.: Using kNN model for automatic text categori-zation. Soft Computing 10(5), 423–430 (2006)CrossRefGoogle Scholar
  18. 18.
    Pang, G., Jiang, S.: A generalized cluster centroid based classifier for text categorization. Information Processing & Management 49(2), 576–586 (2013)CrossRefGoogle Scholar
  19. 19.
    Chang, C., Lin, C.: LIBSVM: a library for support vector machines. ACM Transactions on Intelligent Systems and Technology (TIST) 2(3), 27 (2011)Google Scholar
  20. 20.
    Guan, H., Zhou, J., Guo, M.: A class-feature-centroid classifier for text categorization. In: Proceedings of the 18th International Conference on World Wide Web, pp. 201–210 (2009)Google Scholar
  21. 21.
    Han, E., Karypis, G., Kumar, V.: Text categorization using weight adjusted k-nearest neigh-bor classification. In: Proceedings of the 5th Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 53–65 (2001)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Guansong Pang
    • 1
  • Shengyi Jiang
    • 2
  • Dongyi Chen
    • 2
  1. 1.School of ManagementGuangdong University of Foreign StudiesGuangzhouChina
  2. 2.School of InformaticsGuangdong University of Foreign StudiesGuangzhouChina

Personalised recommendations