Skip to main content
Log in

An Experimental Study of Text Representation Methods for Cross-Site Purchase Preference Prediction Using the Social Text Data

  • Regular Paper
  • Published:
Journal of Computer Science and Technology Aims and scope Submit manuscript

Abstract

Nowadays, many e-commerce websites allow users to login with their existing social networking accounts. When a new user comes to an e-commerce website, it is interesting to study whether the information from external social media platforms can be utilized to alleviate the cold-start problem. In this paper, we focus on a specific task on cross-site information sharing, i.e., leveraging the text posted by a user on the social media platform (termed as social text) to infer his/her purchase preference of product categories on an e-commerce platform. To solve the task, a key problem is how to effectively represent the social text in a way that its information can be utilized on the e-commerce platform. We study two major kinds of text representation methods for predicting cross-site purchase preference, including shallow textual features and deep textual features learned by deep neural network models. We conduct extensive experiments on a large linked dataset, and our experimental results indicate that it is promising to utilize the social text for predicting purchase preference. Specially, the deep neural network approach has shown a more powerful predictive ability when the number of categories becomes large.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Zhang Y, Pennacchiotti M. Recommending branded products from social media. In Proc. the 7th ACM Conference on Recommender Systems, Oct. 2013, pp.77-84.

  2. Ricci F, Rokach L, Shapira B. Introduction to Recommender Systems Handbook. Springer US, 2011, pp.1-35.

  3. Zhang Y, Pennacchiotti M. Predicting purchase behaviors from social media. In Proc. the 22nd International Conference on World Wide Web, May 2013, pp.1521-1532.

  4. Wang J, ZhaoWX, He Y, Li X. Leveraging product adopter information from online reviews for product recommendation. In Proc. the 9th International AAAI Conference on Web and Social Media, May 2015, pp.464-472.

  5. Wang J, Cong G, Zhao W X, Li X. Mining user intents in Twitter: A semi-supervised approach to inferring intent categories for tweets. In Proc. the 29th AAAI Conference on Artificial Intelligence, Jan. 2015, pp.318-324.

  6. Zhao W X, Guo Y, He Y, Jiang H, Wu Y, Li X. We know what you want to buy: A demographic-based system for product recommendation on microblogs. In Proc. the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Aug. 2014, pp.1935-1944.

  7. Tang D, Wei F, Yang N, Zhou M, Liu T, Qin B. Learning sentiment-specific word embedding for Twitter sentiment classification. In Proc. the 52nd Annual Meeting of the Association for Computational Linguistics, June 2014, pp.1555-1565.

  8. Cao Z, Li W, Li S, Wei F, Li Y. AttSum: Joint learning of focusing and summarization with neural attention. In Proc. COLING, Dec. 2016, pp.547-556.

  9. Gui L, Xu R, He Y, Lu Q, Wei Z. Intersubjectivity and sentiment: From language to knowledge. In Proc. the 25th International Joint Conference on Artificial Intelligence, July 2016, pp.2789-2795.

  10. Lai S, Xu L, Liu K, Zhao J. Recurrent convolutional neural networks for text classification. In Proc. the 29th International AAAI Conference on Web and Social Media, Jan. 2015, pp.2267-2273.

  11. Yih W, He X, Meek C. Semantic parsing for single-relation question answering. In Proc. the 52nd Annual Meeting of the Association for Computational Linguistics, June 2014, pp.643-648.

  12. Kalchbrenner N, Grefenstette E, Blunsom P. A convolutional neural network for modelling sentences. In Proc. the 52nd Annual Meeting of the Association for Computational Linguistics, June 2014, pp.655-665.

  13. Basili R. Learning to classify text using support vector machines: Methods, theory, and algorithms by Thorsten Joachims. Computational Linguistics, 2003, 29(4): 655-661.

    Article  Google Scholar 

  14. Salton G, Wong A, Yang C S. A vector space model for automatic indexing. Commun. ACM, 1975, 18(11): 613-620.

    Article  MATH  Google Scholar 

  15. Robertson S. Understanding inverse document frequency: On theoretical arguments for IDF. Journal of Documentation, 2004, 60(5): 503-520.

    Article  Google Scholar 

  16. Blei D M, Ng A Y, Jordan M I. Latent Dirichlet allocation. In Proc. NIPS, Dec. 2001, pp.601-608.

  17. Seroussi Y, Bohnert F, Zukerman I. Personalised rating prediction for new users using latent factor models. In Proc. the 22nd ACM Conference on Hypertext and Hypermedia, June 2011, pp.47-56.

  18. Mikolov T, Chen K, Corrado G, Dean J. Efficient estimation of word representations in vector space. In Proc. International Conference on Learning Representations, May 2013.

  19. Mikolov T, Sutskever I, Chen K, Corrado G S, Dean J. Distributed representations of words and phrases and their compositionality. In Proc. NIPS, Dec. 2013, pp.3111-3119.

  20. Le Q, Mikolov T. Distributed representations of sentences and documents. In Proc. the 31st International Conference on Machine Learning, June 2014, pp.1188-1196.

  21. Kim Y. Convolutional neural networks for sentence classification. In Proc. the Empirical Methods in Natural Language Processing, Oct. 2014, pp.1746-1751.

  22. Hinton G E, Salakhutdinov R R. Reducing the dimensionality of data with neural networks. Science, 2006, 313(5786): 504-507.

    Article  MathSciNet  MATH  Google Scholar 

  23. Erhan D, Bengio Y, Courville A, Manzagol P A, Vincent P, Bengio S. Why does unsupervised pretraining help deep learning? Journal of Machine Learning Research, 2010, 11: 625-660.

    MathSciNet  MATH  Google Scholar 

  24. Collobert R, Weston J, Bottou L, Karlen M, Kavukcuoglu K, Kuksa P. Natural language processing (almost) from scratch. Journal of Machine Learning Research, 2011, 12: 2493-2537.

    MATH  Google Scholar 

  25. Kalchbrenner N, Blunsom P. Recurrent convolutional neural networks for discourse compositionality. In Proc. the Workshop on Continuous Vector Space Models and Their Compositionality, Aug. 2013, pp.119-126.

  26. Hochreiter S, Schmidhuber J. Long short-term memory. Neural Computation, 1997, 9(8): 1735-1780.

    Article  Google Scholar 

  27. Gers F A, Schraudolph N N, Schmidhuber J. Learning precise timing with LSTM recurrent networks. Journal of Machine Learning Research, 2002, 3: 115-143.

    MathSciNet  MATH  Google Scholar 

  28. Zhang S, Zheng D, Hu X, Yang M. Bidirectional long shortterm memory networks for relation classification. In Proc. the 29th Pacific Asia Conference on Language, Information and Computation, Oct.30-Nov.1, 2015.

  29. Jarvelin K, Kekalainen J. Cumulated gain-based evaluation of IR techniques. ACM Transactions on Information Systems, 2002, 20(4): 422-446.

    Article  Google Scholar 

  30. Burges C, Shaked T, Renshaw E, Lazier A, Deeds M, Hamilton N, Hullender G N. Learning to rank using gradient descent. In Proc. the 22nd Annual International Conference on Machine Learning, Aug. 2005, pp.89-96.

  31. Zhao W X, Li S, He Y, Chang E Y, Wen J, Li X. Connecting social media to e-commerce: Cold-start product recommendation using microblogging information. IEEE Transactions on Knowledge and Data Engineering, 2016, 28(5): 1147-1159.

    Article  Google Scholar 

  32. Guo S, Wang M, Leskovec J. The role of social networks in online shopping: Information passing, price of trust, and consumer choice. In Proc. the 12th ACM Conference on Electronic Commerce, June 2011, pp.157-166.

  33. Bhatt R, Chaoji V, Parekh R. Predicting product adoption in large-scale social networks. In Proc. the 19th ACM International Conference on Information and Knowledge Management, Oct. 2010, pp.1039-1048.

  34. Zhou F, Ji Y, Jiao R J. Predicting product adoption in large social networks for demand estimation. In Proc. ISPECE, Sept. 2014, pp.890-899.

  35. Hill S, Provost F, Volinsky C. Network-based marketing: Identifying likely adopters via consumer networks. Statistical Science, 2006, 21(2): 256-276.

    Article  MathSciNet  MATH  Google Scholar 

  36. Iyengar R, Han S, Gupta S. Do friends influence purchases in a social network? SSRN ELectronic Journal, 2009.

  37. Pan S J, Yang Q. A survey on transfer learning. IEEE Transactions on Knowledge and Data Engineering, 2010, 22(10): 1345-1359.

    Article  Google Scholar 

  38. PanW, Xiang EW, Yang Q. Transfer learning in collaborative filtering with uncertain ratings. In Proc. the 26th AAAI Conference on Artificial Intelligence, July 2012, pp.662-668.

  39. Li B, Yang Q, Xue X. Can movies and books collaborate? Cross-domain collaborative filtering for sparsity reduction. In Proc. the 21st International Joint Conference on Artificial Intelligence, July 2009, pp.2052-2057.

  40. Singh A P, Gordon G J. Relational learning via collective matrix factorization. In Proc. the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Aug. 2008, pp.650-658.

  41. Zhao L, Pan S J, Xiang E W, Zhong E, Lu Z, Yang Q. Active transfer learning for cross-system recommendation. In Proc. the 27th AAAI Conference on Artificial Intelligence, July 2013.

  42. Hu L, Cao J, Xu G, Cao L, Gu Z, Zhu C. Personalized recommendation via cross-domain triadic factorization. In Proc. the 22nd International Conference on World Wide Web, May 2013, pp.595-606.

  43. Cao Y, Xu R, Chen T. Combining convolutional neural network and support vector machine for sentiment classification. In Proc. the 4th National Conference on Social Media Processing, Nov. 2015, pp.144-155.

  44. Miwa M, Bansal M. End-to-end relation extraction using LSTMs on sequences and tree structures. In Proc. the 54th Annual Meeting of the Association for Computational Linguistics, Aug. 2016.

  45. Xu Y, Jia R, Mou L, Li G, Chen Y, Lu Y, Jin Z. Improved relation classification by deep recurrent neural networks with data augmentation. In Proc. the 26th International Conference on Computational Linguistics, Dec. 2016, pp.1461-1470.

  46. Chen W F, Ku L W. UTCNN: A deep learning model of stance classification on social media text. In Proc. the 26th International Conference on Computational Linguistics, Dec. 2016, pp.1635-1645.

  47. Dong L, Wei F, Zhou M, Xu K. Adaptive multicompositionality for recursive neural models with applications to sentiment analysis. In Proc. the 28th AAAI Conference on Artificial Intelligence, July 2014, pp.1537-1543.

  48. Hermann K M, Kocisky T, Grefenstette E, Espeholt L, Kay W, Suleyman M, Blunsom P. Teaching machines to read and comprehend. In Proc. NIPS, Dec. 2015, pp.1693-1701.

  49. Kobayashi S, Tian R, Okazaki N, Inui K. Dynamic entity representation with max-pooling improves machine reading. In Proc. the 15th Annual Conference of the North American Chapter of the Association for Computational Linguistics, June 2016, pp.850-855.

  50. Chen D, Bolton J, Manning C D. A thorough examination of the CNN/daily mail reading comprehension task. In Proc. the 54th Annual Meeting of the Association for Computational Linguistics, Aug. 2016.

  51. Wang L, Cao Z, de Melo G, Liu Z. Relation classification via multi-level attention CNNs. In Proc. the 54th Annual Meeting of the Association for Computational Linguistics, Aug. 2016.

Download references

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Wayne Xin Zhao.

Electronic supplementary material

Below is the link to the electronic supplementary material.

ESM 1

(PDF 189 kb)

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Bai, T., Dou, HJ., Zhao, W.X. et al. An Experimental Study of Text Representation Methods for Cross-Site Purchase Preference Prediction Using the Social Text Data. J. Comput. Sci. Technol. 32, 828–842 (2017). https://doi.org/10.1007/s11390-017-1763-6

Download citation

  • Received:

  • Revised:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11390-017-1763-6

Keywords

Navigation