A Novel Approach to Extract and Analyse Trending Cuisines on Social Media
- 44 Downloads
In this technological era, we have seen a huge increase in the number of reviewing sites in the internet. In case of online food delivery stores, these reviews are very important as they, on the whole express public sentiment towards a particular restaurant or cuisine. In this paper, we are proposing an approach to predict which cuisines and restaurants are “trending” in a country based on the analysis of social media. We mine social media platforms like Twitter for food-related tweets and extract these tweets by using our own manually curated food lexicon. From these tweets, we use similarity matching to extract the food items that were tweeted about and run each of these items through a cuisine classifier based on logistic regression and word2vec word embeddings. This is done for all the tweets and thus, we can get which cuisines and restaurants have been popular while, which restaurants are fading. Our approach can, therefore be used by restaurants to analyze which markets they need to expand into and also where they have to revamp their business strategies.
KeywordsTwitter mining Recommendation system Logistic regression Cuisine classification Social media analysis
The authors would like to thank the reviewers and the experts who have helped in this research and given us great insights and thoughts which have guided us and helped us improve our work.
- 2.Derczynski, L., Ritter, A., Clark, S., Bontcheva, K.: Twitter part-of-speech tagging for all: overcoming sparse and noisy data. In: Proceedings of the International Conference on Recent Advances in Natural Language Processing, January 2013Google Scholar
- 3.Effrosynidis, D., Symeonidis, S., Arampatzis, A.: A comparison of pre-processing techniques for Twitter sentiment analysis. In: 21st International Conference on Theory and Practice of Digital Libraries. LNCS, vol. 10450, September 2017Google Scholar
- 4.Kim, A.Y., Ha, J.G., Choi, H., Moon, H.: Automated text analysis based on skip-gram model for food evaluation in predicting consumer acceptance. Comput. Intell. Neurosci. 2, 1–12 (2018)Google Scholar
- 6.Zhao, W.X., Jiang, J., He, J., Song, Y., Achananuparp, P., Lim, E.-P., Li, X.: Topical keyphrase extraction from Twitter. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, vol. 1, pp. 379–388, June 2011Google Scholar
- 9.Ramos, J.: Using TF-IDF to determine word relevance in document queries. In: Proceedings of the First Instructional Conference on Machine Learning, vol. 242, pp. 133–142, January 2003Google Scholar
- 10.Koster, C.H.A., Beney, J.G.: On the importance of parameter tuning in text categorization. In: International Andrei Ershov Memorial Conference on Perspectives of System Informatics, pp. 270–283, June 2006. Springer (2006)Google Scholar
- 14.Lokeshkumar, R., Sindhuja, R., Sengottuvelan, P.: A survey on preprocessing of web log file in web usage mining to improve the quality of data. Int. J. Emerg. Technol. Adv. Eng. 4(8), 229–234 (2014)Google Scholar
- 15.Gopalakrishnan, T., Sengottuvelan, P., Bharathi, A., Lokeshkumar, R.: An approach to webpage prediction method using variable order Markov model in recommendation systems. J. Internet Technol. 19(2), 415–424 (2018)Google Scholar