User Recommendation in Low Degree Networks with a Learning-Based Approach
User recommendation plays an important role in microblogging systems since users connect to these networks to share and consume content. Finding relevant users to follow is then a hot topic in the study of social networks. Microblogging networks are characterized by having a large number of users, but each of them connects with a limited number of other users, making the graph of followers to have a low degree. One of the main problems of approaching user recommendation with a learning-based approach in low-degree networks is the problem of extreme class imbalance. In this article, we propose a balancing scheme to face this problem, and we evaluate different classification algorithms using as features classical metrics for link prediction. We found that the learning-based approach outperformed individual metrics for the problem of user recommendation in the evaluated dataset. We also found that the proposed balancing approach lead to better results, enabling a better identification of existing connections between users.
KeywordsUser recommendation Online social networks Link prediction
This work was partially supported by research project PICT-2014-2750.
- Ertekin, S., Huang, J., Giles, C.L.: Active learning for class imbalance problem. In: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2007, pp. 823–824. ACM, New York (2007)Google Scholar
- Ho, T.K.: Random decision forests. In: Proceedings of 3rd International Conference on Document Analysis and Recognition, vol. 1, pp. 278–282 (1995)Google Scholar
- Karakoulas, G., Shawe-Taylor, J.: Optimizing classifiers for imbalanced training sets. In: Proceedings of the 11th International Conference on Neural Information Processing Systems, NIPS 1998, pp. 253–259. MIT Press, Cambridge (1998)Google Scholar
- McCandless, M., Hatcher, E., Gospodnetic, O.: Lucene in Action, Second Edition: Covers Apache Lucene 3.0. Manning Publications Co., Greenwich (2010)Google Scholar
- Mitchell, T.M.: Machine Learning, vol. 45, no. 37, pp. 870–877. McGraw Hill, Burr Ridge (1997)Google Scholar
- Porter, M.F.: An algorithm for suffix stripping. Rossiiskaya Akademiya Nauk. Programmirovanie 14(3), 130–137 (1980)Google Scholar
- Rish, I.: An empirical study of the naive Bayes classifier. In: IJCAI 2001 Workshop on Empirical Methods in Artificial Intelligence, pp. 41–46. IBM, New York (2001)Google Scholar
- Salton, G., Mcgill, M.J.: Introduction to Modern Information Retrieval. McGraw-Hill, New York (1986)Google Scholar
- Witten, I.H., et al.: Data Mining: Practical Machine Learning Tools and Techniques. Morgan Kaufmann, Burlington (2016)Google Scholar