Abstract
Recommender system has been popularly used for recommending products and services to the online buyers and users. Collaborative Filtering (CF) is one of the most popular filtering approaches used to find the preferences of users for the recommendation. CF works on the ratings given by the users for a particular item. It predicts the rating that is not explicitly given for any item and build the recommendation list for a particular user. Different similarity metrics and prediction approaches are used for this purpose. But these metrics and approaches have some issues in dealing with highly sparse datasets. In this paper, we sought to find the most accurate combinations of similarity metrics and prediction approaches for both user and item similarity based CF. In this comparative study, we deliberately instill sparsity of different magnitudes (10, 20, 30 and 40%) by deleting given ratings in an existing dataset. We then predict the deleted ratings using different combinations of similarity metrics and prediction approach. We assessed the accuracy of the prediction with the help of two evaluation metrics (MAE and RMSE).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
MovieLens|GroupLens, https://grouplens.org/datasets/movielens/. Last retrieved July 16, 2016.
Top 10 movie recommendation engines—CNET, https://www.cnet.com/news/top-10-movie-recommendation-engines/. Last Retrieved July 7, 2017.
García-Cumbreras, M. Á., Montejo-Ráez, A., & Díaz-Galiano, M. C. (2013). Pessimists and optimists: Improving collaborative filtering through sentiment analysis. Expert Systems with Applications: An International Journal, 40(17), 6758–6765.
Boratto, L., & Salvatore, C. (2014). Using collaborative filtering to overcome the curse of dimensionality when clustering users in a group recommender system. In 16th International Conference on Enterprise Information Systems.
Said, A., Fields, B., Jain, B. J., & Albayrak, S. (2013). User-centric evaluation of a K-furthest neighbor collaborative filtering recommender algorithm. In 16th ACM Conference on Computer Supported Cooperative Work and Social Computing.
Pirasteh, P., Jung, J. J., & Hwang, D. (2014). Item-based collaborative filtering with attribute correlation: A case study on movie recommendation. In Intelligent Information and Database Systems: 6th Asian Conference.
Jawaheer, G., Szomszor, M., & Kostkova, P. (2010). Comparison of implicit and explicit feedback from an online music recommendation service. 1st International Workshop on Information Heterogeneity and Fusion in Recommender Systems.
Sun, D., Luo, Z., & Zhang, F. (2011). A novel approach for collaborative filtering to alleviate the new item cold-start problem. In 11th International Symposium on Communications and Information Technologies.
Hu, R., & Lu, Y. (2006). A hybrid user and item-based collaborative filtering with smoothing on sparse data. ICAT Workshops.
Sarwar, B. M., Karypis, G., Konstan, J. A., & Riedl, J. T. (2000). Application of dimensionality reduction in recommender system—A case study. ACM WEBKDD Workshop.
Bokde, D. K., Girase, S., & Mukhopadhyay, D. (2015). An item-based collaborative filtering using dimensionality reduction techniques on mahout framework. CoRR.
Puntheeranurak, S., & Chaiwitooanukool, T. (2011). An item-based collaborative filtering method using Item-based hybrid similarity. In 2nd International Conference on Software Engineering and Service Science.
Fikir, O. B., Yaz, I. O., & Özyer, T. (2010). A movie rating prediction algorithm with collaborative filtering. In International Conference on Advances in Social Networks Analysis and Mining.
Sarwar, B., Karypis, G., Konstan, J., & Riedl, J. (2001). Item-based collaborative filtering recommendation algorithms. In 10th International Conference on World Wide Web.
Bilge, A., & Kaleli, C. (2014). A multi-criteria item-based collaborative filtering framework. In 11th International Joint Conference on Computer Science and Software Engineering.
Bobadilla, J., Hernando, A., Ortega, F., & Abraham, G. (2012). Collaborative filtering based on significances. Information Sciences, 185(1), 1–17.
Xu, J., & Man, H. (2011). Dictionary learning based on Laplacian score in sparse coding. In Machine Learning and Data Mining in Pattern Recognition—7th International Conference.
Liu, H., Hu, Z., Mian, A. U., Tian, H., & Zhu, X. (2014). A new user similarity model to improve the accuracy of collaborative filtering. Knowledge-Based Systems, 56, 156–166.
Wu, J., Chen, L., Feng, Z., Zhou, M., & Wu, Z. (2013). Predicting quality of service for selection by neighborhood based collaborative filtering. IEEE Transactions Systems, Man, and Cybernetics: Systems, 43(2), 428–439.
Herlocker, J. L., Konstan, J. A., Borchers, A., & Riedl, J. (1999). An algorithmic framework for performing collaborative filtering. In 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval.
Herlocker, J. L., Konstan, J. A., & Riedl, J. (2002). An empirical analysis of design choices in neighborhood-based collaborative filtering algorithms. Information Retrieval, 5, 287–310.
Yang, X., Guo, Y., Liu, Y., & Steck, H. (2014). A survey of collaborative filtering based social recommender systems. Computer Communications, 41, 1–10.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Singh, P.K., Pramanik, P.K.D., Choudhury, P. (2019). A Comparative Study of Different Similarity Metrics in Highly Sparse Rating Dataset. In: Balas, V., Sharma, N., Chakrabarti, A. (eds) Data Management, Analytics and Innovation. Advances in Intelligent Systems and Computing, vol 839. Springer, Singapore. https://doi.org/10.1007/978-981-13-1274-8_4
Download citation
DOI: https://doi.org/10.1007/978-981-13-1274-8_4
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1273-1
Online ISBN: 978-981-13-1274-8
eBook Packages: EngineeringEngineering (R0)