Skip to main content

A Comparative Study of Different Similarity Metrics in Highly Sparse Rating Dataset

  • Conference paper
  • First Online:
Data Management, Analytics and Innovation

Abstract

Recommender system has been popularly used for recommending products and services to the online buyers and users. Collaborative Filtering (CF) is one of the most popular filtering approaches used to find the preferences of users for the recommendation. CF works on the ratings given by the users for a particular item. It predicts the rating that is not explicitly given for any item and build the recommendation list for a particular user. Different similarity metrics and prediction approaches are used for this purpose. But these metrics and approaches have some issues in dealing with highly sparse datasets. In this paper, we sought to find the most accurate combinations of similarity metrics and prediction approaches for both user and item similarity based CF. In this comparative study, we deliberately instill sparsity of different magnitudes (10, 20, 30 and 40%) by deleting given ratings in an existing dataset. We then predict the deleted ratings using different combinations of similarity metrics and prediction approach. We assessed the accuracy of the prediction with the help of two evaluation metrics (MAE and RMSE).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. MovieLens|GroupLens, https://grouplens.org/datasets/movielens/. Last retrieved July 16, 2016.

  2. Top 10 movie recommendation engines—CNET, https://www.cnet.com/news/top-10-movie-recommendation-engines/. Last Retrieved July 7, 2017.

  3. García-Cumbreras, M. Á., Montejo-Ráez, A., & Díaz-Galiano, M. C. (2013). Pessimists and optimists: Improving collaborative filtering through sentiment analysis. Expert Systems with Applications: An International Journal, 40(17), 6758–6765.

    Article  Google Scholar 

  4. Boratto, L., & Salvatore, C. (2014). Using collaborative filtering to overcome the curse of dimensionality when clustering users in a group recommender system. In 16th International Conference on Enterprise Information Systems.

    Google Scholar 

  5. Said, A., Fields, B., Jain, B. J., & Albayrak, S. (2013). User-centric evaluation of a K-furthest neighbor collaborative filtering recommender algorithm. In 16th ACM Conference on Computer Supported Cooperative Work and Social Computing.

    Google Scholar 

  6. Pirasteh, P., Jung, J. J., & Hwang, D. (2014). Item-based collaborative filtering with attribute correlation: A case study on movie recommendation. In Intelligent Information and Database Systems: 6th Asian Conference.

    Google Scholar 

  7. Jawaheer, G., Szomszor, M., & Kostkova, P. (2010). Comparison of implicit and explicit feedback from an online music recommendation service. 1st International Workshop on Information Heterogeneity and Fusion in Recommender Systems.

    Google Scholar 

  8. Sun, D., Luo, Z., & Zhang, F. (2011). A novel approach for collaborative filtering to alleviate the new item cold-start problem. In 11th International Symposium on Communications and Information Technologies.

    Google Scholar 

  9. Hu, R., & Lu, Y. (2006). A hybrid user and item-based collaborative filtering with smoothing on sparse data. ICAT Workshops.

    Google Scholar 

  10. Sarwar, B. M., Karypis, G., Konstan, J. A., & Riedl, J. T. (2000). Application of dimensionality reduction in recommender system—A case study. ACM WEBKDD Workshop.

    Google Scholar 

  11. Bokde, D. K., Girase, S., & Mukhopadhyay, D. (2015). An item-based collaborative filtering using dimensionality reduction techniques on mahout framework. CoRR.

    Google Scholar 

  12. Puntheeranurak, S., & Chaiwitooanukool, T. (2011). An item-based collaborative filtering method using Item-based hybrid similarity. In 2nd International Conference on Software Engineering and Service Science.

    Google Scholar 

  13. Fikir, O. B., Yaz, I. O., & Özyer, T. (2010). A movie rating prediction algorithm with collaborative filtering. In International Conference on Advances in Social Networks Analysis and Mining.

    Google Scholar 

  14. Sarwar, B., Karypis, G., Konstan, J., & Riedl, J. (2001). Item-based collaborative filtering recommendation algorithms. In 10th International Conference on World Wide Web.

    Google Scholar 

  15. Bilge, A., & Kaleli, C. (2014). A multi-criteria item-based collaborative filtering framework. In 11th International Joint Conference on Computer Science and Software Engineering.

    Google Scholar 

  16. Bobadilla, J., Hernando, A., Ortega, F., & Abraham, G. (2012). Collaborative filtering based on significances. Information Sciences, 185(1), 1–17.

    Article  Google Scholar 

  17. Xu, J., & Man, H. (2011). Dictionary learning based on Laplacian score in sparse coding. In Machine Learning and Data Mining in Pattern Recognition—7th International Conference.

    Google Scholar 

  18. Liu, H., Hu, Z., Mian, A. U., Tian, H., & Zhu, X. (2014). A new user similarity model to improve the accuracy of collaborative filtering. Knowledge-Based Systems, 56, 156–166.

    Article  Google Scholar 

  19. Wu, J., Chen, L., Feng, Z., Zhou, M., & Wu, Z. (2013). Predicting quality of service for selection by neighborhood based collaborative filtering. IEEE Transactions Systems, Man, and Cybernetics: Systems, 43(2), 428–439.

    Google Scholar 

  20. Herlocker, J. L., Konstan, J. A., Borchers, A., & Riedl, J. (1999). An algorithmic framework for performing collaborative filtering. In 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval.

    Google Scholar 

  21. Herlocker, J. L., Konstan, J. A., & Riedl, J. (2002). An empirical analysis of design choices in neighborhood-based collaborative filtering algorithms. Information Retrieval, 5, 287–310.

    Article  Google Scholar 

  22. Yang, X., Guo, Y., Liu, Y., & Steck, H. (2014). A survey of collaborative filtering based social recommender systems. Computer Communications, 41, 1–10.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Pradeep Kumar Singh .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Singh, P.K., Pramanik, P.K.D., Choudhury, P. (2019). A Comparative Study of Different Similarity Metrics in Highly Sparse Rating Dataset. In: Balas, V., Sharma, N., Chakrabarti, A. (eds) Data Management, Analytics and Innovation. Advances in Intelligent Systems and Computing, vol 839. Springer, Singapore. https://doi.org/10.1007/978-981-13-1274-8_4

Download citation

  • DOI: https://doi.org/10.1007/978-981-13-1274-8_4

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-13-1273-1

  • Online ISBN: 978-981-13-1274-8

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics