Abstract
Recommender systems have been widely used in contemporary industry and plenty of work has been done in this field to help users to identify items of interest. Collaborative Filtering (CF, for short) algorithm is an important technology in recommender systems. However, less work has been done in automobile recommendation system with the sharp increase of the amount of automobiles. Whats more, the computational speed is a major weakness for collaborative filtering technology. Therefore, using MapReduce framework to optimize the CF algorithm is a vital solution to this performance problem. In this paper, we present a recommendation of the users comment on industrial automobiles with various properties based on real world industrial datasets of user-automobile comment data collection, and provide recommendation for automobile providers and help them predict users comment on automobiles with new-coming property. Firstly, we solve the sparseness of matrix using previous construction of score matrix. Secondly, we solve the data normalization problem by removing dimensional effects from the raw data of automobiles, where different dimensions of automobile properties bring great error to the calculation of CF. Finally, we use the MapReduce framework to optimize the CF algorithm, and the computational speed has been improved times. UV decomposition used in this paper is an often used matrix factorization technology in CF algorithm, without calculating the interpolation weight of neighbors, which will be more convenient in industry.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Adomavicius, G., Tuzhilin, A.: Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions. IEEE Trans. Knowl. Data Eng. 17(6), 734–749 (2005)
Bell, R.M., Koren, Y.: Scalable collaborative filtering with jointly derived neighborhood interpolation weights. In: The Seventh IEEE International Conference on Data Mining (ICDM), vol. 7, pp. 43–52. IEEE (2007)
Bell, R.M., Koren, Y., Volinsky, C.: The bellkor 2008 solution to the netflix prize. Statistics Research Department at AT&T Research, 1 (2008)
Bottou, L.: Large-scale machine learning with stochastic gradient descent. In: Lechevallier, Y., Saporta, G. (eds.) Proceedings of COMPSTAT 2010, pp. 177–186. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-7908-2604-3_16
Chen, L.C., Kuo, P.J., Liao, I.E.: Ontology-based library recommender system using MapReduce. Cluster Comput. 18(1), 113–121 (2015)
Delimitrou, C., Kozyrakis, C.: Quasar: resource-efficient and QoS-aware cluster management. ACM SIGPLAN Not. 49(4), 127–144 (2014)
Fruchter, B., Fruchter, D.A.: Factor content of the wais with separate digits-forward and digits-backward scores for a borderline and mentally retarded sample. Monogr. Am. Assoc. Ment. Defic. (1), 67 (1973)
Gallivan, K.A., Plemmons, R.J., Sameh, A.H.: Parallel algorithms for dense linear algebra computations. SIAM Rev. 32(1), 54–135 (1990)
Geist, G.A., Romine, C.H.: Lu factorization algorithms on distributed-memory multiprocessor architectures. SIAM J. Sci. Stat. Comput. 9(4), 639–649 (1988)
Guilford, J.P.: Structure-of-intellect abilities in preliterate children and the mentally retarded. Monogr. Am. Assoc. Ment. Defic. 1, 46–58 (1973)
Ishii, K., et al.: A diagnostic method for suspected alzheimer’s disease using h 2 15 O positron emission tomography perfusion Z score. Neuroradiology 42(11), 787–794 (2000)
Kiwiel, K.C.: Convergence and efficiency of subgradient methods for quasiconvex minimization. Math. Program. 90(1), 1–25 (2001)
Koren, Y., Bell, R., Volinsky, C.: Matrix factorization techniques for recommender systems. Computer 42(8), 30–37 (2009)
Kumar, V., Grama, A., Anshul, G., Karypis, G.: Introduction to parallel computing: design and analysis of algorithms. Benjamin/Cummings Publishing Company 18, 82–109 (1994)
Marlin, B.: Collaborative filtering: a machine learning perspective. University of Toronto (2004)
O’Leary, D.P., Stewart, G.W.: Assignment and scheduling in parallel matrix factorization. Linear Algebra Appl. 77, 275–299 (1986)
Pradel B., et al.: A case study in a recommender system based on purchase data. In: Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 377–385. ACM (2011)
Rajaraman, A., Ullman, J.: Textbook on mining of massive datasets (2011)
Rodriguez, M.A., Egenhofer, M.J.: Determining semantic similarity among entity classes from different ontologies. IEEE Trans. Knowl. Data Eng. 15(2), 442–456 (2003)
Su, X., Taghi, M.K.: A survey of collaborative filtering techniques. Adv. Artif. Intell. 2009, 4 (2009)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Nie, Z., Sun, J., He, Z. (2019). A Case Study for User Rating Prediction on Automobile Recommendation System Using MapReduce. In: Hu, C., Yang, W., Jiang, C., Dai, D. (eds) High-Performance Computing Applications in Numerical Simulation and Edge Computing. HPCMS HiDEC 2018 2018. Communications in Computer and Information Science, vol 913. Springer, Singapore. https://doi.org/10.1007/978-981-32-9987-0_6
Download citation
DOI: https://doi.org/10.1007/978-981-32-9987-0_6
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-32-9986-3
Online ISBN: 978-981-32-9987-0
eBook Packages: Computer ScienceComputer Science (R0)