Abstract
With the increase of mobile apps, i.e. applications, it is more and more difficult for users to discover their desired apps. Similar app recommendation, which plays a critical role in the app discovering process, is of our main concern in this paper. Intuitively, name is an important feature to distinguish apps. So app names are often used to learn the app similarity. However, existing studies do not perform well because names are usually very short. In this paper, we explore the phenomenon of the ill performance, and dive into the underlying reason, which motivates us to leverage additional corpus to bridge the gap between similar words. Specifically, we learn app representation from names and other related corpus, and formalize it as a collective matrix factorization problem. Moreover, we propose to utilize alternating direction method of multipliers to solve this collective matrix factorization problem. Experimental results on real-world data sets indicate that our proposed approach outperforms state-of-the-art methods on similar app recommendation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Core code is available at https://github.com/bnn2010/iscas2016_AppSimilarity.
References
Kajanan, S., Pervin, N., Datta, A., Dutta, K.: Mobilewalla: a mobile application search engine. IEEE Trans. Mob. Comput. (2011)
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. JMLR 3, 993–1022 (2003)
Chen, N., Hoi, S.C., Li, S., Xiao, X.: Simapp: a framework for detecting similarmobile applications by online kernel learning. In: WSDM 2015, pp. 305–314 (2015)
Leung, K.W.T., Ng, W., Di Jiang, K., Vosecky, J.: Panorama: a semantic-aware application search framework. In: Extending Database Technology (2013)
Fellbaum, C.: WordNet: An Electronic Lexical Database. MIT Press, Cambridge (1998)
Yang, C.S., Salton, G., Wong, A.: A vector space model for automatic indexing. Commun. ACM 18(11), 613–620 (1975)
Harris, Z.: Distributional structure. Word 10(23), 146–162 (1954)
Shawe-Taylor, J., Cristianini, N., Watkins, C., Lodhi, H., Saunders, C.: Text classification using string kernels. JMLR 2, 419–444 (2002)
Manning, C.D., Pennington, J., Socher, R.: Glove: Global vectors for word representation (2014)
Jones, R., Rey, B., Madani, O., Greiner, W.: Generating query substitutions. In: WWW 2006, pp. 387–396 (2006)
Manning, C.D., Raghavan, P., Schütze, H.: Introduction to Information Retrieval. Cambridge University Press, Cambridge (2008)
Metzler, D., Dumais, S.T., Meek, C.: Similarity measures for short segments of text. In: Amati, G., Carpineto, C., Romano, G. (eds.) ECiR 2007. LNCS, vol. 4425, pp. 16–27. Springer, Heidelberg (2007)
Parikh, N., Boyd, S.: Proximal algorithms. Found. Trends Optim. 1(3), 127–239 (2014)
Park, D.H., Liu, M., Zhai, C., Wang, H.: Leveraging user reviews to improve accuracy for mobile app retrieval. In: SIGIR 2015, pp. 533–542 (2015)
Peter, P.P., Turney, D.: From frequency to meaning: vector space models of semantics. J. Artif. Intell. Res. 37(1), 141–188 (2010)
Mikolov, T., Le, Q.: Distributed representations of sentences and documents (2014)
Rajaraman, A., Ullman, J.D., Leskovec, J.: Mining of Massive Datasets, vol. 1. Cambridge University Press, Cambridge (2012)
Sahami, M., Heilman, T.D.: A web-based kernel function for measuring the similarity of short text snippets. In: WWW 2006, pp. 377–386 (2006)
Furnas, G.W., Landauer, T.K., Harshman, R., Scott, D., Dumais, S.T.: Indexing by latent semantic analysis. J. Am. Soc. Inf. Sci. 41(6), 391–407 (1990)
Singh, A.P., Gordon, G.J.: Relational learning via collective matrix factorization. In: KDD 2008, pp. 650–658 (2008)
Chen, K., Corrado, G.S., Dean, J., Mikolov, T., Sutskever, I.: Distributed representations of words and phrases and their compositionality. In: NIPS (2013)
Von Neumann, J.: Functional Operators: The Geometry of Orthogonal Spaces. Princeton University Press, Princeton (1950)
Yih, W.-T., Meek, C.: Improving similarity measures for short segments of text. In: AAAI, vol. 7, pp. 1489–1494 (2007)
Zhang, Y.: An alternating direction algorithm for nonnegative matrix factorization (2010, preprint)
Zhu, H., Chen, E., Xiong, H., Cao, H., Tian, J.: Mobile app classification with enriched contextual information. IEEE Trans. Mob. Comput. 13(7), 1550–1563 (2014)
Zhu, Y., Chen, Y., Lu, Z., Pan, S.J., Xue, G.R., Yu, Y., Yang, Q.: Heterogeneous transfer learning for image classification. In: AAAI (2011)
Zhuo, J., Huang, Z., Liu, Y., Kang, Z., Cao, X., Li, M., Jin, L.: Semantic matchingin app search. In: WSDM 2015, pp. 209–210 (2015)
Acknowledgments
This paper is supported in part by the National High-tech R&D Program of China (No. 2012AA010902), and NSFC (No. 61303059).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Bu, N., Niu, S., Yu, L., Ma, W., Long, G. (2016). Bridging Semantic Gap Between App Names: Collective Matrix Factorization for Similar Mobile App Recommendation. In: Cellary, W., Mokbel, M., Wang, J., Wang, H., Zhou, R., Zhang, Y. (eds) Web Information Systems Engineering – WISE 2016. WISE 2016. Lecture Notes in Computer Science(), vol 10042. Springer, Cham. https://doi.org/10.1007/978-3-319-48743-4_26
Download citation
DOI: https://doi.org/10.1007/978-3-319-48743-4_26
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-48742-7
Online ISBN: 978-3-319-48743-4
eBook Packages: Computer ScienceComputer Science (R0)