Advertisement

Data Mining and Knowledge Discovery

, Volume 33, Issue 1, pp 58–95 | Cite as

Modeling location-based social network data with area attraction and neighborhood competition

  • Thanh-Nam DoanEmail author
  • Ee-Peng Lim
Article
  • 205 Downloads

Abstract

Modeling user check-in behavior helps us gain useful insights about venues as well as the users visiting them. These insights are important in urban planning and recommender system applications. Since check-in behavior is the result of multiple factors, this paper focuses on studying two venue related factors, namely, area attraction and neighborhood competition. The former refers to the ability of a spatial area covering multiple venues to collectively attract check-ins from users, while the latter represents the extent to which a venue can compete with other venues in the same area for check-ins. We first embark on empirical studies to ascertain the two factors using three datasets gathered from users and venues of three major cities, Singapore, Jakarta and New York City. We then propose the visitation by area attractiveness and neighborhood competition (VAN) model incorporating area attraction and neighborhood competition factors. Our VAN model is also extended to incorporate social homophily so as to further enhance its modeling power. We evaluate VAN model using real world datasets against various state-of-the-art baselines. The results show that VAN model outperforms the baselines in check-in prediction task and its performance is robust under different parameter settings.

Keywords

Location-based social network Check-in prediction User behavior Area attraction Neighborhood competition Matrix factorization 

Notes

References

  1. Backstrom L, Sun E, Marlow C (2010) Find me if you can: improving geographical prediction with social and spatial proximity. In: The 19th international conference on World Wide Web (WWW). ACM, New York, pp 61–70Google Scholar
  2. Blei DM, Ng AY, Jordan MI (2003) Latent Dirichlet allocation. J Mach Learn Res 3(Jan):993–1022zbMATHGoogle Scholar
  3. Blei DM, Jordan MI (2006) Variational inference for Dirichlet process mixtures. Bayesian Anal 1(1):121–143MathSciNetCrossRefzbMATHGoogle Scholar
  4. Boyd S, Vandenberghe L (2004) Convex optimization. Cambridge University Press, New YorkCrossRefzbMATHGoogle Scholar
  5. Chang J, Sun E (2011) Location 3: how users share and respond to location-based data on social networking sites. In: 5th international AAAI conference on weblogs and social media (ICWSM), pp 74–80Google Scholar
  6. Cheng C, Yang H, King I, Lyu MR (2012) Fused matrix factorization with geographical and social influence in location-based social networks. In: The 26th AAAI conference on artificial intelligence (AAAI), vol 12, pp 17–23Google Scholar
  7. Cho E, Myers SA, Leskovec J (2011) Friendship and mobility: user movement in location-based social networks. In: Seventeenth ACM SIGKDD international conference on knowledge discovery and data mining (SIGKDD). ACM, New York, pp 1082–1090Google Scholar
  8. Church RL, Murray AT (2009) Business site selection, location analysis, and GIS. Wiley Online Library, New YorkGoogle Scholar
  9. De Nadai M, Staiano J, Larcher R, Sebe N, Quercia D, Lepri B (2016) The death and life of great Italian cities: a mobile phone data perspective. In: The 25th international conference on World Wide Web (WWW), pp 413–423Google Scholar
  10. Doan TN, Lim EP (2016) Attractiveness versus competition: towards an unified model for user visitation. In: The 25th ACM international on conference on information and knowledge management (CIKM). ACM, New York, pp 2149–2154Google Scholar
  11. Doan TN, Lim EP (2017) Modeling check-in behavior with geographical neighborhood influence of venues. In: The 13th international conference on advanced data mining and applications (ADMA)Google Scholar
  12. Doan TN, Chua FCT, Lim EP (2015a) Mining business competitiveness from user visitation data. In: Eighth international conference on social computing, behavioral-cultural modeling, and prediction (SBP). Springer, Berlin, pp 283–289Google Scholar
  13. Doan TN, Chua FCT, Lim EP (2015b) On neighborhood effects in location-based social networks. In: IEEE/WIC/ACM international conference on web intelligence and intelligent agent technology (WI-IAT), vol 1. IEEE, Washington, pp 477–484Google Scholar
  14. Friedman J, Hastie T, Tibshirani R (2001) The elements of statistical learning. Springer series in statistics, vol 1. Springer, New YorkzbMATHGoogle Scholar
  15. Fu Y, Xiong H, Ge Y, Zheng Y, Yao Z, Zhou ZH (2016) Modeling of geographic dependencies for real estate ranking. ACM Trans Knowl Discov Data (TKDD) 11(1):11Google Scholar
  16. Gao H, Liu H (2015) Mining human mobility in location-based social networks. Synth Lect Data Min Knowl Discov 7(2):1–115CrossRefGoogle Scholar
  17. Gao H, Tang J, Liu H (2012a) Exploring social-historical ties on location-based social networks. In: ICWSMGoogle Scholar
  18. Gao H, Tang J, Liu H (2012b) gscorr: modeling geo-social correlations for new check-ins on location-based social networks. In: The 21st ACM international conference on information and knowledge management (CIKM). ACM, New York, pp 1582–1586Google Scholar
  19. Georgiev P, Noulas A, Mascolo C (2014) Where businesses thrive: predicting the impact of the olympic games on local retailers through location-based services data. In: The eighth international AAAI conference on weblogs and social media (ICWSM), AAAIGoogle Scholar
  20. Hsu H, Lachenbruch PA (2008) Paired t test. Wiley Encyclopedia of Clinical Trials, New YorkGoogle Scholar
  21. Hu L, Sun A, Liu Y (2014) Your neighbors affect your ratings: on geographical neighborhood influence to rating prediction. In: The 37th international ACM SIGIR conference on research and development in information retrieval (SIGIR). ACM, New York, pp 345–354Google Scholar
  22. Huff DL (1963) A probabilistic analysis of shopping center trade areas. Land Econ 39(1):81–90CrossRefGoogle Scholar
  23. Isaacman S, Becker R, Cáceres R, Martonosi M, Rowland J, Varshavsky A, Willinger W (2012) Human mobility modeling at metropolitan scales. In: The 10th international conference on mobile systems, applications, and services (MobiSys). ACM, New York, pp 239–252Google Scholar
  24. Jordan MI et al (1995) Why the logistic function? A tutorial discussion on probabilities and neural networksGoogle Scholar
  25. Karamshuk D, Noulas A, Scellato S, Nicosia V, Mascolo C (2013) Geo-spotting: mining online location-based services for optimal retail store placement. In: The 19th ACM SIGKDD international conference on knowledge discovery and data mining (SIGKDD), pp 793–801Google Scholar
  26. Koren Y, Bell R, Volinsky C et al (2009) Matrix factorization techniques for recommender systems. Computer 42(8):30–37CrossRefGoogle Scholar
  27. Lee DD, Seung HS (2001) Algorithms for non-negative matrix factorization. In: 14th advances in neural information processing systems (NIPS), pp 556–562Google Scholar
  28. Li H, Ge Y, Hong R, Zhu H (2016) Point-of-interest recommendations: learning potential check-ins from friends. In: The 22nd ACM SIGKDD international conference on knowledge discovery and data mining (SIGKDD), pp 975–984Google Scholar
  29. Li R, Wang S, Deng H, Wang R, Chang KCC (2012) Towards social user profiling: unified and discriminative influence model for inferring home locations. In: The 18th ACM SIGKDD international conference on knowledge discovery and data mining (SIGKDD). ACM, New York, pp 1023–1031Google Scholar
  30. Liang D, Charlin L, McInerney J, Blei DM (2016) Modeling user exposure in recommendation. In: The 25th international conference on World Wide Web (WWW), pp 951–961Google Scholar
  31. Lin J, Oentaryo R, Lim EP, Vu C, Vu A, Kwee A (2016a) Where is the goldmine? Finding promising business locations through Facebook data analytics. In: The 27th ACM conference on hypertext and social media (HT). ACM, New York, pp 93–102Google Scholar
  32. Lin J, Oentaryo RJ, Lim EP, Vu C, Vu A, Kwee AT, Prasetyo PK (2016b) A business zone recommender system based on Facebook and urban planning data. In: European conference on information retrieval. Springer, Berlin, pp 641–647Google Scholar
  33. Liu B, Fu Y, Yao Z, Xiong H (2013) Learning geographical preferences for point-of-interest recommendation. In: The 19th ACM SIGKDD international conference on knowledge discovery and data mining (SIGKDD). ACM, New York, pp 1043–1051Google Scholar
  34. Liu Y, Wei W, Sun A, Miao C (2014) Exploiting geographical neighborhood characteristics for location recommendation. In: The 23rd ACM international conference on information and knowledge management (CIKM). ACM, New York, pp 739–748Google Scholar
  35. Ma H, Yang H, Lyu MR, King I (2008) Sorec: social recommendation using probabilistic matrix factorization. In: The 17th ACM conference on information and knowledge management (CIKM). ACM, New York, pp 931–940Google Scholar
  36. Ma H, Zhou D, Liu C, Lyu MR, King I (2011) Recommender systems with social regularization. In: Proceedings of the fourth ACM international conference on web search and data mining. ACM, New York, pp 287–296Google Scholar
  37. Mnih A, Salakhutdinov RR (2008) Probabilistic matrix factorization. In: The 21th advances in neural information processing systems (NIPS), pp 1257–1264Google Scholar
  38. Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, Vanderplas J, Passos A, Cournapeau D, Brucher M, Perrot M, Duchesnay E (2011) Scikit-learn: machine learning in python. J Mach Learn Res 12:2825–2830MathSciNetzbMATHGoogle Scholar
  39. Qu Y, Zhang J (2013) Trade area analysis using user generated mobile location data. In: The 22nd international conference on World Wide Web (WWW). ACM, New York, pp 1053–1064Google Scholar
  40. Quan X, Wenyin L, Dou W, Xiong H, Ge Y (2012) Link graph analysis for business site selection. IEEE Comput 45(3):64–69CrossRefGoogle Scholar
  41. Quercia D, Saez D (2014) Mining urban deprivation from foursquare: implicit crowdsourcing of city land use. IEEE Pervasive Comput 13(2):30–36CrossRefGoogle Scholar
  42. Rendle S, Freudenthaler C, Gantner Z, Schmidt-Thieme L (2009) BPR: Bayesian personalized ranking from implicit feedback. In: The 25th conference on uncertainty in artificial intelligence (UAI), pp 452–461Google Scholar
  43. Schmidt MN, Winther O, Hansen LK (2009) Bayesian non-negative matrix factorization. In: The 8th independent component analysis and signal separation (ICA), vol 9, pp 540–547Google Scholar
  44. Smarzaro R, Lima TFdM, Davis Jr CA (2017a) Could data from location-based social networks be used to support urban planning? In: The 26th international conference on World Wide Web (WWW)Google Scholar
  45. Smarzaro R, de Lima TFM, Davis Jr CA (2017b) Quality of urban life index from location-based social networks data: a case study in Belo Horizonte, Brazil. In: Volunteered geographic information and the future of geospatial data. IGI Global, Hershey, pp 185–207Google Scholar
  46. Song C, Qu Z, Blumm N, Barabási AL (2010) Limits of predictability in human mobility. Science 327(5968):1018–1021MathSciNetCrossRefzbMATHGoogle Scholar
  47. Wang C, Blei DM (2011) Collaborative topic modeling for recommending scientific articles. In: The 17th ACM SIGKDD international conference on knowledge discovery and data mining (SIGKDD). ACM, New York, pp 448–456Google Scholar
  48. Weng L, Flammini A, Vespignani A, Menczer F (2012) Competition among memes in a world with limited attention. Sci Rep 2:335CrossRefGoogle Scholar
  49. Yan XY, Wang WX, Gao ZY, Lai YC (2017) Universal model of individual and population mobility on diverse spatial scales. Nat Commun 8(1):1639CrossRefGoogle Scholar
  50. Yu Z, Zhang D, Yang D (2013) Where is the largest market: ranking areas by popularity from location based social networks. In: Ubiquitous intelligence and computing, 2013 IEEE 10th international conference on and 10th international conference on autonomic and trusted computing (UIC/ATC), pp 157–162Google Scholar
  51. Yu Z, Tian M, Wang Z, Guo B, Mei T (2016) Shop-type recommendation leveraging the data from social media and location-based services. ACM Trans Knowl Discov Data (TKDD) 11(1):1CrossRefGoogle Scholar
  52. Yuan J, Zheng Y, Xie X (2012) Discovering regions of different functions in a city using human mobility and pois. In: The 18th ACM SIGKDD international conference on knowledge discovery and data mining (SIGKDD). ACM, New York, pp 186–194Google Scholar
  53. Zhao S, King I, Lyu MR, Zeng J, Yuan M (2017) Mining business opportunities from location-based social networks. In: The 40th international ACM SIGIR conference on research and development in information retrieval (SIGIR), pp 1037–1040Google Scholar
  54. Zhao T, McAuley J, King I (2014) Leveraging social connections to improve personalized ranking for collaborative filtering. In: Proceedings of the 23rd ACM international conference on conference on information and knowledge management, CIKM ’14Google Scholar

Copyright information

© The Author(s) 2018

Authors and Affiliations

  1. 1.School of Information SystemsSingapore Management UniversitySingaporeSingapore

Personalised recommendations