Abstract
Mining geo-tagged social photo media has recently received considerable attention from researchers. Point of interest (POI) mining from a collection of geo-tagged photos is one such problem. POI mining comprises pattern recognition (namely clustering), extraction, and semantic annotation. However, unsupervised clustering methods can fail to mine many POIs. Assigning a proper semantic annotation to each data cluster after clustering is also a major challenge. In practice, many applications, such as POI recommendation, require both accurate semantic annotation and high-quality pattern recognition. In this paper, we study POI mining from a collection of geo-tagged photos in combination with proper semantic annotation, using additional POI information from high-coverage external POI databases. We propose a novel POI mining framework based on two-level clustering: random walk clustering followed by constrained clustering. In the random walk clustering step, we separate a large-scale collection of geo-tagged photos into many clusters. In the constrained clustering step, we further divide the clusters that contain multiple POIs into sub-clusters, such that the geo-tagged photos in each sub-cluster are associated with a particular POI. Experimental results on two datasets of geo-tagged Flickr photos from two cities in California, USA show that the proposed method substantially outperforms existing approaches adapted to handle the problem.
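To make the two-level idea concrete, the following is a minimal sketch and not the authors' algorithm. It assumes photos are (latitude, longitude) pairs and that the external POI database is a dictionary from POI name to location. Level one is replaced by a much simpler stand-in (connected components of a distance-threshold graph) rather than random walk clustering, and level two is replaced by nearest-POI assignment rather than constrained clustering. All function names, the `eps_m` and `radius_m` parameters, and their default values are hypothetical.

```python
import math
from collections import defaultdict


def haversine_m(a, b):
    """Great-circle distance in metres between two (lat, lon) points."""
    lat1, lon1, lat2, lon2 = map(math.radians, (a[0], a[1], b[0], b[1]))
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * 6371000.0 * math.asin(math.sqrt(h))


def coarse_clusters(photos, eps_m):
    """Level 1 stand-in (NOT random walk clustering): group photos that are
    connected through chains of neighbours at most eps_m metres apart."""
    parent = list(range(len(photos)))

    def find(i):
        # Union-find root lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(len(photos)):
        for j in range(i + 1, len(photos)):
            if haversine_m(photos[i], photos[j]) <= eps_m:
                parent[find(i)] = find(j)

    groups = defaultdict(list)
    for i in range(len(photos)):
        groups[find(i)].append(i)
    return list(groups.values())


def split_by_poi(photos, cluster, pois, radius_m):
    """Level 2 stand-in (NOT constrained clustering): keep the POIs that lie
    within radius_m of some photo in the cluster, then assign every photo in
    the cluster to its nearest such POI. Clusters with no nearby POI are
    returned unlabelled under the key None."""
    near = {name: loc for name, loc in pois.items()
            if any(haversine_m(photos[i], loc) <= radius_m for i in cluster)}
    if not near:
        return {None: list(cluster)}
    sub = defaultdict(list)
    for i in cluster:
        name = min(near, key=lambda k: haversine_m(photos[i], near[k]))
        sub[name].append(i)
    return dict(sub)


def mine_pois(photos, pois, eps_m=150.0, radius_m=200.0):
    """Run both levels; returns one {poi_name: [photo indices]} dict per
    coarse cluster."""
    return [split_by_poi(photos, c, pois, radius_m)
            for c in coarse_clusters(photos, eps_m)]
```

For instance, calling `mine_pois(photos, {"A": (37.0005, -122.0), "B": (37.0036, -122.0)})` on a chain of photos that spans both locations would first merge the chain into one coarse cluster and then split it into one sub-cluster per POI. The sketch only illustrates why a second, POI-aware level is useful when a single spatial cluster covers several POIs; it says nothing about the accuracy of the method described in the paper.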
Acknowledgments
This work was supported by an Institute for Information & Communications Technology Promotion (IITP) grant funded by the Korea government (MSIP) (No. R0101-16-0054, WiseKB: Big data based self-evolving knowledge base and reasoning platform).
Cite this article
Bui, TH., Park, SB. Point of interest mining with proper semantic annotation. Multimed Tools Appl 76, 23435–23457 (2017). https://doi.org/10.1007/s11042-016-4114-7
DOI: https://doi.org/10.1007/s11042-016-4114-7