Skip to main content
Log in

Point of interest mining with proper semantic annotation

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Mining geo-tagged social photo media has received large amounts of attention from researchers recently. Points of interest (POI) mining from a collection of geo-tagged photos is one of these problems. POI mining refers to the processes of pattern recognition (namely clustering), extraction and semantic annotation. However, based on unsupervised clustering methods, many POIs might not be mined. Additionally, there is a great challenge for the proper semantic annotation to data clusters after clustering. In practice, there are many applications which require the accuracy of semantic annotation and high quality of pattern recognition such as POI recommendation. In this paper, we study POI mining from a collection of geo-tagged photos in combination with proper semantic annotation by using additional POI information from high coverage external POI databases. We propose a novel POI mining framework by using two-level clustering, random walk and constrained clustering. In random walk clustering step, we separate a large-scale collection of geo-tagged photos into many clusters. In the constrained clustering step, we continue to divide the clusters that include many POIs into many sub-clusters, where the geo-tagged photos in a sub-cluster associate with a particular POI. Experimental results on two datasets of geo-tagged Flickr photos of two cities in California, USA have shown that the proposed method substantially outperforms existing approaches that are adapted to handle the problem.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

Notes

  1. http://www.flickr.com/services/api/

  2. http://www.openstreetmap.org/

References

  1. Basu S, Bilenko M, Mooney RJ (2004) A probabilistic framework for semi-supervised clustering. In: KDD

  2. Bermingham L, Lee I (2014) Spatio-temporal sequential pattern mining for tourism sciences. Procedia Comput Sci 29:379–389

    Article  Google Scholar 

  3. Chen WC, Battestini A, Gelfand N, Setlur V (2009). Visual summaries of popular landmarks from community photo collections. In: 2009 Conference Record of the Forty-Third Asilomar Conference on Signals, Systems and Computers (p 1248–1255). IEEE

  4. Comaniciu D, Meer P (2002) Mean shift: a robust approach toward feature space analysis. IEEE Trans Pattern Anal Mach Intell 24(5):603–619

    Article  Google Scholar 

  5. Crandall DJ, Backstrom L, Huttenlocher D, Kleinberg J (2009) Mapping the world’s photos. In: Proceedings of the 18th international conference on World Wide Web (p 761–770). ACM

  6. Ester M, Kriegel HP, Sander J, Xu X (1996) A density-based algorithm for discovering clusters in large spatial databases with noise. In: KDD (Vol. 96, No. 34, p 226–231)

  7. Fuchs G, Stange H, Hecker D, Andrienko N, Andrienko G (2015) Constructing semantic interpretation of routine and anomalous mobility behaviors from big data. SIGSPATIAL Spec 7(1):27–34

    Article  Google Scholar 

  8. Golub GH, Van Loan CF (1989) Matrix Computations. Johns Hopkins University Press

  9. Han J, Lee H (2015) Adaptive landmark recommendations for travel planning: Personalizing and clustering landmarks using geo-tagged social media. Pervasive Mob Comput 18:4–17

    Article  Google Scholar 

  10. Harel D, Koren Y (2001) On clustering using random walks. In: FST TCS 2001: Foundations of Software Technology and Theoretical Computer Science (p 18–41). Springer Berlin Heidelberg

  11. Hu Y, Gao S, Janowicz K, Yu B, Li W, Prasad S (2015) Extracting and understanding urban areas of interest using geotagged photos. Comput Environ Urban Syst 54:240–254

    Article  Google Scholar 

  12. Ji R, Gao Y, Zhong B, Yao H, Tian Q (2011) Mining flickr landmarks by modeling reconstruction sparsity. ACM Trans Multimed Comput Commun Appl (TOMM) 7(1):31

    Google Scholar 

  13. Jiang S, Qian X, Mei T, Fu Y (2016) Personalized travel sequence recommendation on multi-source big social media. IEEE Trans Big Data 2(1):43–56

    Article  Google Scholar 

  14. Jiang S, Qian X, Shen J, Fu Y, Mei T (2015) Author topic model-based collaborative filtering for personalized POI recommendations. IEEE Trans Multimedia 17(6):907–918

    Google Scholar 

  15. Kennedy LS, Naaman M (2008) Generating diverse and representative image search results for landmarks. In: Proceedings of the 17th international conference on World Wide Web (p 297-306). ACM

  16. Kennedy L, Naaman M, Ahern S, Nair R, Rattenbury T (2007) How flickr helps us make sense of the world: context and content in community-contributed media collections. In: Proceedings of the 15th international conference on Multimedia (p 631–640). ACM

  17. Kisilevich S, Mansmann F, Keim D (2010) P-DBSCAN: a density based clustering algorithm for exploration and analysis of attractive areas using collections of geo-tagged photos. In: Proceedings of the 1st international conference and exhibition on computing for geospatial research & application (p 38). ACM

  18. Kou NM, Yang Y, Gong Z (2015) Travel topic analysis: a mutually reinforcing method for geo-tagged photos. GeoInformatica 19(4):693–721

    Article  Google Scholar 

  19. Kruger R, Thom D, Ertl T (2014) Semantic enrichment of movement behavior with foursquare-a visual analytics approach. IEEE Trans Vis Comput Graph 21(8):903–915

    Article  Google Scholar 

  20. Kunze C, Hecht R (2015) Semantic enrichment of building data with volunteered geographic information to improve mappings of dwelling units and population. Comput Environ Urban Syst 53:4–18

    Article  Google Scholar 

  21. Lacerda YA, Feitosa RGF, Esmeraldo GÁRM, Baptista CDS, Marinho LB (2012) Compass clustering: a new clustering method for detection of points of interest using personal collections of georeferenced and oriented photographs. In: Proceedings of the 18th Brazilian symposium on Multimedia and the Web (p 281–288). ACM

  22. Laptev D, Tikhonov A, Serdyukov P, Gusev G (2014) Parameter-free discovery and recommendation of areas-of-interest. In: Proceedings of the 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (p 113–122). ACM

  23. Lee I, Cai G, Lee K (2014) Exploration of geo-tagged photos through data mining approaches. Expert Syst Appl 41(2):397–405

    Article  Google Scholar 

  24. Lee DT, Schachter BJ (1980) Two algorithms for constructing a Delaunay triangulation. Int J Comput Inform Sci 9(3):219–242

    Article  MATH  MathSciNet  Google Scholar 

  25. Li Z, Liu J, Tang X (2008) Pairwise constraint propagation by semi-definite programming for semi-supervised classification. In: ICML 2008, p 576–583

  26. Lim KH (2015) Recommending tours and places-of-interest based on user interests from geo-tagged photos. In: Proceedings of the 2015 ACM SIGMOD on PhD Symposium (p 33–38). ACM

  27. Majid A, Chen L, Chen G, Mirza HT, Hussain I, Woodward J (2013) A context-aware personalized travel recommendation system based on geotagged social media data mining. Int J Geogr Inf Sci 27(4):662–684

    Article  Google Scholar 

  28. Majid A, Chen L, Mirza HT, Hussain I, Chen G (2015) A system for mining interesting tourist locations and travel sequences from public geo-tagged photos. Data Knowl Eng 95:66–86

    Article  Google Scholar 

  29. Maraziotis IA (2012) A semi-supervised fuzzy clustering algorithm applied to gene expression data. Pattern Recogn 45(1):637–648

    Article  MATH  Google Scholar 

  30. Memon I, Chen L, Majid A, Lv M, Hussain I, Chen G (2015) Travel recommendation using geo-tagged photos in social media for tourist. Wirel Pers Commun 80(4):1347–1362

    Article  Google Scholar 

  31. Meyer F (1994) Topographic distance and watershed lines. Signal Process 38(1):113–125

    Article  MATH  Google Scholar 

  32. O’Hare N, Murdock V (2013) Modeling locations with social media. Inf Retr 16(1):30–62

    Article  Google Scholar 

  33. Popescu A, Grefenstette G (2009) Deducing trip related information from flickr. In: Proceedings of the 18th international conference on World Wide Web (p 1183–1184). ACM

  34. Popescu A, Grefenstette G, Moëllic P-A (2008) Gazetiki: automatic construction of a geographical gazetteer. In: Proc. of JCDL 2008 (Pittsburgh, PA)

  35. Rattenbury T, Good N, Naaman M (2007) Towards automatic extraction of event and place semantics from flickr tags. In: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval (p 103–110). ACM

  36. Rattenbury T, Good N, Naaman M (2007) Towards extracting flickr tag semantics. In: Proceedings of the 16th international conference on World Wide Web (p 1287–1288). ACM

  37. Sedgewick R, Wayne K (2011) Algorithms (4th ed). Addison-Wesley

  38. Shi J, Malik J (2000) Normalized cuts and image segmentation. IEEE Trans Pattern Anal Mach Intell 22(8):888–905

    Article  Google Scholar 

  39. Spyrou E, Mylonas P (2016) Analyzing Flickr metadata to extract location-based information and semantically organize its photo content. Neurocomputing 172:114–133

    Article  Google Scholar 

  40. Tang W, Xiong H, Zhong S, Wu J (2007) Enhancing semi-supervised clustering: a feature projection perspective. In: Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining (p 707–716). ACM

  41. Thomee B, Arapakis I, Shamma DA (2016) Finding social points of interest from georeferenced and oriented online photographs. ACM Trans Multimed Comput Commun Appl (TOMM) 12(2):36

    Google Scholar 

  42. Tipping M, Schölkopf B, Jaakkola T, Richardson T (2001) A kernel approach for vector quantization with guaranteed distortion bounds. In: Artificial intelligence and statistics (p 129–129)

  43. Vu HQ, Li G, Law R, Ye BH (2015) Exploring the travel behaviors of inbound tourists to Hong Kong using geotagged photos. Tour Manag 46:222–232

    Article  Google Scholar 

  44. Wagstaff K, Cardie C, Rogers S, Schrödl S (2001) Constrained k-means clustering with background knowledge. In: ICML (Vol. 1, p 577–584)

  45. Wen YT, Lei PR, Peng WC, Zhou XF (2014) Exploring social influence on location-based social networks. In: 2014 I.E. International Conference on Data Mining (p 1043–1048). IEEE

  46. Weyand T, Leibe B (2015) Visual landmark recognition from Internet photo collections: A large-scale evaluation. Comput Vis Image Underst 135:1–15

    Article  Google Scholar 

  47. Yan C, Zhang Y, Dai F, Wang X, Li L, Dai Q (2014) Parallel deblocking filter for HEVC on many-core processor. Electron Lett 50(5):367–368

    Article  Google Scholar 

  48. Yan C, Zhang Y, Dai F, Zhang J, Li L, Dai Q (2014) Efficient parallel HEVC intra-prediction on many-core processor. Electron Lett 50(11):805–806

    Article  Google Scholar 

  49. Yan C, Zhang Y, Dai F, Li L (2013) Highly parallel framework for HEVC motion estimation on many-core platform. In: Data Compression Conference (DCC), 2013 (p 63–72). IEEE

  50. Yan C, Zhang Y, Xu J, Dai F, Li L, Dai Q, Wu F (2014) A highly parallel framework for HEVC coding unit partitioning tree decision on many-core processors. IEEE Signal Process Lett 21(5):573–576

    Article  Google Scholar 

  51. Yan C, Zhang Y, Xu J, Dai F, Zhang J, Dai Q, Wu F (2014) Efficient parallel framework for HEVC motion estimation on many-core processors. IEEE Trans Circuits Syst Video Technol 24(12):2077–2089

    Article  Google Scholar 

  52. Yang Y, Gong Z, U LH (2011) Identifying points of interest by self-tuning clustering. In: Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval (p 883–892). ACM

  53. Yin Y, Shen Z, Zhang L, Zimmermann R (2015) Spatial-temporal tag mining for automatic geospatial video annotation. ACM Trans Multimed Comput Commun Appl (TOMM) 11(2):29

    Google Scholar 

  54. Zhang J, Wang S, Huang Q (2015) Location-based parallel tag completion for geo-tagged social image retrieval. In: Proceedings of the 5th ACM on International Conference on Multimedia Retrieval (p 355–362). ACM

  55. Zheng YT, Zha ZJ, Chua TS (2012) Mining travel patterns from geotagged photos. ACM Trans Intell Syst Technol (TIST) 3(3):56

    Google Scholar 

  56. Zhou X, Xu C, Kimmons B (2015) Detecting tourism destinations using scalable geospatial analysis based on cloud computing platform. Comput Environ Urban Syst 54:144–153

    Article  Google Scholar 

Download references

Acknowledgments

This work was supported by Institute for Information & Communications Technology Promotion (IITP) grant funded by the Korea government (MSIP) (No. R0101-16-0054, WiseKB: Big data based self-evolving knowledge base and reasoning platform).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Seong-Bae Park.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Bui, TH., Park, SB. Point of interest mining with proper semantic annotation. Multimed Tools Appl 76, 23435–23457 (2017). https://doi.org/10.1007/s11042-016-4114-7

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-016-4114-7

Keywords

Navigation