Abstract
In this paper, we propose a new image-based algorithm to identify where a tourist is when visiting unfamiliar places. When the tourist takes a photo of an unfamiliar place, our algorithm can recognize where the tourist is by retrieving similar images from an image database, where location information is associated with each image. Our method is not only fusing global and local information but using a coarse-to-fine three-stage search process. We first extract image descriptors from the image taken by the tourist and retrieve a number of most relevant images from the database. Then, we re-rank these relevant images based on geometric consistency. Finally, our method determines where the tourist is by using an image-to-class distance measure. Promising performance of the proposed algorithm is demonstrated by the experiments.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Boiman, O., Shechtman, E., Irani, M.: In defense of nearest-neighbor based image classification. In: IEEE Computer Society Conference Vision and Pattern Recognition (2008)
Bosch, A., Zisserman, A., Muñoz, X.: Scene classification via pLSA. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3954, pp. 517–530. Springer, Heidelberg (2006)
Bosch, A., Zisserman, A., Munoz, X.: Image classification using random forests and ferns. In: IEEE International Conference on Computer Vision (2007)
Chang, C., Lin, C.: Libsvm: a library for support vector machines (2005)
Dance, C., Willamowski, J., Fan, L., Bray, C., Csurka, G.: Visual categorization with bags of keypoints. In: European Conference on Computer Vision International Workshop on Statistical Learning in Computer Vision. LNCS. Springer, Heidelberg (2004)
Fishler, M., Bolles, R.: Random sample consensus: a paradigm for model fitting with application to image analysis and automated cartography. Communications of the ACM 24, 381–395 (1981)
Harris, C., Stephens, M.: A combined corner and edge detector. In: Proceedings of the Alvey Vision Conference, pp. 147–152 (1988)
Hays, J., Efros, A.A.: IM2GPS: estimating geographic information from a single image. In: IEEE Computer Society Conference Vision and Pattern Recognition (2008)
Jarvelin, K., Kekalainen, J.: Cumulated gain-based evaluation of IR techniques. ACM Transactions on Information Systems (TOIS) 20, 422–446 (2002)
Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: IEEE Computer Society Conference Vision and Pattern Recognition, pp. 2169–2178 (2006)
Li, F.F., Perona, P.: A bayesian hierarchical model for learning natural scene categories. In: IEEE Computer Society Conference Vision and Pattern Recognition (2005)
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 2, 91–110 (2004)
Luo, Z., Li, H., Tang, J., Hong, R., Chua, T.S.: ViewFoucus: Explore places of interests on google maps using photos with view direction filtering. In: Proceeding of Multimedia. ACM, New York (2009)
Oliva, A., Torralba, A.: Modeling the shape of the scene: A holistic representation of the spatial envelope. International Journal of Computer Vision 42, 145–175 (2001)
Oliva, A., Torralba, A.: Building the gist of a scene: The role of global image features in recognition. Progress in Brain Research 155, 23–26 (2006)
Philbin, J., Chum, O., Isard, M., Sivic, J., Zisserman, A.: Object retrieval with large vocabularies and fast spatial matching. In: IEEE Computer Society Conference Vision and Pattern Recognition (2007)
Schaffalitzky, F., Zisserman, A.: Multi-view matching for unordered image sets, or how do I organize my holiday snaps? In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002. LNCS, vol. 2350, pp. 414–431. Springer, Heidelberg (2002)
Smeulders, A.W.M., Worring, M., Santini, S., Gupta, A., Jain, R.: Content-based image retrieval at the end of the early years. IEEE Transactions Pattern Analysis and Machine Intelligence 22, 1349–1380 (2000)
Szeliski, R.: Where am I?: ICCV 2005 computer vision contest, http://research.microsoft.com/iccv2005/contest/
Vogel, J., Schiele, B.: Semantic modeling of natural scenes for content-based image retrieval. International Journal of Computer Vision 72, 133–157 (2007)
Yeh, T., Tollmar, K., Darrell, T.: Searching the web with mobile images for location recognition. In: IEEE Computer Society Conference Vision and Pattern Recognition (2004)
Zhang, H., Low, C., Smoliar, S., Wu, J.: Video parsing, retrieval and browsing: an integrated and content-based solution. In: Proceeding of Multimedia, pp. 15–24. ACM, New York (1995)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Jhuo, IH., Chen, T., Lee, D.T. (2010). Scene Location Guide by Image-Based Retrieval. In: Boll, S., Tian, Q., Zhang, L., Zhang, Z., Chen, YP.P. (eds) Advances in Multimedia Modeling. MMM 2010. Lecture Notes in Computer Science, vol 5916. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-11301-7_22
Download citation
DOI: https://doi.org/10.1007/978-3-642-11301-7_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-11300-0
Online ISBN: 978-3-642-11301-7
eBook Packages: Computer ScienceComputer Science (R0)