Scene Location Guide by Image-Based Retrieval

Jhuo, I-Hong; Chen, Tsuhan; Lee, D. T.

doi:10.1007/978-3-642-11301-7_22

Scene Location Guide by Image-Based Retrieval

I-Hong Jhuo^21,22,
Tsuhan Chen²³ &
D. T. Lee^21,22

Conference paper

2089 Accesses
1 Citations

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5916))

Abstract

In this paper, we propose a new image-based algorithm to identify where a tourist is when visiting unfamiliar places. When the tourist takes a photo of an unfamiliar place, our algorithm can recognize where the tourist is by retrieving similar images from an image database, where location information is associated with each image. Our method is not only fusing global and local information but using a coarse-to-fine three-stage search process. We first extract image descriptors from the image taken by the tourist and retrieve a number of most relevant images from the database. Then, we re-rank these relevant images based on geometric consistency. Finally, our method determines where the tourist is by using an image-to-class distance measure. Promising performance of the proposed algorithm is demonstrated by the experiments.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Boiman, O., Shechtman, E., Irani, M.: In defense of nearest-neighbor based image classification. In: IEEE Computer Society Conference Vision and Pattern Recognition (2008)
Google Scholar
Bosch, A., Zisserman, A., Muñoz, X.: Scene classification via pLSA. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3954, pp. 517–530. Springer, Heidelberg (2006)
Chapter Google Scholar
Bosch, A., Zisserman, A., Munoz, X.: Image classification using random forests and ferns. In: IEEE International Conference on Computer Vision (2007)
Google Scholar
Chang, C., Lin, C.: Libsvm: a library for support vector machines (2005)
Google Scholar
Dance, C., Willamowski, J., Fan, L., Bray, C., Csurka, G.: Visual categorization with bags of keypoints. In: European Conference on Computer Vision International Workshop on Statistical Learning in Computer Vision. LNCS. Springer, Heidelberg (2004)
Google Scholar
Fishler, M., Bolles, R.: Random sample consensus: a paradigm for model fitting with application to image analysis and automated cartography. Communications of the ACM 24, 381–395 (1981)
Article Google Scholar
Harris, C., Stephens, M.: A combined corner and edge detector. In: Proceedings of the Alvey Vision Conference, pp. 147–152 (1988)
Google Scholar
Hays, J., Efros, A.A.: IM2GPS: estimating geographic information from a single image. In: IEEE Computer Society Conference Vision and Pattern Recognition (2008)
Google Scholar
Jarvelin, K., Kekalainen, J.: Cumulated gain-based evaluation of IR techniques. ACM Transactions on Information Systems (TOIS) 20, 422–446 (2002)
Article Google Scholar
Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: IEEE Computer Society Conference Vision and Pattern Recognition, pp. 2169–2178 (2006)
Google Scholar
Li, F.F., Perona, P.: A bayesian hierarchical model for learning natural scene categories. In: IEEE Computer Society Conference Vision and Pattern Recognition (2005)
Google Scholar
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 2, 91–110 (2004)
Article Google Scholar
Luo, Z., Li, H., Tang, J., Hong, R., Chua, T.S.: ViewFoucus: Explore places of interests on google maps using photos with view direction filtering. In: Proceeding of Multimedia. ACM, New York (2009)
Google Scholar
Oliva, A., Torralba, A.: Modeling the shape of the scene: A holistic representation of the spatial envelope. International Journal of Computer Vision 42, 145–175 (2001)
Article MATH Google Scholar
Oliva, A., Torralba, A.: Building the gist of a scene: The role of global image features in recognition. Progress in Brain Research 155, 23–26 (2006)
Article Google Scholar
Philbin, J., Chum, O., Isard, M., Sivic, J., Zisserman, A.: Object retrieval with large vocabularies and fast spatial matching. In: IEEE Computer Society Conference Vision and Pattern Recognition (2007)
Google Scholar
Schaffalitzky, F., Zisserman, A.: Multi-view matching for unordered image sets, or how do I organize my holiday snaps? In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002. LNCS, vol. 2350, pp. 414–431. Springer, Heidelberg (2002)
Chapter Google Scholar
Smeulders, A.W.M., Worring, M., Santini, S., Gupta, A., Jain, R.: Content-based image retrieval at the end of the early years. IEEE Transactions Pattern Analysis and Machine Intelligence 22, 1349–1380 (2000)
Article Google Scholar
Szeliski, R.: Where am I?: ICCV 2005 computer vision contest, http://research.microsoft.com/iccv2005/contest/
Vogel, J., Schiele, B.: Semantic modeling of natural scenes for content-based image retrieval. International Journal of Computer Vision 72, 133–157 (2007)
Article Google Scholar
Yeh, T., Tollmar, K., Darrell, T.: Searching the web with mobile images for location recognition. In: IEEE Computer Society Conference Vision and Pattern Recognition (2004)
Google Scholar
Zhang, H., Low, C., Smoliar, S., Wu, J.: Video parsing, retrieval and browsing: an integrated and content-based solution. In: Proceeding of Multimedia, pp. 15–24. ACM, New York (1995)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Information Engineering, National Taiwan University, Taipei, Taiwan
I-Hong Jhuo & D. T. Lee
Institute of Information Science, Academia Sinica, Taipei, Taiwan
I-Hong Jhuo & D. T. Lee
School of Electrical and Computer Engineering, Cornell University, Ithaca, NY, 14853, USA
Tsuhan Chen

Authors

I-Hong Jhuo
View author publications
You can also search for this author in PubMed Google Scholar
Tsuhan Chen
View author publications
You can also search for this author in PubMed Google Scholar
D. T. Lee
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

University of Oldenburg, Germany
Susanne Boll
University of Texas at San Antonio,, TX, San Antonio, USA
Qi Tian
Microsoft Research Asia, Beijing, P.R. China
Lei Zhang
Southwest University, Beibei, Chongqing, China
Zili Zhang
School of Engineering and Information Technology, Deakin University, 221 Burwood Highway, Vic, 3125, Australia
Yi-Ping Phoebe Chen

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jhuo, IH., Chen, T., Lee, D.T. (2010). Scene Location Guide by Image-Based Retrieval. In: Boll, S., Tian, Q., Zhang, L., Zhang, Z., Chen, YP.P. (eds) Advances in Multimedia Modeling. MMM 2010. Lecture Notes in Computer Science, vol 5916. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-11301-7_22

Download citation

DOI: https://doi.org/10.1007/978-3-642-11301-7_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-11300-0
Online ISBN: 978-3-642-11301-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics