A Fast Image Retrieval Method Based on A Quantization Tree

Wang, Xiaochun; Wang, Xiali; Wilkes, Don Mitchell

doi:10.1007/978-981-13-9217-7_10

Xiaochun Wang⁴,
Xiali Wang⁵ &
Don Mitchell Wilkes⁶

543 Accesses
2 Citations

Abstract

Traditional content-based image retrieval technology expresses the content of images by low-level features, giving rise to the “semantic gap” problem. Therefore, it is of great significance to obtain better accuracy of retrieval by fusing the visual spatial semantic content of images into content-based image retrieval algorithms. To help retrieval algorithms improve the ability of image analysis and understanding and improve the accuracy of the retrieval algorithm, in this chapter, we propose a new image retrieval methodology that begins by partitioning images into small overlapping image patches, on which feature vectors in the form of color histograms are exacted. Next, feature vectors extracted from a small number of training images are clustered to obtain the knowledge base consisting of visual vocabulary words, and images in the whole database are parsed by the knowledge base and their size-reduced versions are stored separately and subsequently used as an index. Finally the query image is partitioned and parsed, its similarities with the indexed images in the database are calculated, and the most similar images are output. The focus of this paper is on a fast visual vocabulary tree (a fast approximate nearest neighbor search tree), Quantization Tree, to quickly parse the query images so as to partially resolve the semantic gap problem between the semantic image content and the low-level image features. Extensive experimental results for object recognition tasks in an indoor environment as well as in an outdoor environment demonstrate the effectiveness and the efficiency of the Quantization Tree.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 159.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Amit, Y., & Geman, D. (1977). Shape quantization and reconstruction with randomized trees. Neural Computation, 9(7), 1545–1588.
Article Google Scholar
Beis, J. S., & Lowe, D. G. (1997). Shape indexing using approximate nearest neighbor search in high-dimensional spaces. In Proceedings of the 1997 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR ‘97) (pp. 1000–1006), Puerto Rico.
Google Scholar
Bentley, J. L. (1970). Multidimensional binary search trees in database applications. IEEE Transactions on Software Engineering, 5(4), 333–340.
Article Google Scholar
Chen, D., Tsai, S. S., & Chandrasekhar, V. (2009). Robust image retrieval using multiview scalable vocabulary trees. In Visual Communications & Image Processing. International Society for Optics and Photonics.
Google Scholar
Duch, A., Estivill-Castro, V., & Martinez, C. (1998). Randomized K-dimensional binary search trees. In K.-Y. Chwa & O. H. Ibarra (Eds.), ISAAC ‘98, LNCS (Vol. 1533, pp. 199–209). Berlin Heidelberg: Springer.
Google Scholar
Fukushima, K. (1980). Neocognition: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position. Biological Cybernetics, 36(4), 193–202.
Article Google Scholar
Geurts, P., Ernst, D., & Wehenkel, L. (2006). Extremely randomized trees. Machine Learning, 63(1), 3–42.
Article Google Scholar
Guttman, A. (1984). R-trees: A dynamic index structure for spatial searching. In Proceedings of the ACM International Conference on Management of Data (SIGMOD ‘84) (pp. 47–57), New York, NY.
Google Scholar
Hunter, J. E., Tugcu, M., Wang, X., Costello, C., & Wilkes, D. M. (2011). Exploiting sparse representations in very high-dimensional feature spaces obtained from patch based processing. Machine Vision and Applications, 22(3), 449–460.
Google Scholar
Karypis, G., Han, E.-H., & Kumar, V. (1999). CHAMELEON: A hierarchical clustering algorithm using dynamic modeling. IEEE Computer, 32(8), 68–75.
Article Google Scholar
Katayama, N., & Satoh, S. (1997). The SR-tree: An index structure for high-dimensional nearest neighbor queries. In Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD ‘97) (pp. 369–380), Tucson, Arizona.
Google Scholar
Konolige, K., Bowman, J., Chen, J. D., Mihelich, P., Calonder, M., Lepetit, V., et al. (2010). View-based maps. International Journal of Robotics Research, 29(8), 941–957.
Article Google Scholar
Laszlo, M., & Mukherjee, S. (2005). Minimum spanning tree partitioning algorithm for microaggregation. IEEE Transactions on Knowledge and Data Engineering, 17(7), 902–911.
Article Google Scholar
Lepetit, V., & Fua, P. (2006). Keypoint recognition using randomized trees. IEEE Transactions on Pattern Analysis and Machine Intelligence, 28(9), 1465–1479.
Article Google Scholar
Lowe, D. G. (1999). Object recognition from local scale-invariant features. In Proceedings of the International Conference on Computer Vision (ICCV ‘99) (pp. 1150–1157).
Google Scholar
Moosmann, F., Triggs, B., & Jurie, F. (2006). Fast discriminative visual codebooks using randomized clustering forest. In Proceedings of International Conference on Neural Information Processing Systems. MIT Press.
Google Scholar
Nister, D., & Stewenius, H. (2006). Scalable recognition with a vocabulary tree. In Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR ‘06) (Vol. 2, pp. 2161–2168), Washington, DC, USA.
Google Scholar
Pereira, F. C. N., Tishby, N., & Lee, L. (1993). Distributional clustering of English words. In Proceeding of the 31st Meeting of the Association for Computational Linguistics (pp. 183–190), Columbus, OH.
Google Scholar
Quinlan, J. R. (1992). C4.5: Program for machine learning. Morgan Kaufmann.
Google Scholar
Riemenschneider, H., Donoser, M., & Bischof, H. (2009). Bag of optical flow volumes for image sequence recognition. In Proceedings of British Machine Vision Conference (BMVC).
Google Scholar
Silpa-Anan, C., & Hartley, R. (2008). Optimised KD-trees for fast image descriptor matching. In Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR ‘08) (pp. 1–8).
Google Scholar
Swain, M. J., & Ballard, D. H. (1991). Color indexing. International Journal of Computer Vision, 7(1), 11–32.
Article Google Scholar
Veltkamp, R. C., Tanase, M., & Sent, D. (2001). Features in content-based image retrieval systems: A Survey. Computational Imaging and Vision, 36(03), 97–124.
Article Google Scholar
Wang, X., Tugcu, M., Hunter, J. E., & Wilkes, D. M. (2009a). Exploration of configural representation in landmark learning using working memory toolkit. Pattern Recognition Letters, 30(1), 66–79.
Article Google Scholar
Wang, X., Wang, X. L., & Wilkes, D. M. (2009b). A divide-and-conquer approach for minimum spanning tree-based clustering. IEEE Transactions on Knowledge and Data Engineering, 21(7), 945–958.
Article Google Scholar
White, D. A., & Jain, R. (1996). Similarity indexing with the SS-tree. In Proceedings of the 12th IEEE International Conference on Data Engineering (ICDE ‘96) (pp. 516–523), Washington, DC.
Google Scholar
Zhao, Z., & Elgammal, A. (2008). Information theoretic key frame selection for action recognition. In Proceedings of British Machine Vision Conference (BMVC).
Google Scholar

Download references

Author information

Authors and Affiliations

School of Software Engineering, Xi’an Jiaotong University, Xi’an, Shaanxi, China
Xiaochun Wang
School of Information Engineering, Chang’an University, Xi’an, Shaanxi, China
Xiali Wang
Department of Electrical Engineering and Computer Science, Vanderbilt University, Nashville, TN, USA
Don Mitchell Wilkes

Authors

Xiaochun Wang
View author publications
You can also search for this author in PubMed Google Scholar
Xiali Wang
View author publications
You can also search for this author in PubMed Google Scholar
Don Mitchell Wilkes
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xiaochun Wang .

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Wang, X., Wang, X., Wilkes, D.M. (2020). A Fast Image Retrieval Method Based on A Quantization Tree. In: Machine Learning-based Natural Scene Recognition for Mobile Robot Localization in An Unknown Environment. Springer, Singapore. https://doi.org/10.1007/978-981-13-9217-7_10

Download citation

DOI: https://doi.org/10.1007/978-981-13-9217-7_10
Published: 13 August 2019
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-9216-0
Online ISBN: 978-981-13-9217-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics

A Fast Image Retrieval Method Based on A Quantization Tree