Abstract
We propose a new Web image gathering system which employs a part-based object recognition method. The novelty of our work is introducing the bag-of-keypoints representation into an Web image gathering task instead of color histogram or segmented regions our previous system used. The bag-of-keypoints representation has been proven that it has the excellent ability to represent image concepts in the context of visual object categorization / recognition in spite of its simplicity. Most of object recognition work assumed that complete training data is available. On the other hand, in the Web image gathering task, since images associated with the given keywords are gathered from the Web fully-automatically, complete training images cannot be available.
In this paper, we combine the HTML-based automatic positive training image selection and the bag-of-keypoints-based image selection with an SVM which is a supervised machine learning method. This combination enables the system to gather many images related to given concepts with high precision fully automatically needing no human intervention. Our main objective is to examine if the bag-of-keypoints model is also effective for the Web image gathering task where training images always include some noise. By the experiments, we show the new system outperforms our previous systems, other systems and Google Image Search greatly.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Csurka, G., Bray, C., Dance, C., Fan, L.: Visual categorization with bags of keypoints. In: Proc. of ECCV Workshop on Statistical Learning in Computer Vision, pp. 1–22 (2004)
Feng, H., Shi, R., Chua, T.: A bootstrapping framework for annotating and retrieving WWW images. In: Proc. of ACM International Conference Multimedia, pp. 960–967 (2004)
Fergus, R., Perona, P., Zisserman, A.: A visual category filter for google images. In: Proc. of European Conference on Computer Vision, pp. 242–255 (2004)
Katayama, N., Satoh, S.: The SR-tree: An index structure for high-dimensional nearest neighbor queries. In: Proc. of ACM SIGMOD International Conference on Management of Data, pp. 369–380 (1997)
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60(2), 91–110 (2004)
Nowak, E., Jurie, F., Triggs, W., Vision, M.: Sampling strategies for bag-of-features image classification. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3954, pp. 490–503. Springer, Heidelberg (2006)
Sun, Y., Shimada, S., Morimoto, M.: Visual pattern discovery using Web images. In: Proc. of ACM SIGMM International Workshop on Multimedia Information Retrieval, pp. 127–136 (2006)
Yanai, K.: Image collector: An image-gathering system from the World-Wide Web employing keyword-based search engines. In: Proc. of IEEE International Conference on Multimedia and Expo, pp. 704–707 (2001)
Yanai, K.: Generic image classification using visual knowledge on the web. In: Proc. of ACM International Conference Multimedia, pp. 67–76 (2003)
Yanai, K., Barnard, K.: Probabilistic Web image gathering. In: Proc. of ACM SIGMM International Workshop on Multimedia Information Retrieval, pp. 57–64 (2005)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yanai, K. (2008). Web Image Gathering with a Part-Based Object Recognition Method. In: Satoh, S., Nack, F., Etoh, M. (eds) Advances in Multimedia Modeling. MMM 2008. Lecture Notes in Computer Science, vol 4903. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77409-9_28
Download citation
DOI: https://doi.org/10.1007/978-3-540-77409-9_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-77407-5
Online ISBN: 978-3-540-77409-9
eBook Packages: Computer ScienceComputer Science (R0)