A Ground-Truth Training Set for Hierarchical Clustering in Content-based Image Retrieval

Huijsmans, D. P.; Sebe, N.; Lew, M. S.

doi:10.1007/3-540-40053-2_44

A Ground-Truth Training Set for Hierarchical Clustering in Content-based Image Retrieval

D. P. Huijsmans⁵,
N. Sebe⁵ &
M. S. Lew⁵

Conference paper
First Online: 01 January 2003

522 Accesses
2 Citations

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1929))

Abstract

Progress in Content-Based Image Retrieval (CBIR) is ham- pered by the absence of well-documented and validated test-sets that provide ground-truth for the performance evaluation of image indexing, retrieval and clustering tasks. For quick access to large (tenthousands or millions of images) digital image collections a hierarchically structured indexing or browsing mechanism based on clusters of similar images at various coarse to fine levels is highly wanted. The Leiden 19th-Century Portrait Database (LCPD), that consists of over 16,000 scanned studio portraits (so-called Cartes de Visite CdV), happens to have a clearly delineated set of clusters in the studio logo backside images. Clusters of similar or semantically identical logos can also be formed on a number of levels that show a clear hierarchy. The Leiden Imaging and Multimedia Group is constructing a CD-ROM with a well-documented set of studio portraits and logos that can serve as ground-truth for feature performance evaluation in domains beside color-indexing. Its grey-level image lay-out characteristics are also described by various precalculated feature vector sets. For both portraits (near copy pairs) and studio logos (clusters of identical logos) test-sets will be provided and described at various clustering levels. The statistically significant number of test-set images embedded in a realistically large environment of narrow-domain images are presented to the CBIR community to enable selection of more optimal indexing and retrieval approaches as part of an internationally defined test-set that comprises test-sets specifically designed for color-, texture- and shape retrieval evaluation.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Murtagh, F. (ed.): Special Issue on Clustering and Classification. The Computer Journal 41–8 (1998)
Google Scholar
Murtagh, F.: A Survey of Recent Advances in Hierarchical Clustering Algorithms. The Computer Journal 26 (1983) 354–359
MATH Google Scholar
Hartigan, J. A.: Clustering Algorithms. Wiley (1975)
Google Scholar
Dimai, A.: Assessment of Effectiveness of Content Based Image Retrieval Systems. Conf. Proc. Visual’99 LNCS 1614 (1999) 525–532
Google Scholar
Ma, W., Zhang, H.: Benchmarking of Image Features for Content-based Retrieval. IEEE (1998) 253–257
Google Scholar
DeVijver, P.A., Kittler, J.: Pattern Recognition A Statistical Approach. Prentice-Hall (1982)
Google Scholar
Kittler, J., Hatef, M., Duin, R.P.W.: Combining Classifiers. IEEE Proc ICPR’96 (1996) 2B 897–901
Google Scholar
Sebe, N., Lew, M., Huijsmans, D.P.: Which Ranking Metric is Optimal? With Applications in Image Retrieval and Stereo Matching. Conf Proc ICPR’98 (1998) 265–271
Google Scholar
Huijsmans, D.P., Lew, M.S., Denteneer, D.: Quality Measures for Interactive Image Retrieval with a Performance Evaluation of Two 3x3 Texel-Based Methods. Conf. Proc. ICIAP’97 LNCS 1311 (1997) 22–29
Google Scholar

Download references

Author information

Authors and Affiliations

LIACS, Leiden University, RA Leiden, P.O. Box 9512, 2300, The Netherlands
D. P. Huijsmans, N. Sebe & M. S. Lew

Authors

D. P. Huijsmans
View author publications
You can also search for this author in PubMed Google Scholar
N. Sebe
View author publications
You can also search for this author in PubMed Google Scholar
M. S. Lew
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Claude Bernard University of Lyon, LISI - 502, INSA de Lyon, 69621, Villeurbanne Cedex, France
Robert Laurini

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Huijsmans, D.P., Sebe, N., Lew, M.S. (2000). A Ground-Truth Training Set for Hierarchical Clustering in Content-based Image Retrieval. In: Laurini, R. (eds) Advances in Visual Information Systems. VISUAL 2000. Lecture Notes in Computer Science, vol 1929. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-40053-2_44

Download citation

DOI: https://doi.org/10.1007/3-540-40053-2_44
Published: 11 February 2003
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41177-2
Online ISBN: 978-3-540-40053-0
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics