Approximation of Linear Discriminant Analysis for Word Dependent Visual Features Selection

Glotin, Hervé; Tollari, Sabrina; Giraudet, Pascale

doi:10.1007/11558484_22

Approximation of Linear Discriminant Analysis for Word Dependent Visual Features Selection

Hervé Glotin²⁰,
Sabrina Tollari²⁰ &
Pascale Giraudet²¹

Conference paper

1153 Accesses
1 Citations

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 3708))

Abstract

To automatically determine a set of keywords that describes the content of a given image is a difficult problem, because of (i) the huge dimension number of the visual space and (ii) the unsolved object segmentation problem. Therefore, in order to solve matter (i), we present a novel method based on an Approximation of Linear Discriminant Analysis (ALDA) from the theoretical and practical point of view. Application of ALDA is more generic than usual LDA because it doesn’t require explicit class labelling of each training sample, and however allows efficient estimation of the visual features discrimination power. This is particularly interesting because of (ii) and the expensive manually object segmentation and labelling tasks on large visual database. In first step of ALDA, for each word w _k, the train set is split in two, according if images are labelled or not by w _k. Then, under weak assumptions, we show theoretically that Between and Within variances of these two sets are giving good estimates of the best discriminative features for w _k. Experimentations are conducted on COREL database, showing an efficient word adaptive feature selection, and a great enhancement (+37%) of an image Hierarchical Ascendant Classification (HAC) for which ALDA saves also computational cost reducing by 90% the visual features space.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Barnard, K., Duygulu, P., Freitas, N., Forsyth, D., Blei, D., Jordan, M.: Matching words and pictures. Jour. of Machine Learning Research 3 (2003)
Google Scholar
Barnard, K., Duygulu, P., Guru, R., Gabbur, P., Forsyth, D.: The effects of segmentation and feature choice in a translation model of object recognition. In: Computer Vision and Pattern Recognition, pp. 675–682 (2003)
Google Scholar
Duda, R., Hart, P., Stork, D.: Pattern Classification. Wiley, Chichester (2000)
Google Scholar
Luettin, J., Potamianos, C.N.G.: Hierarchical discriminant features for audio-visual LVCSR. In: Proc. of IEEE Int. Conf. ASSP (2001)
Google Scholar
Glotin, H., Tollari, S.: Fast image auto-annotation with visual vector approximation clusters. In: IEEE EURASIP Content-Based Multimedia Indexing (2005)
Google Scholar
Gosselin, P., Cord, M.: A comparison of active classification methods for content-based image retrieval. In: Proc. CVDB04 with SIGMOD 2004, Paris (2004)
Google Scholar
Liu, Q., Huang, R., Lu, H., Ma, S.: Face recognition using kernel based Fisher discriminant analysis. In: Proc. of Automatic Face & Gesture Recognition (2002)
Google Scholar
Monay, F., Gatica-Perez, D.: On image auto-annotation with latent space models. In: Proc. ACM Int. Conf. on Multimedia (ACM MM), pp. 275–278 (2003)
Google Scholar
Muller, H., Marchand-Maillet, S., Pun, T.: The truth about corel - evaluation in image retrieval. In: Lew, M., Sebe, N., Eakins, J.P. (eds.) CIVR 2002. LNCS, vol. 2383, Springer, Heidelberg (2002)
Chapter Google Scholar
Neti, C., Potamianos, G., Luettin, J., Matthews, I., Glotin, H., Vergyri, D.: Large-vocabulary audio-visual speech recognition: A summary of the J. Hopkins Summer 2000 Wksp. In: IEEE Wksp. Multimedia Signal Process. (2001)
Google Scholar
Shi, J., Malik, J.: Normalized cuts and image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence 22(8), 888–905 (2000)
Article Google Scholar
Tollari, S., Glotin, H.: Keyword dependant selection of visual features and their heterogeneity for image content-based interpretation. Technical Report LSIS.RR.2005.003, LSIS, Similar content submitted to ACMMM 2005 (2005)
Google Scholar
Tollari, S., Glotin, H., Le Maitre, J.: Enhancement of textual images classification using segmented visual contents for image search engine. Multimedia Tools and Applications 25(3), 405–417 (2005)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Laboratoire Sciences de l’Information et des Systèmes-LSIS CNRS UMR6168,
Hervé Glotin & Sabrina Tollari
Département de Biologie, Université du Sud Toulon-Var, F-83957 cedex, La Garde, France
Pascale Giraudet

Authors

Hervé Glotin
View author publications
You can also search for this author in PubMed Google Scholar
Sabrina Tollari
View author publications
You can also search for this author in PubMed Google Scholar
Pascale Giraudet
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

DGA/D4S/MRIS, 94114, Arcueil, France
Jacques Blanc-Talon
Ghent University, 9000, Gent, Belgium
Wilfried Philips
Wireless Technologies Lab, CSIRO ICT Centre, NSW 2122, Marsfield, Australia
Dan Popescu
University of Antwerp, 2610, Wilrijk, Belgium
Paul Scheunders

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Glotin, H., Tollari, S., Giraudet, P. (2005). Approximation of Linear Discriminant Analysis for Word Dependent Visual Features Selection. In: Blanc-Talon, J., Philips, W., Popescu, D., Scheunders, P. (eds) Advanced Concepts for Intelligent Vision Systems. ACIVS 2005. Lecture Notes in Computer Science, vol 3708. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11558484_22

Download citation

DOI: https://doi.org/10.1007/11558484_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29032-2
Online ISBN: 978-3-540-32046-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics