Skip to main content

Approximation of Linear Discriminant Analysis for Word Dependent Visual Features Selection

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 3708))

Abstract

To automatically determine a set of keywords that describes the content of a given image is a difficult problem, because of (i) the huge dimension number of the visual space and (ii) the unsolved object segmentation problem. Therefore, in order to solve matter (i), we present a novel method based on an Approximation of Linear Discriminant Analysis (ALDA) from the theoretical and practical point of view. Application of ALDA is more generic than usual LDA because it doesn’t require explicit class labelling of each training sample, and however allows efficient estimation of the visual features discrimination power. This is particularly interesting because of (ii) and the expensive manually object segmentation and labelling tasks on large visual database. In first step of ALDA, for each word w k , the train set is split in two, according if images are labelled or not by w k . Then, under weak assumptions, we show theoretically that Between and Within variances of these two sets are giving good estimates of the best discriminative features for w k . Experimentations are conducted on COREL database, showing an efficient word adaptive feature selection, and a great enhancement (+37%) of an image Hierarchical Ascendant Classification (HAC) for which ALDA saves also computational cost reducing by 90% the visual features space.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Barnard, K., Duygulu, P., Freitas, N., Forsyth, D., Blei, D., Jordan, M.: Matching words and pictures. Jour. of Machine Learning Research 3 (2003)

    Google Scholar 

  2. Barnard, K., Duygulu, P., Guru, R., Gabbur, P., Forsyth, D.: The effects of segmentation and feature choice in a translation model of object recognition. In: Computer Vision and Pattern Recognition, pp. 675–682 (2003)

    Google Scholar 

  3. Duda, R., Hart, P., Stork, D.: Pattern Classification. Wiley, Chichester (2000)

    Google Scholar 

  4. Luettin, J., Potamianos, C.N.G.: Hierarchical discriminant features for audio-visual LVCSR. In: Proc. of IEEE Int. Conf. ASSP (2001)

    Google Scholar 

  5. Glotin, H., Tollari, S.: Fast image auto-annotation with visual vector approximation clusters. In: IEEE EURASIP Content-Based Multimedia Indexing (2005)

    Google Scholar 

  6. Gosselin, P., Cord, M.: A comparison of active classification methods for content-based image retrieval. In: Proc. CVDB04 with SIGMOD 2004, Paris (2004)

    Google Scholar 

  7. Liu, Q., Huang, R., Lu, H., Ma, S.: Face recognition using kernel based Fisher discriminant analysis. In: Proc. of Automatic Face & Gesture Recognition (2002)

    Google Scholar 

  8. Monay, F., Gatica-Perez, D.: On image auto-annotation with latent space models. In: Proc. ACM Int. Conf. on Multimedia (ACM MM), pp. 275–278 (2003)

    Google Scholar 

  9. Muller, H., Marchand-Maillet, S., Pun, T.: The truth about corel - evaluation in image retrieval. In: Lew, M., Sebe, N., Eakins, J.P. (eds.) CIVR 2002. LNCS, vol. 2383, Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  10. Neti, C., Potamianos, G., Luettin, J., Matthews, I., Glotin, H., Vergyri, D.: Large-vocabulary audio-visual speech recognition: A summary of the J. Hopkins Summer 2000 Wksp. In: IEEE Wksp. Multimedia Signal Process. (2001)

    Google Scholar 

  11. Shi, J., Malik, J.: Normalized cuts and image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence 22(8), 888–905 (2000)

    Article  Google Scholar 

  12. Tollari, S., Glotin, H.: Keyword dependant selection of visual features and their heterogeneity for image content-based interpretation. Technical Report LSIS.RR.2005.003, LSIS, Similar content submitted to ACMMM 2005 (2005)

    Google Scholar 

  13. Tollari, S., Glotin, H., Le Maitre, J.: Enhancement of textual images classification using segmented visual contents for image search engine. Multimedia Tools and Applications 25(3), 405–417 (2005)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Glotin, H., Tollari, S., Giraudet, P. (2005). Approximation of Linear Discriminant Analysis for Word Dependent Visual Features Selection. In: Blanc-Talon, J., Philips, W., Popescu, D., Scheunders, P. (eds) Advanced Concepts for Intelligent Vision Systems. ACIVS 2005. Lecture Notes in Computer Science, vol 3708. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11558484_22

Download citation

  • DOI: https://doi.org/10.1007/11558484_22

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-29032-2

  • Online ISBN: 978-3-540-32046-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics