Skip to main content

Improving the Accuracy of Global Feature Fusion Based Image Categorisation

  • Conference paper
Semantic Multimedia (SAMT 2007)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4816))

Included in the following conference series:

Abstract

In this paper we consider the task of categorising images of the Corel collection into semantic classes. In our earlier work, we demonstrated that state-of-the-art accuracy of supervised categorising of these images could be improved significantly by fusion of a large number of global image features. In this work, we preserve the general framework, but improve the components of the system: we modify the set of image features to include interest point histogram features, perform elementary feature classification with support vector machines (SVM) instead of self-organising map (SOM) based classifiers, and fuse the classification results with either an additive, multiplicative or SVM-based technique. As the main result of this paper, we are able to achieve a significant improvement of image categorisation accuracy by applying these generic state-of-the-art image content analysis techniques.

Supported by the Academy of Finland in the projects Neural methods in information retrieval based on automatic content analysis and relevance feedback and Finnish Centre of Excellence in Adaptive Informatics Research. Special thanks to Xiaojun Qi and Yutao Han for helping with the experimental setup.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Andrews, S., Tsochantaridis, I., Hoffman, T.: Support vector machines for multiple-instance learning. In: NIPS 15, pp. 561–568. MIT Press, Cambridge (2003)

    Google Scholar 

  2. Chang, C.-C., Lin, C.-J.: LIBSVM: a library for support vector machines, Software (2001), available at http://www.csie.ntu.edu.tw/~cjlin/libsvm

  3. Chen, Y., Zwang, J.Z.: Image categorization by learning and reasoning with regions. Journal of Machine Learning Research 5, 913–939 (2004)

    Google Scholar 

  4. Snoek, C.G.M., et al.: The MediaMill TRECVID 2006 semantic video search engine. In: TRECVID. TRECVID Online Proceedings (November 2006), http://www-nlpir.nist.gov/projects/tvpubs/tv.pubs.org.html

  5. Everingham, M., Zisserman, A., Williams, C.K.I., Van Gool, L.: The PASCAL Visual Object Classes Challenge (VOC2006) Results (2006), http://www.pascal-network.org/challenges/VOC/voc2006/results.pdf

  6. Hämäläinen, P., Aila, T., Takala, T., Alander, J.: Mutated kd-tree importance sampling. In: SCAI 2006. Proceedings of the The Ninth Scandinavian Conference on Artificial Intelligence, Espoo, Finland, October 2006, pp. 39–45 (2006)

    Google Scholar 

  7. Hauptmann, A.G., Chen, M.-Y., Christel, M., Lin, W.-H., Yan, R., Yang, J.: Multi-lingual broadcast news retrieval. In: TRECVID. TRECVID Online Proceedings (November 2006), http://www-nlpir.nist.gov/projects/tvpubs/tv.pubs.org.html

  8. ISO/IEC: Information technology - Multimedia content description interface - Part 3: Visual, 15938-3:2002(E) (2002)

    Google Scholar 

  9. Kajiya, J.T.: The rendering equation. In: SIGGRAPH 1986, pp. 143–150 (1986)

    Google Scholar 

  10. Kohonen, T.: Self-Organizing Maps, 3rd edn. Springer Series in Information Sciences, vol. 30. Springer, Berlin (2001)

    MATH  Google Scholar 

  11. Koikkalainen, P., Oja, E.: Self-organizing hierarchical feature maps. In: Proceedings of International Joint Conference on Neural Networks, San Diego, CA, USA, vol. II, pp. 279–284 (1990)

    Google Scholar 

  12. Laaksonen, J., Koskela, M., Oja, E.: PicSOM—Self-organizing image retrieval with MPEG-7 content descriptions. IEEE Transactions on Neural Networks 13(4), 841–853 (2002)

    Article  Google Scholar 

  13. Lowe, D.G.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60(2), 91–110 (2004)

    Article  Google Scholar 

  14. Mikolajcyk, K., Schmid, C.: Scale and affine point invariant interest point detectors. International Journal of Computer Vision 60(1), 68–86 (2004)

    Google Scholar 

  15. Over, P., Ianeva, T., Kraaij, W., Smeaton, A.F.: TRECVID 2006 - an introduction. In: TRECVID. TRECVID Online Proceedings (November 2006), http://www-nlpir.nist.gov/projects/tvpubs/tv.pubs.org.html

  16. Qi, X., Han, Y.: Incorporating multiple SVMs for automatic image annotation. Pattern Recognition 40, 728–741 (2007)

    Article  MATH  Google Scholar 

  17. Viitaniemi, V., Laaksonen, J.: Empirical investigations on benchmark tasks for automatic image annotation. In: VISUAL 2007. LNCS, vol. 4781, pp. 93–104. Springer, Heidelberg (2007)

    Google Scholar 

  18. Viitaniemi, V., Laaksonen, J.: Evaluating the performance in automatic image annotation: example case by adaptive fusion of global image features. Signal Processing: Image Communications 22(6), 557–568 (2007)

    Article  Google Scholar 

  19. Wu, T.-F., Lin, C.-J., Weng, R.C.: Probability estimates for multi-class classification by pairwise coupling. J. of Machine Learning Research 5, 975–1005 (2005)

    MathSciNet  Google Scholar 

  20. Zhang, J., Marszałek, M., Lazebnik, S., Schmid, C.: Local features and kernels for classification of texture and object categories: a comprehensive study. International Journal of Computer Vision 73(2), 213–238 (2007)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Bianca Falcidieno Michela Spagnuolo Yannis Avrithis Ioannis Kompatsiaris Paul Buitelaar

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Viitaniemi, V., Laaksonen, J. (2007). Improving the Accuracy of Global Feature Fusion Based Image Categorisation. In: Falcidieno, B., Spagnuolo, M., Avrithis, Y., Kompatsiaris, I., Buitelaar, P. (eds) Semantic Multimedia. SAMT 2007. Lecture Notes in Computer Science, vol 4816. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77051-0_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-77051-0_1

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-77033-6

  • Online ISBN: 978-3-540-77051-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics