LSA-Based Automatic Acquisition of Semantic Image Descriptions

Basili, Roberto; Petitti, Riccardo; Saracino, Dario

doi:10.1007/978-3-540-77051-0_4

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 4816)

Included in the following conference series:

International Conference on Semantic and Digital Media Technologies

Abstract

Web multimedia documents are characterized by visual and linguistic information expressed through structured pages of images and text. Which combinations of these features best generalize the semantic aspects of the overall multimedia information clearly depends on the application. In this paper, an unsupervised image classification technique that combines features from different media levels is proposed. In particular, linguistic descriptions derived from Web pages through Information Extraction are integrated with visual features by means of Latent Semantic Analysis (LSA). Although the higher expressivity increases the complexity of the learning process, the dimensionality reduction implied by LSA keeps the technique widely applicable. An evaluation on an image classification task confirms that the proposed model outperforms methods that act on the individual levels alone. The resulting method is cost-effective and can be easily applied to semi-automatic semantic image labeling tasks, such as those foreseen in collaborative annotation scenarios.
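
The abstract does not spell out how the two media levels are combined, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes that each image is described by one row of linguistic term weights and one row of visual descriptors, that the two blocks are rescaled and concatenated, and that LSA is realized as a truncated SVD of the combined matrix. The function names (`lsa_embed`, `cosine_similarity`), the block normalization, and the toy data are all hypothetical.

```python
import numpy as np


def lsa_embed(text_feats, visual_feats, k):
    """Project images into a k-dimensional latent space (illustrative only).

    text_feats:   (n_images, n_terms) array of term weights
    visual_feats: (n_images, n_visual) array of visual descriptors
    """
    blocks = []
    for block in (text_feats, visual_feats):
        block = np.asarray(block, dtype=float)
        # Assumption of this sketch: scale each modality to unit Frobenius
        # norm so that neither block dominates the decomposition.
        norm = np.linalg.norm(block)
        blocks.append(block / norm if norm > 0 else block)
    combined = np.hstack(blocks)  # shape: (n_images, n_terms + n_visual)

    # Truncated SVD: keep only the k largest singular values.
    u, s, _ = np.linalg.svd(combined, full_matrices=False)
    return u[:, :k] * s[:k]  # shape: (n_images, k)


def cosine_similarity(a, b):
    """Cosine of the angle between two latent vectors."""
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom > 0 else 0.0


# Toy data (invented): images 0-1 share vocabulary and visual traits,
# and so do images 2-3.
text = np.array([
    [2, 1, 0, 0],
    [1, 2, 0, 0],
    [0, 0, 2, 1],
    [0, 0, 1, 2],
])
visual = np.array([
    [0.9, 0.1],
    [0.8, 0.2],
    [0.1, 0.9],
    [0.2, 0.8],
])

embedding = lsa_embed(text, visual, k=2)
same_group = cosine_similarity(embedding[0], embedding[1])
cross_group = cosine_similarity(embedding[0], embedding[2])
print(f"same group: {same_group:.3f}, cross group: {cross_group:.3f}")
```

In a latent space of this kind, an unsupervised classifier (for example a clustering step) would operate on the rows of `embedding` instead of the raw, higher-dimensional feature vectors.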

Author information

Authors and Affiliations

University of Rome "Tor Vergata", Department of Computer Science, Systems and Production, Roma, Italy
Roberto Basili
Exprivia S.p.A, Via Cristoforo Colombo 456, 00145, Roma, Italy
Riccardo Petitti & Dario Saracino

Editor information

Bianca Falcidieno, Michela Spagnuolo, Yannis Avrithis, Ioannis Kompatsiaris, Paul Buitelaar

About this paper

Cite this paper

Basili, R., Petitti, R., Saracino, D. (2007). LSA-Based Automatic Acquisition of Semantic Image Descriptions. In: Falcidieno, B., Spagnuolo, M., Avrithis, Y., Kompatsiaris, I., Buitelaar, P. (eds) Semantic Multimedia. SAMT 2007. Lecture Notes in Computer Science, vol 4816. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77051-0_4

DOI: https://doi.org/10.1007/978-3-540-77051-0_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-77033-6
Online ISBN: 978-3-540-77051-0
eBook Packages: Computer Science, Computer Science (R0)
