Image to Text Translation by Multi-Label Classification

Nasierding, Gulisong; Kouzani, Abbas Z.

doi:10.1007/978-3-642-14932-0_31


Part of the book series: Lecture Notes in Computer Science (LNAI, volume 6216)

Included in the following conference series:

International Conference on Intelligent Computing


Abstract

This paper presents an image-to-text translation platform consisting of image segmentation, region feature extraction, region blob clustering, and translation components. Different multi-label learning methods are suggested for realizing the translation component. Empirical studies show that, on the scene image dataset, the translation component achieves better predictive performance than its counterparts under all the selected evaluation criteria when it employs a dual-random ensemble multi-label classification algorithm, while the multi-label k-nearest neighbor learning algorithm performs well on the jmlr2003 dataset. This achievement can facilitate the formation of image-to-text translation and image annotation systems. The findings of this work suggest that different learning algorithms can be used to translate different types of images into text more effectively.
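As a rough illustration of what a multi-label translation component does, the sketch below assigns a set of keywords to an image feature vector by majority vote among its nearest training neighbours. It is a deliberately simplified stand-in, not the ML-kNN or dual-random ensemble algorithms evaluated in the paper; the function name, the toy feature vectors, the keywords, and the `threshold` parameter are all assumptions made for this example.

```python
from collections import Counter
from math import dist  # Euclidean distance, Python 3.8+


def knn_multilabel_predict(train_features, train_labels, query, k=3, threshold=0.5):
    """Return every label carried by at least `threshold` of the k nearest items.

    Hypothetical helper for illustration only: a plain neighbour vote,
    not the Bayesian ML-kNN procedure.
    """
    # Rank training items by distance to the query feature vector.
    ranked = sorted(range(len(train_features)),
                    key=lambda i: dist(train_features[i], query))
    neighbours = ranked[:k]
    # Count how many of the k neighbours carry each label.
    votes = Counter(label for i in neighbours for label in train_labels[i])
    return {label for label, count in votes.items() if count / k >= threshold}


# Toy, made-up feature vectors and keyword sets (not data from the paper).
features = [(0.1, 0.2), (0.15, 0.25), (0.9, 0.8), (0.85, 0.9), (0.2, 0.1)]
labels = [{"sky", "sea"}, {"sky"}, {"forest"}, {"forest", "mountain"}, {"sky", "sea"}]

predicted = knn_multilabel_predict(features, labels, query=(0.12, 0.18), k=3)
print(sorted(predicted))
```

Because each training item may carry several keywords, the prediction is a set rather than a single class, which is what distinguishes the multi-label setting from ordinary classification.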



Author information

Authors and Affiliations

Department of Computer Science, Xinjiang Normal University, No. 19 Xin Yi Rd, Urumqi, P.R. China, 830054
Gulisong Nasierding
School of Engineering, Deakin University, Geelong, VIC 3217, Australia
Gulisong Nasierding & Abbas Z. Kouzani


Editor information

Editors and Affiliations

Chinese Academy of Sciences, Intelligent Computing Laboratory, P.O. Box 1130, 230031, Hefei, Anhui, China
De-Shuang Huang
Department of Chemistry, University of Louisville, 2320 South Brook Street, 40292, Louisville, KY, USA
Xiang Zhang
Department of Computational Sciences, National Institute of Astrophysics Optics and Electronics, Luis E. Erro #1, 72840, Tonantzintla, Puebla, Mexico
Carlos Alberto Reyes García
Department of Computing, The Hong Kong Polytechnic University, Hong Kong, China
Lei Zhang


About this paper

Cite this paper

Nasierding, G., Kouzani, A.Z. (2010). Image to Text Translation by Multi-Label Classification. In: Huang, DS., Zhang, X., Reyes García, C.A., Zhang, L. (eds) Advanced Intelligent Computing Theories and Applications. With Aspects of Artificial Intelligence. ICIC 2010. Lecture Notes in Computer Science(), vol 6216. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14932-0_31


DOI: https://doi.org/10.1007/978-3-642-14932-0_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-14931-3
Online ISBN: 978-3-642-14932-0
eBook Packages: Computer Science, Computer Science (R0)
