Skip to main content

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6216))

Included in the following conference series:

Abstract

This paper presents an image to text translation platform consisting of image segmentation, region features extraction, region blobs clustering, and translation components. Different multi-label learning method is suggested for realizing the translation component. Empirical studies show that the predictive performance of the translation component is better than its counterparts when employed a dual-random ensemble multi-label classification algorithm that tested on the scene image dataset under all the selected evaluation criteria; while multi-label k-nearest neighbor learning algorithm performed nicely on jmlr2003 dataset. This achievement can facilitate formation of image to text translation and image annotation systems. The findings of this work suggest that different learning algorithms can be used for translating different type of images into text more effectively.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Duygulu, P., Barnard, K., de Freitas, J.F.G., Forsyth, D.A.: Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary. In: Seventh European Conference on Computer Vision (ECCV), vol. (4), pp. 97–112 (2002)

    Google Scholar 

  2. Barnard, K., Duygulu, P., Forsyth, D., de Freitas, N., Blei, D., Jordan, M.I.: Matching Words and Pictures. J. Machine Learning Research 3, 1107–1135 (2003)

    Article  MATH  Google Scholar 

  3. Nasierding, G., Kouzani, A.Z.: Image to Text Translation: A Review. In: Proceedings of international Conference on Humanized Systems, Beijing, pp. 378–383 (2008)

    Google Scholar 

  4. Tsai, C.-F., Huang, C.: Automatic Annotating Images with Keywords: A Review of Image Annotation Systems. Recent Patterns on Computer Science 1, 55–68 (2008)

    Article  Google Scholar 

  5. Song, H., Li, X.: Automatic Image Annotation based on Improved Relevance Model. In: Asia-Pacific Conference on Information Processing. IEEE Computer Society Press, Los Alamitos (2009)

    Google Scholar 

  6. Wang, M., Zhou, X., Chua, T.-S.: Automatic Image Annotation via Local Multi-Label Classification. In: Proceedings of the international conference on Content-based image and video retrieval (CIVR’08), Niagara Falls, Canada, pp. 17–26 (2008)

    Google Scholar 

  7. Kang, F., Jin, R., Sukthankar, R.: Correlated Label Propagation with Application to Multi-label Learning. In: Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’06), pp. 291–294, 1719–1726 (2006)

    Google Scholar 

  8. Nasierding, G., Tsoumakas, G., Kouzani, A.Z.: Clustering Based Multi-Label Classification for Image Annotation and Retrieval. In: IEEE International Conference on Systems, Man, and Cybernetics, pp. 4627–4632 (2009)

    Google Scholar 

  9. Boutell, M.R., Luo, V., Shen, X., Brown, C.M.: Learning Multi-label Scene Classification. Pattern Recognition 37, 1757–1771 (2004)

    Article  Google Scholar 

  10. Fu”rnkranz, J., Hullermeier, E., Mencia, E.L., Brinker, K.: Multilabel Classification via Calibrated Label Ranking. Journal of Machine Learning 73, 133–153 (2008)

    Article  Google Scholar 

  11. Tsoumakas, G., Katakis, I., Vlahavas, I.: Mining Multi-label Data. In: Maimon, O., Rokach, L. (eds.) Data Mining and Knowledge Discovery Handbook, 2nd edn. Springer, Heidelberg (2010)

    Google Scholar 

  12. Kouzani, A.Z., Nasierding, G.: Multi-label Classification by BCH Code and Random Forests. J. Recent Trends in Engineering 2(1), 113–116 (2009)

    Google Scholar 

  13. Zhang, M.L., Zhou, Z.H.: ML – KNN: A Lazy Learning Approach to Multi-Label Learning. Pattern Recognition 40(7), 2038–2048 (2007)

    Article  MATH  Google Scholar 

  14. Tsoumakas, G., Katakis, I., Vlahavas, I.: Random k-Labelsets for Multi-Label Classification. IEEE Transactions on Knowledge Discovery and Data Engineering (2010)

    Google Scholar 

  15. Tsoumakas, G., Katakis, I., Vlahavas, I.: Effective and Efficient Multi-label Classification in Domains with Large Number of Labels. In: Proceedings of ECML/PKDD 2008 Workshop on Mining Multidimensional Data (MMD’08), Antwerp, Belgium (2008)

    Google Scholar 

  16. Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques. Morgan Kaufmann, San Francisco (2005)

    MATH  Google Scholar 

  17. Ho, T.K.: The Random Subspace Method for Constructing Decision Forests. IEEE Transactions on Pattern Analysis and Machine Intelligence 20(8), 832–844 (1998)

    Article  Google Scholar 

  18. Bryll, R., Gutierrez-Osuna, R., Quek, F.: Attribute Bagging: Improving Accuracy of Classifier Ensembles by Using Random Feature Subsets. Pattern Recognition 36(6), 1291–1302 (2003)

    Article  MATH  Google Scholar 

  19. Nasierding, G., Duc, B.V., Lee, S.L.A., Kouzani, A.Z.: Dual-Random Ensemble Method for Multi-Label Classification of Biological Data. In: IEEE International Symposium on Bioelectronics and Bioinformatics, RMIT, Melbourne, pp. 49–52 (December 2009)

    Google Scholar 

  20. Duda, R., Hart, R., Stork, D.: Pattern Classification, 2nd edn. Wiley, New York (2001)

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Nasierding, G., Kouzani, A.Z. (2010). Image to Text Translation by Multi-Label Classification. In: Huang, DS., Zhang, X., Reyes García, C.A., Zhang, L. (eds) Advanced Intelligent Computing Theories and Applications. With Aspects of Artificial Intelligence. ICIC 2010. Lecture Notes in Computer Science(), vol 6216. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14932-0_31

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-14932-0_31

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-14931-3

  • Online ISBN: 978-3-642-14932-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics