Abstract
This paper presents an image to text translation platform consisting of image segmentation, region feature extraction, region blob clustering, and translation components. Different multi-label learning methods are suggested for realizing the translation component. Empirical studies show that, on the scene image dataset, the translation component achieves better predictive performance than its counterparts under all the selected evaluation criteria when it employs a dual-random ensemble multi-label classification algorithm, while the multi-label k-nearest neighbor learning algorithm performs well on the jmlr2003 dataset. These results can facilitate the construction of image to text translation and image annotation systems. The findings of this work suggest that different learning algorithms can be used to translate different types of images into text more effectively.
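To make the idea of translation by multi-label classification concrete, the following is a minimal sketch in which an image's feature vector is mapped to a set of keywords. It is not the method evaluated in the paper: it uses a plain nearest-neighbour vote as a simplified stand-in for a multi-label k-nearest neighbor learner, and the function names, the toy feature vectors, and the keyword sets are all hypothetical. Hamming loss is included as one commonly used multi-label evaluation criterion; the abstract does not state which criteria were selected.

```python
import math
from collections import Counter


def knn_multilabel_predict(train_X, train_Y, query, k=3):
    """Predict a keyword set for `query` by voting among its k nearest neighbours.

    Simplified illustration only: a label is kept when more than half of the
    k neighbours carry it. This is not the full ML-kNN procedure.
    """
    # Rank training examples by Euclidean distance to the query vector.
    order = sorted(range(len(train_X)), key=lambda i: math.dist(train_X[i], query))
    neighbours = order[:k]
    # Count how many of the neighbours carry each label.
    votes = Counter(label for i in neighbours for label in train_Y[i])
    return {label for label, count in votes.items() if count > k / 2}


def hamming_loss(true_sets, pred_sets, all_labels):
    """Fraction of (example, label) pairs where truth and prediction disagree."""
    # The symmetric difference holds the labels that are wrongly added or missed.
    errors = sum(len(t ^ p) for t, p in zip(true_sets, pred_sets))
    return errors / (len(true_sets) * len(all_labels))


# Hypothetical toy data: 2-D feature vectors standing in for region-blob
# features, each paired with the keywords annotating that image.
train_X = [(0.9, 0.1), (0.8, 0.2), (0.85, 0.3),
           (0.1, 0.9), (0.2, 0.8), (0.15, 0.7)]
train_Y = [{"sky", "sea"}, {"sky", "sea"}, {"sky"},
           {"tree", "grass"}, {"tree"}, {"tree", "grass"}]
all_labels = ["sky", "sea", "tree", "grass"]

prediction = knn_multilabel_predict(train_X, train_Y, (0.88, 0.15), k=3)
loss = hamming_loss([{"sky", "sea"}], [prediction], all_labels)
```

In this sketch the "translation" of an image is simply the predicted keyword set. A different learner, such as an ensemble over random feature and label subsets, could replace `knn_multilabel_predict` without changing the evaluation code.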
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Nasierding, G., Kouzani, A.Z. (2010). Image to Text Translation by Multi-Label Classification. In: Huang, DS., Zhang, X., Reyes García, C.A., Zhang, L. (eds) Advanced Intelligent Computing Theories and Applications. With Aspects of Artificial Intelligence. ICIC 2010. Lecture Notes in Computer Science, vol 6216. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14932-0_31
DOI: https://doi.org/10.1007/978-3-642-14932-0_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-14931-3
Online ISBN: 978-3-642-14932-0
eBook Packages: Computer Science (R0)