A Study on Multi-label Classification

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 7987)

Abstract

Multi-label classification arises in many real-world applications. This paper empirically studies the performance of a variety of multi-label classification algorithms, some based on problem transformation and others on algorithm adaptation. Our experimental results show that the adaptation-based Multi-Label K-Nearest Neighbor performs best, followed by Random k-Label Sets, then Classifier Chains and Binary Relevance. AdaBoost.MH performs worst, followed by Pruned Problem Transformation. Our experimental results also give us confidence that correlations exist among labels. These insights shed light on future research directions for multi-label classification.
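
As a concrete illustration of the two families of methods compared above, the sketch below shows two problem-transformation approaches named in the abstract: Binary Relevance, which trains one independent binary classifier per label, and Classifier Chains, which additionally feeds earlier label predictions into later classifiers so that label correlations can be exploited (cf. Read et al. [3]). This is only a minimal sketch on synthetic data, assuming scikit-learn is available; it is not the authors' experimental setup (the paper cites the Mulan Java library [10]).

# Minimal sketch (assumption: scikit-learn is installed); illustrative only,
# not the paper's setup. It contrasts Binary Relevance with Classifier Chains
# on synthetic multi-label data.
from sklearn.datasets import make_multilabel_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import hamming_loss
from sklearn.model_selection import train_test_split
from sklearn.multioutput import ClassifierChain, MultiOutputClassifier

# Synthetic data: each sample can carry several of 5 labels at once.
X, Y = make_multilabel_classification(n_samples=1000, n_features=20,
                                      n_classes=5, n_labels=2, random_state=0)
X_tr, X_te, Y_tr, Y_te = train_test_split(X, Y, test_size=0.3, random_state=0)

base = LogisticRegression(max_iter=1000)

# Binary Relevance: one independent binary classifier per label.
br = MultiOutputClassifier(base).fit(X_tr, Y_tr)

# Classifier Chain: each classifier also sees the previously predicted
# labels, letting it exploit label correlations (Read et al. [3]).
cc = ClassifierChain(base, order="random", random_state=0).fit(X_tr, Y_tr)

print("Binary Relevance  Hamming loss:", hamming_loss(Y_te, br.predict(X_te)))
print("Classifier Chains Hamming loss:", hamming_loss(Y_te, cc.predict(X_te)))

Lower Hamming loss is better; whether the chain improves on independent binary classifiers depends on how strongly the labels in the data are correlated.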

References

  1. Tsoumakas, G., Katakis, I., Vlahavas, I.: Mining Multi-label Data. In: Maimon, O., Rokach, L. (eds.) Data Mining and Knowledge Discovery Handbook, 2nd edn. Springer (2010)

  2. Klimt, B., Yang, Y.: Introducing the Enron corpus. In: First Conference on Email and Anti-Spam, CEAS (2004)

  3. Read, J., Pfahringer, B., Holmes, G., Frank, E.: Classifier chains for multi-label classification. Machine Learning 85(3), 333–359 (2011)

  4. Tsoumakas, G., Katakis, I.: Multi-label Classification: An Overview. International Journal of Data Warehousing and Mining (2007)

  5. Zhang, M.-L., Zhou, Z.-H.: ML-KNN: A lazy learning approach to multi-label learning. Pattern Recognition 40(7), 2038–2048 (2007)

  6. Arunadevi, J., Rajamani, V.: An Evolutionary Multi Label Classification Using Associative Rule Mining. International Journal of Soft Computing 6(2), 20–25 (2011)

  7. Tsoumakas, G., Katakis, I., Vlahavas, I.: Random k-labelsets for multilabel classification. IEEE Transactions on Knowledge and Data Engineering 23(7), 1079–1089 (2011)

  8. Sun, Y.-Y., Zhang, Y., Zhou, Z.-H.: Multi-label learning with weak label. In: Twenty-Fourth AAAI Conference on Artificial Intelligence (2010)

  9. Cherman, E.A., Monard, M.C., Metz, J.: Multi-label Problem Transformation Methods: A Case Study. CLEI Electronic Journal 14(1), 4 (2011)

  10. Tsoumakas, G., et al.: MULAN: A Java library for multi-label learning. Journal of Machine Learning Research 12, 2411–2414 (2011)

  11. Li, T., Zhang, C., Zhu, S.: Empirical studies on multi-label classification. In: Proceedings of the 18th IEEE International Conference on Tools with Artificial Intelligence, ICTAI 2006 (2006)

  12. Huang, D.-S., McGinnity, M., Heutte, L., Zhang, X.-P. (eds.): ICIC 2010. CCIS, vol. 93. Springer, Heidelberg (2010)

  13. Cherry, C., Mohammad, S.M., de Bruijn, B.: Binary classifiers and latent sequence models for emotion detection in suicide notes. Biomedical Informatics Insights 5(suppl. 1), 147 (2012)

  14. Read, J.: A pruned problem transformation method for multi-label classification. In: Proc. 2008 New Zealand Computer Science Research Student Conference, NZCSRS 2008 (2008)

  15. Freund, Y., Schapire, R., Abe, N.: A short introduction to boosting. Journal of Japanese Society for Artificial Intelligence 14, 771–780 (1999)

  16. Zhang, M.-L., Zhang, K.: Multi-label learning by exploiting label dependency. In: Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM (2010)

  17. Schapire, R.E., Singer, Y.: BoosTexter: A boosting-based system for text categorization. Machine Learning 39(2/3), 135–168 (2000)

  18. Tsoumakas, G., Zhang, M.-L., Zhou, Z.-H.: Tutorial on learning from multi-label data. In: ECML/PKDD 2009, Bled, Slovenia (2009), http://www.ecmlpkdd2009.net/wp-content/uploads/2009/08/learningfrom-multi-label-data.pdf

  19. Kumar, V., Wu, X.: AdaBoost. In: The Top Ten Algorithms in Data Mining, ch. 7, pp. 127–144. CRC Press (2009)

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Tawiah, C.A., Sheng, V.S. (2013). A Study on Multi-label Classification. In: Perner, P. (ed.) Advances in Data Mining: Applications and Theoretical Aspects. ICDM 2013. Lecture Notes in Computer Science (LNAI), vol. 7987. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-39736-3_11

  • DOI: https://doi.org/10.1007/978-3-642-39736-3_11

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-39735-6

  • Online ISBN: 978-3-642-39736-3

  • eBook Packages: Computer Science, Computer Science (R0)
