Abstract
Multi-label classification has received growing attention in data mining and machine learning. Although many approaches have been proposed, how to combine single labels into a multi-label remains a challenging issue. In this work, we propose a novel multi-label classification approach in which each label is represented by two mutually exclusive events: the label is selected or it is not selected. A weighted graph then represents all the events and their correlations, and multi-label learning is transformed into finding a constrained minimum cut of this graph. In the experiments, we compare the proposed approach with the state-of-the-art multi-label classifier ML-KNN; the results show that the new approach is efficient in terms of all the popular metrics used to evaluate multi-label classification performance.
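As a rough illustration of the idea in the abstract, the sketch below gives each label two event nodes (selected, not selected) and searches for the cheapest cut that keeps exactly one event per label. The abstract does not say how the edge weights or the constraint are defined, so everything here is an assumption made for illustration: the instance node with its `evidence` weights, the `affinity` weights between events, the function name `constrained_min_cut`, and the brute-force search itself. It is not the chapter's formulation.

```python
from itertools import product


def constrained_min_cut(evidence, affinity):
    """Brute-force a cut that keeps exactly one event per label.

    evidence[i] = (w_not_selected, w_selected): hypothetical weights of the
        edges joining an instance node to the two events of label i.
    affinity[((i, a), (j, b))]: hypothetical weight of the edge joining
        event a of label i to event b of label j (a, b in {0, 1}).
    """
    n = len(evidence)
    best_cut, best_assignment = float("inf"), None
    for assignment in product((0, 1), repeat=n):
        # Event (i, assignment[i]) stays on the instance's side, so the
        # evidence edge to the other event of label i is cut.
        cut = sum(evidence[i][1 - assignment[i]] for i in range(n))
        # An event-event edge is cut when exactly one endpoint is kept.
        for ((i, a), (j, b)), weight in affinity.items():
            if (assignment[i] == a) != (assignment[j] == b):
                cut += weight
        if cut < best_cut:
            best_cut, best_assignment = cut, assignment
    return best_assignment, best_cut


# Toy example with made-up integer weights: labels 0 and 1 tend to agree.
evidence = [(2, 8), (5, 5), (9, 1)]
affinity = {((0, 1), (1, 1)): 6, ((0, 0), (1, 0)): 6}
labels, cut = constrained_min_cut(evidence, affinity)
print(labels, cut)  # (1, 1, 0) 8
```

In this toy case label 1 has no evidence either way, and the correlation edges pull it toward the same decision as label 0. The exhaustive search grows as 2^n in the number of labels, so it only serves to make the constraint concrete.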
Copyright information
© 2015 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Qu, G., Sethi, I., Hartrick, C., Zhang, H. (2015). Multi-label Classification with a Constrained Minimum Cut Model. In: Abou-Nasr, M., Lessmann, S., Stahlbock, R., Weiss, G. (eds) Real World Data Mining Applications. Annals of Information Systems, vol 17. Springer, Cham. https://doi.org/10.1007/978-3-319-07812-0_5
DOI: https://doi.org/10.1007/978-3-319-07812-0_5
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-07811-3
Online ISBN: 978-3-319-07812-0
eBook Packages: Business and Economics, Business and Management (R0)