Skip to main content

Multi-label Classification with a Constrained Minimum Cut Model

  • Chapter
  • First Online:
Real World Data Mining Applications

Part of the book series: Annals of Information Systems ((AOIS,volume 17))

  • 2856 Accesses

Abstract

Multi-label classification has received more attention recently in the fields of data mining and machine learning. Though many approaches have been proposed, the critical issue of how to combine single labels to form a multi-label remains challenging. In this work, we propose a novel multi-label classification approach that each label is represented by two exclusive events: the label is selected or not selected. Then a weighted graph is used to represent all the events and their correlations. The multi-label learning is transformed into finding a constrained minimum cut of the weighted graph. In the experiments, we compare the proposed approach with the state-of-the-art multi-label classifier ML-KNN, and the results show that the new approach is efficient in terms of all the popular metrics used to evaluate multi-label classification performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Brinker, K., Hüllermeier, E.: Case-based multilabel ranking. In: M.M. Veloso, M.M. Veloso (eds.) IJCAI, pp. 702–707. (2007).

    Google Scholar 

  2. Cheng, W., Hüllermeier, E.: Combining instance-based learning and logistic regression for multilabel classification. Machine Learn. 76(2–3), 211–225. http://dx.doi.org/10.1007/s10994-009-5127-5 (2009). doi:10.1007/s10994-009-5127-5

  3. Clare, A., King, R.D.: Knowledge discovery in multi-label phenotype data. In: PKDD '01: Proceedings of the 5th European Conference on Principles of Data Mining and Knowledge Discovery, pp. 42–53. Springer-Verlag, London, UK (2001)

    Google Scholar 

  4. De Comité, F., Gilleron, R., Tommasi, M.: Learning multi-label alternating decision trees from texts and data. pp. 251–274. http://dx.doi.org/10.1007/3-540-45065-3_4 (2003). doi:10.1007/3-540-45065-3_4

  5. Dembczynski, K., Waegeman, W., Cheng, W., H!§ ullermeier, E.: On label dependence in multi-label classification. In: MLD 2010: 2nd International Workshop on learning from Multi-Label Data (2010)

    Google Scholar 

  6. Elisseeff, A., Weston, J.: Kernel methods for multi-labelled classification and categorical regression problems. In: Advances in Neural Information Processing Systems 14, pp. 681–687. MIT Press (2001)

    Google Scholar 

  7. Elisseeff, A., Weston, J.: A kernel method for multi-labelled classification. In: Annual ACM Conference on Research and Development in Information Retrieval, pp. 274–281. http://citeseerx.ist.psu.edu/viewdoc/summary? (2005). doi:10.1.1.18.24 23http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.18.24 23

  8. Fujino, A., Isozaki, H.: Multi-label classification using logistic regression models for ntcir-7 patent mining task. In: Proceedings of NTCIR-7 Workshop Meeting (2008)

    Google Scholar 

  9. Ghamrawi, N., McCallum, A.: Collective multi-label classification. In: CIKM '05: Proceedings of the 14th ACM International Conference on Information and Knowledge Management, pp. 195–200. ACM, New York, NY, USA (2005). doi:http://doi.acm.org/10.1145/1099554.1099591

  10. Godbole, S., Sarawagi, S.: Discriminative methods for multi-labeled classification. In: Proceedings of the 8th Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 22–30. Springer (2004)

    Google Scholar 

  11. Gross, J., Yellen, J.: Graph Theory and its Applications. CRC Press, Boca Raton (1998)

    Google Scholar 

  12. McCallum, A.K.: Multi-label text classification with a mixture model trained by EM algorithm. http://citeseer.ist.psu.edu/mccallum99multilabel.html (1999).

  13. Nakos, G., Joyner, D.: Linear algebra with applications, pp. 472–473. Brooks/Cole Publishing Company. Pacific Grove, California, United States. (1998)

    Google Scholar 

  14. Ramage, D., Hall, D., Nallapati, R., Manning, C.D.: Labeled LDA: A supervised topic model for credit attribution in multi-labeled corpora. In: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, pp. 248–256. Association for Computational Linguistics, Singapore. http://www.aclweb.org/anthology/D/D09/D09-1026 (2009).

  15. Read, J., Pfahringer, B., Holmes, G.: Multi-label classification using ensembles of pruned sets. In: ICDM '08: Proceedings of the 2008 Eighth IEEE International Conference on Data Mining, vol. 0, pp. 995–1000. IEEE Computer Society, Washington, DC, USA. http://dx.doi.org/10.1109/ICDM.2008.74 (2008) doi:10.1109/ICDM.2008.74.

  16. Read, J., Pfahringer, B., Holmes, G., Frank, E.: Classifier chains for multi-label classification. In: ECML PKDD '09: Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases, pp. 254–269. Springer-Verlag, Berlin, Heidelberg (2009).

    Google Scholar 

  17. Rousu, J., Saunders, C., Szedmak, S., Shawe-Taylor, J.: Kernel-based learning of hierarchical multilabel classification models. J. Machine Learn. Res. 7, 1601–1626 (2006)

    Google Scholar 

  18. Schapire, R.E., Singer, Y.: Boostexter: a boosting-based system for text categorization. Machine Learn. 39(2/3), 135–168. http://citeseerx.ist.psu.edu/viewdoc/summary? (2000). doi:10.1.1.33.16 66

  19. Spielman, D.: Spectral graph theory and its applications. Foundations of Computer Science, 2007. 48th Annual IEEE Symposium on FOCS '07, pp. 29–38 (2007)

    Google Scholar 

  20. Spyromitros, E., Tsoumakas, G., Vlahavas, I.: An empirical study of lazy multilabel classification algorithms. In: SETN '08: Proceedings of the 5th Hellenic conference on Artificial Intelligence, pp. 401–406. Springer-Verlag, Berlin, Heidelberg (2008) doi:http://dx.doi.org/10.1007/978-3-540-87881-0_40

  21. Streich, A., Buhmann, J.: Classification of multi-labeled data: A generative approach. pp. 390–405. http://dx.doi.org/10.1007/978-3-540-87481-2_26 (2008). doi:10.1007/978-3-540-87481-2_26

  22. Tenenboim, L., Rokach, L., Shapira, B.: Identification of label dependencies for multi-label classification. In: MLD 2010: 2nd International Workshop on learning from Multi-Label Data (2010)

    Google Scholar 

  23. Tsoumakas, G., Katakis, I.: Multi label classification: an overview. Int. J. Data Warehousing Mining 3(3), 1–13. http://mlkd.csd.auth.gr/publication_details.asp?publicationID =219 (2007).

  24. Tsoumakas, G., Katakis, I., Vlahava, I.: Mining multi-label data. In: Maimon, O., Rokach, L. (eds.) Data mining and knowledge discovery handbook, 2nd edn, pp. 667–685. Springer New York (2010)

    Google Scholar 

  25. Tsoumakas, G., Katakis, I., Vlahavas, I.: A review of multi-label classification methods. Proceedings of the 2nd ADBIS Workshop on Data Mining and Knowledge Discovery (ADMKD 2006) (2006)

    Google Scholar 

  26. Tsoumakas, G., Vlahavas, I.: Random k-labelsets: An ensemble method for multilabel classification. In: ECML '07: Proceedings of the 18th European Conference on Machine Learning, pp. 406–417. Springer-Verlag, Berlin, Heidelberg (2007). doi:http://dx.doi.org/10.1007/978-3-540-74958-5-38

  27. Ueda, N., Saito, K.: Parametric mixture models for multi-labeled text. http://citeseer.ist.psu.edu/ueda03parametric.html (2002)http://citeseer.ist.psu.edu/ueda03parametric.html

  28. Vens, C., Struyf, J., Schietgat, L., Džeroski, S., Blockeel, H.: Decision trees for hierarchical multi-label classification. Machine Learn. 2(73), 185–214. http://dx.doi.org/10.1007/s10994-008-5077-3 (2008). doi:10.1007/s10994-008-5077-3

  29. Wang, H., Huang, M., Zhu, X.: A generative probabilistic model for multi-label classification. In: ICDM '08: Proceedings of the 2008 Eighth IEEE International Conference on Data Mining, pp. 628–637. IEEE Computer Society, Washington, DC, USA. http://dx.doi.org/10.1109/ICDM.2008.86 (2008). doi:10.1109/ICDM.2008.86

  30. Wikipedia: Cut (Graph theory) (9 Jan 2011). http://en.wikipedia.org/wiki/Cut_(graph_theory)

  31. Zhang, M.L., Pe na, J.M., Robles, V.: Feature selection for multi-label naive Bayes classification. Inf. Sci. 179(19), 3218–3229 (2009). doi:http://dx.doi.org/10.1016/j.ins.2009.06.010

  32. Zhang, M.L., Zhou, Z.H.: Ml-knn: A lazy learning approach to multi-label learning. Pattern Recognit. 40(7), 2038–2048. http://dx.doi.org/10.1016/j.patcog.2006.12.019 (2007). doi:10.1016/j.patcog.2006.12.019

  33. Zhang, M.L., Zhou, Z.H.: Ml-knn codes. http://lamda.nju.edu.cn/datacode/MLkNN.htm (2009)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Guangzhi Qu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Qu, G., Sethi, I., Hartrick, C., Zhang, H. (2015). Multi-label Classification with a Constrained Minimum Cut Model. In: Abou-Nasr, M., Lessmann, S., Stahlbock, R., Weiss, G. (eds) Real World Data Mining Applications. Annals of Information Systems, vol 17. Springer, Cham. https://doi.org/10.1007/978-3-319-07812-0_5

Download citation

Publish with us

Policies and ethics