Skip to main content

Multi-label Learning by Hyperparameters Calibration for Treating Class Imbalance

  • Conference paper
  • First Online:
Hybrid Artificial Intelligent Systems (HAIS 2018)

Abstract

Multi-label learning has been becoming an increasingly active area into the machine learning community due to a wide variety of real world problems. However, only over the past few years class balancing for these kind of problems became a topic of interest. In this paper, we present a novel method named hyperparameter calibration to treat class imbalance in a multi-label problem, to this aim we develop an extensive analysis over four real-world databases and two own synthetic databases exhibiting different ratios of imbalance. The empirical analysis shows that the proposed method is able to improve the classification performance when it is combined with three of the most widely used strategies for treating multi-label classification problems.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Train and test sets for emotions, scene, yeast and cal500 databases were obtained from http://simidat.ujaen.es/~research/MLSMOTE/index.html#datasets.

  2. 2.

    https://afgiraldofo@bitbucket.org/afgiraldofo/e1071.git.

References

  1. Tsoumakas, G., Katakis, I.: Multi-label classification: an overview. In. J. Data Warehous. Min. 3(3), 1–3 (2006)

    Google Scholar 

  2. Fürnkranz, J., Hüllermeier, E., Mencía, E.L., Brinker, K.: Multilabel classification via calibrated label ranking. Mach. Learn. 73(2), 133–153 (2008)

    Article  Google Scholar 

  3. Read, J., Pfahringer, B., Holmes, G., Frank, E.: Classifier chains for multi-label classification. Mach. Learn. 85(3), 333 (2011)

    Article  MathSciNet  Google Scholar 

  4. Giraldo-Forero, A., Jaramillo-Garzón, J., Castellanos-Dominguez, C.: A comparison of multi-label techniques based on problem transformation for protein functional prediction. In: Proceedings of the 35th Annual International Conference of the EMBS, pp. 2688–2691. IEEE (2013)

    Google Scholar 

  5. Koziarski, M., Wożniak, M.: CCR: A combined cleaning and resampling algorithm for imbalanced data classification. Int. J. Appl. Math. Comput. Sci. 27(4), 727–736 (2017)

    Article  MathSciNet  Google Scholar 

  6. Batuwita, R., Palade, V.: Class Imbalance Learning Methods for Support Vector Machines (2013)

    Google Scholar 

  7. Charte, F., Rivera, A.J., del Jesus, M.J., Herrera, F.: MLSMOTE: Approaching imbalanced multilabel learning through synthetic instance generation. Knowl-Based Syst. 89, 385–397 (2015)

    Article  Google Scholar 

  8. Tahir, M.A., Kittler, J., Yan, F.: Inverse random under sampling for class imbalance problem and its application to multi-label classification. Pattern Recognit. 45(10), 3738–3750 (2012)

    Article  Google Scholar 

  9. Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Technol. (TIST) 2(3), (2011). Article No. 27

    Article  Google Scholar 

  10. Tomás, J.T., Spolaôr, N., Cherman, E.A., Monard, M.C.: A framework to generate synthetic multi-label datasets. Electron. Notes Theor. Comput. Sci. 302, 155–176 (2014)

    Article  Google Scholar 

  11. Charte, F., Rivera, A.J., del Jesus, M.J., Herrera, F.: Addressing imbalance in multilabel classification: measures and random resampling algorithms. Neurocomputing 163, 3–16 (2015)

    Article  Google Scholar 

  12. Wieczorkowska, A., Synak, P., Raś, Z.W.: Multi-label classification of emotions in music. In: Intelligent Information Processing and Web Mining, pp. 307–315. Springer, Heidelberg (2006)

    Google Scholar 

  13. Boutell, M.R., Luo, J., Shen, X., Brown, C.M.: Learning multi-label scene classification. Pattern Recognit. 37(9), 1757–1771 (2004)

    Article  Google Scholar 

  14. Elisseeff, A., Weston, J.: A kernel method for multi-labelled classification. In: Advances in Neural Information Processing Systems, pp. 681–687 (2002)

    Google Scholar 

  15. Turnbull, D., Barrington, L., Torres, D., Lanckriet, G.: Semantic annotation and retrieval of music and sound effects. IEEE Trans. Audio, Speech, Lang. Process. 16(2), 467–476 (2008)

    Article  Google Scholar 

  16. Meyer, D., Wien, F.T.: Support vector machines. R News 1(3), 23–26 (2001)

    MathSciNet  Google Scholar 

  17. Hsu, C.W., Chang, C.C., Lin, C.J., et al.: A Practical Guide to Support Vector Classification (2010)

    Google Scholar 

  18. Giraldo-Forero, A.F., Jaramillo-Garzón, J.A., Castellanos-Domínguez, C.G.: Evaluation of example-based measures for multi-label classification performance. In: Ortuño, F., Rojas, I. (eds.) IWBBIO 2015. LNCS, vol. 9043, pp. 557–564. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-16483-0_54

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Andrés Felipe Giraldo-Forero .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG, part of Springer Nature

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Giraldo-Forero, A.F., Cardona-Escobar, A.F., Castro-Ospina, A.E. (2018). Multi-label Learning by Hyperparameters Calibration for Treating Class Imbalance. In: de Cos Juez, F., et al. Hybrid Artificial Intelligent Systems. HAIS 2018. Lecture Notes in Computer Science(), vol 10870. Springer, Cham. https://doi.org/10.1007/978-3-319-92639-1_27

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-92639-1_27

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-92638-4

  • Online ISBN: 978-3-319-92639-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics