Classification of Multi-labeled Data: A Generative Approach

Streich, Andreas P.; Buhmann, Joachim M.

doi:10.1007/978-3-540-87481-2_26

Andreas P. Streich¹ &
Joachim M. Buhmann¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5212))

Included in the following conference series:

Joint European Conference on Machine Learning and Knowledge Discovery in Databases

5651 Accesses
8 Citations

Abstract

Multi-label classification assigns a data item to one or several classes. This problem of multiple labels arises in fields like acoustic and visual scene analysis, news reports and medical diagnosis. In a generative framework, data with multiple labels can be interpreted as additive mixtures of emissions of the individual sources. We propose a deconvolution approach to estimate the individual contributions of each source to a given data item. Similarly, the distributions of multi-label data are computed based on the source distributions. In experiments with synthetic data, the novel approach is compared to existing models and yields more accurate parameter estimates, higher classification accuracy and ameliorated generalization to previously unseen label sets. These improvements are most pronounced on small training data sets. Also on real world acoustic data, the algorithm outperforms other generative models, in particular on small training data sets.

Download to read the full chapter text

Chapter PDF

Asymptotic analysis of estimators on multi-label data

Article Open access 09 July 2014

Large scale multi-label learning using Gaussian processes

Article Open access 14 April 2021

Revisiting Machine Learning from Crowds a Mixture Model for Grouping Annotations

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Arons, B.: A review of the cocktail party effect. Journal of the American Voice I/O Society 12, 35–50 (1992)
Google Scholar
Boutell, M., Luo, J., Shen, X., Brown, C.: Learning multi-label scene classification. Pattern Recognition, 1757–1771 (2004)
Google Scholar
Zhu, S., Ji, X., Xu, W., Gong, Y.: Multi-labelled classification using maximum entropy method. In: Proceedings of SIGIR 2005 (2005)
Google Scholar
Dietterich, T.G., Bakiri, G.: Solving multiclass learning problems via error-correcting output codes. J. of Articificial Intelligence Research 2, 263–286 (1995)
MATH Google Scholar
Clare, A., King, R.D.: Knowledge discovery in multi-label phenotype data. In: Siebes, A., De Raedt, L. (eds.) PKDD 2001. LNCS (LNAI), vol. 2168, pp. 42–53. Springer, Heidelberg (2001)
Chapter Google Scholar
Elisseeff, A., Weston, J.: Kernel methods for multi-labelled classification and categorical regression problems. In: Proceedings of NIPS 2002 (2002)
Google Scholar
Joachims, T.: Text categorization with support vector machines: learning with many relevant features. In: Nédellec, C., Rouveirol, C. (eds.) ECML 1998. LNCS, vol. 1398. Springer, Heidelberg (1998)
Chapter Google Scholar
McCallum, A.K.: Multi-label text classification with a mixture model trained by EM. In: Proceedings of NIPS 1999 (1999)
Google Scholar
Tsoumakas, G., Katakis, I.: Multi label classification: An Overview. Int. J. of Data Warehousing and Mining 3(3), 1–13 (2007)
Google Scholar
Caruana, R.: Multitask learning. Machine Learning 28(1), 41–75 (1997)
Article Google Scholar
Pols, L.: Spectral analysis and identification of Dutch vowels in monosyllabic words. PhD thesis, Free University of Amsterdam (1966)
Google Scholar
Rabiner, L.R.: A tutorial on hidden markov models and selected applications in speech recognition. In: Readings in speech recognition, pp. 267–296 (1990)
Google Scholar
Hastie, T., Tibshirani, R.: Discriminant analysis by Gaussian Mixtures. J. of the Royal Statist. Soc. B 58, 155–176 (1996)
MATH MathSciNet Google Scholar
Dempster, A., Laird, N., Rubin, D.: Maximum likelihood from incomplete data via the EM algorithm. J. of the Royal Statist. Soc. B 39(1), 138 (1977)
MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Computational Science, ETH Zurich, 8092, Zurich, Switzerland
Andreas P. Streich & Joachim M. Buhmann

Authors

Andreas P. Streich
View author publications
You can also search for this author in PubMed Google Scholar
Joachim M. Buhmann
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Walter Daelemans Bart Goethals Katharina Morik

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Streich, A.P., Buhmann, J.M. (2008). Classification of Multi-labeled Data: A Generative Approach. In: Daelemans, W., Goethals, B., Morik, K. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2008. Lecture Notes in Computer Science(), vol 5212. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-87481-2_26

Download citation

DOI: https://doi.org/10.1007/978-3-540-87481-2_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-87480-5
Online ISBN: 978-3-540-87481-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Classification of Multi-labeled Data: A Generative Approach

Abstract

Chapter PDF

Similar content being viewed by others

Asymptotic analysis of estimators on multi-label data

Large scale multi-label learning using Gaussian processes

Revisiting Machine Learning from Crowds a Mixture Model for Grouping Annotations

Keywords

References

Author information

Authors and Affiliations

Editor information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Classification of Multi-labeled Data: A Generative Approach

Abstract

Chapter PDF

Similar content being viewed by others

Asymptotic analysis of estimators on multi-label data

Large scale multi-label learning using Gaussian processes

Revisiting Machine Learning from Crowds a Mixture Model for Grouping Annotations

Keywords

References

Author information

Authors and Affiliations

Editor information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation