Learning from Crowds in Multi-dimensional Classification Domains

Hernández-González, Jerónimo; Inza, Iñaki; Lozano, José A.

doi:10.1007/978-3-642-40643-0_36


Conference paper


Part of the book series: Lecture Notes in Computer Science (LNAI, volume 8109)

Abstract

Learning from crowds is a recently introduced supervised classification framework in which the true labels of the training instances are not available. Instead, each instance comes with a set of noisy class labels, each reflecting the subjective opinion of one annotator about the class membership of the instance. This paper explores the additional challenges that arise when this framework is extended to the multi-label domain, and presents a solution that combines a Structural EM strategy with multi-dimensional Bayesian network classifiers.

Using real multi-label datasets adapted to the crowd framework, the experiments aim to shed some light on the limits of learning to classify from multiple and imprecise supervision information.
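
To make the data setting concrete, the following is a minimal sketch of crowd annotations in a multi-label problem, aggregated by a per-label majority vote. This is only a simple baseline for illustration, not the Structural EM approach presented in the paper. The function name `majority_vote`, the array layout (instances x annotators x labels), the binary encoding, and the toy data are all assumptions made for this example.

```python
import numpy as np


def majority_vote(annotations):
    """Aggregate crowd labels with a per-label majority vote (baseline only).

    annotations: array-like of shape (n_instances, n_annotators, n_labels)
    holding 0/1 entries, where entry [i, a, l] is annotator a's opinion on
    whether label l applies to instance i. This layout is an assumption.
    """
    annotations = np.asarray(annotations)
    # Fraction of annotators that marked each label as present.
    vote_share = annotations.mean(axis=1)
    # A label is kept when a strict majority voted for it; an exact tie
    # resolves to 0, which is an arbitrary choice for this sketch.
    consensus = (vote_share > 0.5).astype(int)
    return consensus, vote_share


# Hypothetical toy data: 2 instances, 3 annotators, 2 binary class variables.
toy = np.array([
    [[1, 0], [1, 1], [0, 0]],
    [[0, 1], [0, 1], [1, 1]],
])
consensus, vote_share = majority_vote(toy)
print(consensus)
print(vote_share)
```

A vote of this kind treats every annotator as equally reliable and every class variable as independent. Those are the two simplifications that a model-based approach, such as the one described in the abstract, is meant to relax.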



Author information

Authors and Affiliations

Intelligent Systems Group, University of the Basque Country UPV/EHU, Spain
Jerónimo Hernández-González, Iñaki Inza & José A. Lozano


Editor information

Editors and Affiliations

Universidad Politécnica de Madrid, 28660, Madrid, Spain
Concha Bielza
Universidad de Almería, 04120, Almería, Spain
Antonio Salmerón
Universidad de A Coruña, 15071, A Coruña, Spain
Amparo Alonso-Betanzos
Universidad Complutense de Madrid, 28040, Madrid, Spain
J. Ignacio Hidalgo
Universidad de Jaén, 23071, Jaén, Spain
Luis Martínez
Universidad Pablo de Olavide, 41013, Sevilla, Spain
Alicia Troncoso
Universidad de Salamanca, 37008, Salamanca, Spain
Emilio Corchado & Juan M. Corchado


About this paper

Cite this paper

Hernández-González, J., Inza, I., Lozano, J.A. (2013). Learning from Crowds in Multi-dimensional Classification Domains. In: Bielza, C., et al. Advances in Artificial Intelligence. CAEPIA 2013. Lecture Notes in Computer Science, vol 8109. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40643-0_36

DOI: https://doi.org/10.1007/978-3-642-40643-0_36
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40642-3
Online ISBN: 978-3-642-40643-0
eBook Packages: Computer Science, Computer Science (R0)
