A Novel Weakly Supervised Problem: Learning from Positive-Unlabeled Proportions

Hernández-González, Jerónimo; Inza, Iñaki; Lozano, Jose A.

doi:10.1007/978-3-319-24598-0_1

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 9422)

Included in the following conference series:

Conference of the Spanish Association for Artificial Intelligence

Abstract

Standard supervised classification learns a classifier from a set of labeled examples. In weakly supervised classification, by contrast, several frameworks have been proposed in which the training examples cannot be labeled with certainty. This paper presents the novel problem of learning from positive-unlabeled proportions. The provided examples are unlabeled, and the only class information available is the proportion of positive and unlabeled examples in different subsets of the training dataset. An expectation-maximization method that learns Bayesian network classifiers from this kind of data is proposed. A set of experiments is designed to shed light on how well classifiers can be learned from this kind of data across scenarios of increasing complexity.
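
As a rough illustration of the setting, the sketch below runs expectation-maximization over bags of unlabeled examples. It is not the authors' method: the abstract does not give the algorithm's details. The sketch assumes that each bag comes with a known minimum number of positive examples (one reading of "positive-unlabeled proportions"), uses naive Bayes with binary features as the simplest Bayesian network classifier, and computes the E-step exactly by enumerating the label assignments that satisfy the constraint, which is only practical for small bags. All function and parameter names (`fit_nb`, `joint`, `e_step_bag`, `em_pu_proportions`, `min_pos`) are hypothetical.

```python
import itertools
import numpy as np

def fit_nb(X, w, alpha=1.0):
    """M-step: weighted naive Bayes over binary features.
    w[i] is the current probability that example i is positive."""
    prior = (w.sum() + alpha) / (len(w) + 2 * alpha)
    # theta[c, j] = P(x_j = 1 | y = c), with Laplace smoothing
    theta = np.vstack([
        ((1 - w) @ X + alpha) / ((1 - w).sum() + 2 * alpha),
        (w @ X + alpha) / (w.sum() + 2 * alpha),
    ])
    return prior, theta

def joint(X, prior, theta):
    """Return an (n, 2) array holding P(x_i, y = c) for c in {0, 1}."""
    log_lik = X @ np.log(theta).T + (1 - X) @ np.log(1 - theta).T
    return np.exp(log_lik) * np.array([1 - prior, prior])

def e_step_bag(J, k):
    """P(y_i = 1 | bag, at least k positives), by exact enumeration.
    Assumption: the bag information is a lower bound k on its positives."""
    n = J.shape[0]
    marg, total = np.zeros(n), 0.0
    for labels in itertools.product((0, 1), repeat=n):
        if sum(labels) < k:
            continue  # labeling violates the bag constraint
        lab = np.array(labels)
        p = np.prod(J[np.arange(n), lab])
        total += p
        marg += p * lab
    return marg / total

def em_pu_proportions(bags, min_pos, n_iter=20, seed=0):
    """Alternate the M-step and the constrained E-step over all bags."""
    rng = np.random.default_rng(seed)
    X = np.vstack(bags).astype(float)
    bounds = np.cumsum([0] + [len(b) for b in bags])
    # Start from each bag's known positive fraction, lightly jittered.
    w = np.concatenate([
        np.full(len(b), np.clip(k / len(b), 0.1, 0.9))
        for b, k in zip(bags, min_pos)
    ])
    w = np.clip(w + rng.uniform(-0.05, 0.05, size=w.shape), 0.01, 0.99)
    for _ in range(n_iter):
        prior, theta = fit_nb(X, w)
        J = joint(X, prior, theta)
        w = np.concatenate([
            e_step_bag(J[s:e], k)
            for s, e, k in zip(bounds[:-1], bounds[1:], min_pos)
        ])
    return fit_nb(X, w), w
```

Calling `em_pu_proportions(bags, min_pos)` with a list of binary feature matrices and one lower bound per bag returns the fitted `(prior, theta)` pair and the per-example positive-class weights. Enumeration costs 2^n per bag, so larger bags would need an approximate E-step instead.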

Acknowledgments

This work has been partially supported by the Basque Government (IT609-13) and the Spanish Ministry of Economy and Competitiveness MINECO (TIN2013-41272-P). Jerónimo Hernández-González holds a grant (FPU) from the Spanish Ministry of Education, Culture and Sports.

Author information

Authors and Affiliations

Intelligent Systems Group, University of the Basque Country UPV/EHU, Donostia - San Sebastián, Spain
Jerónimo Hernández-González, Iñaki Inza & Jose A. Lozano

Corresponding author

Correspondence to Jerónimo Hernández-González.

Editor information

Editors and Affiliations

University of Castilla-La Mancha, Albacete, Spain
José M. Puerta
University of Castilla-La Mancha, Albacete, Spain
José A. Gámez
University of Cadiz, Cadiz, Spain
Bernabe Dorronsoro
Public University of Navarre, Pamplona, Spain
Edurne Barrenechea
Pablo de Olavide University, Sevilla, Spain
Alicia Troncoso
Department of Civil Engineering, University of Burgos, Burgos, Spain
Bruno Baruque
Public University of Navarre, Pamplona, Spain
Mikel Galar

About this paper

Cite this paper

Hernández-González, J., Inza, I., Lozano, J.A. (2015). A Novel Weakly Supervised Problem: Learning from Positive-Unlabeled Proportions. In: Puerta, J., et al. Advances in Artificial Intelligence. CAEPIA 2015. Lecture Notes in Computer Science, vol 9422. Springer, Cham. https://doi.org/10.1007/978-3-319-24598-0_1

DOI: https://doi.org/10.1007/978-3-319-24598-0_1
Published: 14 November 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-24597-3
Online ISBN: 978-3-319-24598-0
eBook Packages: Computer Science, Computer Science (R0)