Tractable Semi-supervised Learning of Complex Structured Prediction Models

Chang, Kai-Wei; Sundararajan, S.; Keerthi, S. Sathiya

doi:10.1007/978-3-642-40994-3_12

Kai-Wei Chang²³,
S. Sundararajan²⁴ &
S. Sathiya Keerthi²⁵

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8190))

Included in the following conference series:

Joint European Conference on Machine Learning and Knowledge Discovery in Databases

5970 Accesses
1 Citations

Abstract

Semi-supervised learning has been widely studied in the literature. However, most previous works assume that the output structure is simple enough to allow the direct use of tractable inference/learning algorithms (e.g., binary label or linear chain). Therefore, these methods cannot be applied to problems with complex structure. In this paper, we propose an approximate semi-supervised learning method that uses piecewise training for estimating the model weights and a dual decomposition approach for solving the inference problem of finding the labels of unlabeled data subject to domain specific constraints. This allows us to extend semi-supervised learning to general structured prediction problems. As an example, we apply this approach to the problem of multi-label classification (a fully connected pairwise Markov random field). Experimental results on benchmark data show that, in spite of using approximations, the approach is effective and yields good improvements in generalization performance over the plain supervised method. In addition, we demonstrate that our inference engine can be applied to other semi-supervised learning frameworks, and extends them to solve problems with complex structure.

Download to read the full chapter text

Chapter PDF

Robust and sparse label propagation for graph-based semi-supervised classification

Article 05 July 2021

Zhiwen Hua & Youlong Yang

Hierarchical Multilabel Classification with Optimal Path Prediction

Article 28 April 2016

Zhengya Sun, Yangyang Zhao, … Hongwei Hao

Incremental predictive clustering trees for online semi-supervised multi-target regression

Article Open access 28 October 2020

Aljaž Osojnik, Panče Panov & Sašo Džeroski

References

Brefeld, U., Scheffer, T.: Semi-supervised learning for structured output variables. In: ICML (2006)
Google Scholar
Chang, M.W., Ratinov, L.A., Roth, D.: Structured learning with constrained conditional models. Machine Learning 88(3), 399–431 (2012)
Article MathSciNet MATH Google Scholar
Chang, Y.W., Collins, M.: Exact decoding of phrase-based translation models through lagrangian relaxation. In: EMNLP (2011)
Google Scholar
Chen, G., Song, Y., Wang, F., Zhang, C.: Semi-supervised multi-label learning by solving a sylvester equation. In: SDM, pp. 410–419 (2008)
Google Scholar
Dhillon, P.S., Keerthi, S.S., Bellare, K., Chapelle, O., Sellamanickam, S.: Deterministic annealing for semi-supervised structured output learning. In: AISTATS (2012)
Google Scholar
Finley, T., Joachims, T.: Training structural SVMs when exact inference is intractable. In: ICML, pp. 304–311 (2008)
Google Scholar
Ganchev, K., Graca, J., Gillenwater, J., Taskar, B.: Posterior regularization for structured latent variable models. JMLR 11 (2010)
Google Scholar
Guo, Y., Schuurmans, D.: Semi-supervised multi-label classification - a simultaneous large-margin, subspace learning approach. In: Flach, P.A., De Bie, T., Cristianini, N. (eds.) ECML PKDD 2012, Part II. LNCS, vol. 7524, pp. 355–370. Springer, Heidelberg (2012)
Chapter Google Scholar
Hazan, T., Shashua, A.: Norm-product belief propagation: Primal-dual message-passing for approximate inference. CoRR (2009)
Google Scholar
Hazan, T., Urtasun, R.: Efficient learning of structured predictors in general graphical models. CoRR (2012)
Google Scholar
Huang, S.J., Zhou, Z.H., Zhou, Z.H.: Multi-label learning by exploiting label correlations locally. In: AAAI (2012)
Google Scholar
Joachims, T.: Transductive inference for text classification using support vector machines. In: ICML, pp. 200–209. Morgan Kaufmann (1999)
Google Scholar
Joachims, T., Finley, T., Yu, C.N.J.: Cutting-plane training of structural SVMs. Machine Learning 77(1), 27–59 (2009)
Article MATH Google Scholar
Jojic, V., Gould, S., Koller, D.: Accelerated dual decomposition for MAP inference. In: ICML (2010)
Google Scholar
Komodakis, N.: Efficient training for pairwise or higher order crfs via dual decomposition. In: CVPR (2011)
Google Scholar
Komodakis, N., Paragios, N., Tziritas, G.: MRF energy minimization and beyond via dual decomposition. PAMI 33(3), 531–552 (2011)
Article Google Scholar
Koo, T., Rush, A.M., Collins, M., Jaakkola, T., Sontag, D.: Dual decomposition for parsing with non-projective head automata. In: EMNLP (2010)
Google Scholar
Kulesza, A., Pereira, F.: Structured learning with approximate inference. In: NIPS (2008)
Google Scholar
Lee, C.H., Jiao, F., Wang, S., Schuurmans, D., Greiner, R.: Learning to model spatial dependency: Semi-supervised discriminative random fields. In: NIPS (2006)
Google Scholar
Lindsay, B.G.: Composite likelihood methods. Contemporary Mathematics 80, 221–239 (1988)
Article MathSciNet Google Scholar
Liu, D.C., Nocedal, J.: On the limited memory BFGS method for large scale optimization. Math. Program. 45(3), 503–528 (1989)
Article MathSciNet MATH Google Scholar
Liu, Y., Jin, R., Yang, L.: Semi-supervised multi-label learning by constrained non-negative matrix factorization. In: AAAI, pp. 421–426 (2006)
Google Scholar
Mann, G.S., McCallum, A.: Generalized expectation criteria for semi-supervised learning with weakly labeled data. JMLR 11, 955–984 (2010)
MathSciNet MATH Google Scholar
Martins, A.F.T., Figueiredo, M.A.T., Aguiar, P.M.Q., Smith, N.A., Xing, E.P.: Alternating directions dual decomposition. CoRR (2012)
Google Scholar
Meshi, O., Globerson, A.: An alternating direction method for dual MAP LP relaxation. In: Gunopulos, D., Hofmann, T., Malerba, D., Vazirgiannis, M. (eds.) ECML PKDD 2011, Part II. LNCS, vol. 6912, pp. 470–483. Springer, Heidelberg (2011)
Chapter Google Scholar
Meshi, O., Sontag, D., Jaakkola, T., Globerson, A.: Learning efficiently with approximate inference via dual losses. In: ICML (2010)
Google Scholar
Pearl, J.: Probabilistic reasoning in intelligent systems: networks of plausible inference (1988)
Google Scholar
Pletscher, P., Wulff, S.: LPQP for MAP: Putting LP solvers to better use. In: ICML (2012)
Google Scholar
Samdani, R., Chang, M., Roth, D.: Unified expectation maximization. In: NAACL (2012)
Google Scholar
Samdani, R., Roth, D.: Efficient decomposed learning for structured prediction. In: ICML (2012)
Google Scholar
Seah, C.W., Tsang, I.W., Ong, Y.S.: Transductive ordinal regression. In: TNNLS, pp. 1074–1086 (2012)
Google Scholar
Sutton, C., McCallum, A.: Piecewise training for structured prediction. Machine Learning 77(2-3), 165–194 (2009)
Article Google Scholar
Vedaldi, A.: A MATLAB wrapper of SVM^struct (2011), http://www.vlfeat.org/~vedaldi/code/svm-struct-matlab.html
Xu, L., Wilkinson, D., Schuurmans, D.: Discriminative unsupervised learning of structured predictors. In: ICML (2006)
Google Scholar
Yang, Y.: An evaluation of statistical approaches to text categorization. Information Retrieval 1, 69–90 (1999)
Article Google Scholar
Yu, C.N.: Transductive learning of structural SVMs via prior knowledge constraints. In: AISTATS (2012)
Google Scholar
Yuille, A.L., Rangarajan, A.: The concave-convex procedure. Neural Computation (2003)
Google Scholar
Zha, Z.J., Mei, T., Wang, J., Wang, Z., Hua, X.S.: Graph-based semi-supervised learning with multiple labels. J. Visual Communication and Image Representation 20(2), 97–103 (2009)
Article Google Scholar
Zhang, Y., Schneider, J.: A composite likelihood view for multi-label classification. In: AISTATS (2012)
Google Scholar
Zien, A., Brefeld, U., Scheffer, T.: Transductive support vector machines for structured variables. In: ICML (2007)
Google Scholar

Download references

Author information

Authors and Affiliations

Dept. of Computer Science, University of Illinois at Urbana-Champaign, IL, USA
Kai-Wei Chang
Microsoft Research India, Bangalore, India
S. Sundararajan
Cloud and Information Services Lab, Microsoft, Mountain View, CA, USA
S. Sathiya Keerthi

Authors

Kai-Wei Chang
View author publications
You can also search for this author in PubMed Google Scholar
S. Sundararajan
View author publications
You can also search for this author in PubMed Google Scholar
S. Sathiya Keerthi
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, Katholieke Universiteit Leuven, Celestijnenlaan 200A, 3001, Leuven, Belgium
Hendrik Blockeel
Fraunhofer IAIS, Department of Knowledge Discovery, Schloss Birlinghoven, University of Bonn, 53754, Sankt Augustin, Germany
Kristian Kersting
LIACS, Universiteit Leiden, Niels Bohrweg 1, 2333 CA, Leiden, The Netherlands
Siegfried Nijssen
Department of Computer Science and Engineering, Czech Technical University, Technicka 2, 16627, Prague 6, Czech Republic
Filip Železný

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chang, KW., Sundararajan, S., Keerthi, S.S. (2013). Tractable Semi-supervised Learning of Complex Structured Prediction Models. In: Blockeel, H., Kersting, K., Nijssen, S., Železný, F. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2013. Lecture Notes in Computer Science(), vol 8190. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40994-3_12

Download citation

DOI: https://doi.org/10.1007/978-3-642-40994-3_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40993-6
Online ISBN: 978-3-642-40994-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Tractable Semi-supervised Learning of Complex Structured Prediction Models

Abstract

Chapter PDF

Similar content being viewed by others

Robust and sparse label propagation for graph-based semi-supervised classification

Hierarchical Multilabel Classification with Optimal Path Prediction

Incremental predictive clustering trees for online semi-supervised multi-target regression

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Tractable Semi-supervised Learning of Complex Structured Prediction Models

Abstract

Chapter PDF

Similar content being viewed by others

Robust and sparse label propagation for graph-based semi-supervised classification

Hierarchical Multilabel Classification with Optimal Path Prediction

Incremental predictive clustering trees for online semi-supervised multi-target regression

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation