Constraint Classification: A New Approach to Multiclass Classification

Har-Peled, Sariel; Roth, Dan; Zimak, Dav

doi:10.1007/3-540-36169-3_29

Constraint Classification: A New Approach to Multiclass Classification

Sariel Har-Peled⁴,
Dan Roth⁴ &
Dav Zimak⁴

Conference paper
First Online: 01 January 2002

683 Accesses
78 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2533))

Abstract

In this paper, we present a newviewof multiclass classification and introduce the constraint classification problem, a generalization that captures many flavors of multiclass classification. We provide the first optimal, distribution independent bounds for many multiclass learning algorithms, including winner-take-all (WTA). Based on our view, we present a learning algorithm that learns via a single linear classifier in high dimension. In addition to the distribution independent bounds, we provide a simple margin-based analysis improving generalization bounds for linear multiclass support vector machines.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

M. Anthony and P. Bartlett. Neural Network Learning: Theoretical Foundations. Cambridge University Press, Cambridge, England, 1999.
MATH Google Scholar
Chidanand Apte, Fred Damerau, and Sholom M. Weiss. Automated learning of decision rules for text categorization. Information Systems, 12(3):233–251, 1994.
Google Scholar
E. Allwein, R.E. Schapire, and Y. Singer. Reducing multiclass to binary: A unifying approach for margin classifiers. In Proc. 17th International Conf. on Machine Learning, pages 9–16. Morgan Kaufmann, San Francisco, CA, 2000.
Google Scholar
S. Ben-David, N. Cesa-Bianchi, D. Haussler, and P. Long. Characterizations of learnability for classes of 0,..., n-valued functions. J. Comput. Sys. Sci., 50(1):74–86, 1995.
Article MATH MathSciNet Google Scholar
E. Brill. Some advances in transformation-based part of speech tagging. In AAAI, Vol. 1, pages 722–727, 1994.
Google Scholar
A. Carlson, C. Cumby, J. Rosen, and D. Roth. The SNoW learning architecture. Technical Report UIUCDCS-R-99-2101, UIUC Computer Science Department, May 1999.
Google Scholar
K. Crammer and Y. Singer. On the learnability and design of output codes for multiclass problems. In Computational Learing Theory, pages 35–46, 2000.
Google Scholar
K. Crammer and Y. Singer. On the algorithmic implementation of multiclass kernel-based vector machines. J. Machine Learning Research, 2 (December):265–292, 2001.
Article Google Scholar
K. Crammer and Y. Singer. Ultraconservative online algorithms for multiclass problems. In COLT/EuroCOLT, pages 99–115, 2001.
Google Scholar
Nello Cristianini and John Shawe-Taylor. An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods. Cambridge University Press, 2000.
Google Scholar
I. Dagan, Y. Karov, and D. Roth. Mistake-driven learning in text categorization. In EMNLP-97, The Second Conference on Empirical Methods in Natural Language Processing, pages 55–63, 1997.
Google Scholar
T. Hastie and R. Tibshirani. Classification by pairwise coupling. In NIPS-10, The 1997 Conference on Advances in Neural Information Processing Systems, pages 507–513. MIT Press, 1998.
Google Scholar
F. Jelinek. Statistical Methods for Speech Recognition. The MIT Press, Cambridge, Massachusetts, 1998.
Google Scholar
T. Kohonen. Sel-Organizing Maps. Springer Verlag, NewYork, 3rd edition, 2001.
Google Scholar
Y. Le Cun, B. Boser, J. Denker, D. Hendersen, R. Howard, W. Hubbard, and L. Jackel. Backpropagation applied to handwritten zip code recognition. Neural Computation, 1:pp 541, 1989.
Article Google Scholar
D. Lee and H. Seung. Unsupervised learning by convex and conic coding. In Michael C. Mozer, Michael I. Jordan, and Thomas Petsche, editors, Advances in Neural Information Processing Systems, volume 9, page 515. The MIT Press, 1997.
Google Scholar
W. Maass. On the computational power of winner-take-all. Neural Computation, 12(11):2519–2536, 2000.
Article MathSciNet Google Scholar
D. Roth. Learning to resolve natural language ambiguities: A unified approach. In Proc. of AAAI, pages 806–813, 1998.
Google Scholar
D. Roth and D. Zelenko. Part of speech tagging using a network of linear separators. In COLING-ACL 98, The 17th International Conference on Computational Linguistics, pages 1136–1142, 1998.
Google Scholar
R.E. Schapire. Using output codes to boost multiclass learning problems. In Proc. 14th Internat. Conf. on Machine Learning, pages 313–321. Morgan Kaufmann, 1997.
Google Scholar
V. Vapnik. Statistical Learning Theory. Wiley, 605 Third Avenue, New York, New York, 10158–10212, 1998.
Google Scholar
J. Weston and C. Watkins. Support vector machines for multiclass pattern recognition. In Proceedings of the Seventh European Symposium On Artificial Neural Networks, 4 1999.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, University of Illinois, IL 61801, Urbana
Sariel Har-Peled, Dan Roth & Dav Zimak

Authors

Sariel Har-Peled
View author publications
You can also search for this author in PubMed Google Scholar
Dan Roth
View author publications
You can also search for this author in PubMed Google Scholar
Dav Zimak
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dipartimento di Tecnologie dell’Informazione, Università degli Studi di Milano, via Bramante 65, 26013, Crema (CR), Italy
Nicolò Cesa-Bianchi
Department of Computer Science, Tokyo Institute of Technology, 2-12-1, Ohokayama Meguro Ward, 152-8552, Tokyo, Japan
Masayuki Numao
Institut für Theoretische Informatik, Universität zu Lübeck, Wallstr. 40, 23560, Lübeck, Germany
Rüdiger Reischuk

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Har-Peled, S., Roth, D., Zimak, D. (2002). Constraint Classification: A New Approach to Multiclass Classification. In: Cesa-Bianchi, N., Numao, M., Reischuk, R. (eds) Algorithmic Learning Theory. ALT 2002. Lecture Notes in Computer Science(), vol 2533. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36169-3_29

Download citation

DOI: https://doi.org/10.1007/3-540-36169-3_29
Published: 08 November 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00170-6
Online ISBN: 978-3-540-36169-5
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics