Datum-Wise Classification: A Sequential Approach to Sparsity

Dulac-Arnold, Gabriel; Denoyer, Ludovic; Preux, Philippe; Gallinari, Patrick

doi:10.1007/978-3-642-23780-5_34

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 6911)

Included in the following conference series:

Joint European Conference on Machine Learning and Knowledge Discovery in Databases

Abstract

We propose a novel classification technique that selects an appropriate representation for each datapoint, in contrast to the usual approach of selecting a single representation for the whole dataset. This datum-wise representation is found by minimizing a sparsity-inducing empirical risk, which is a relaxation of the standard L₀-regularized risk. The classification problem is modeled as a sequential decision process that chooses, for each datapoint, which features to use before classifying. Datum-Wise Classification extends naturally to multi-class tasks, and we describe a specific case in which inference has complexity equivalent to that of a traditional linear classifier while still using a variable number of features. We compare our classifier to classical L₁-regularized linear models (L₁-SVM and LARS) on a set of common binary and multi-class datasets and show that, for an equal average number of features used, our method achieves improved performance.
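
The sequential view described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the interface `score_fn`, the toy policy `toy_score_fn`, the helper `datumwise_risk`, and the penalty weight `lam` are all hypothetical names introduced here, and the risk shown (0/1 loss plus a per-datapoint count of acquired features) is only one plausible reading of a datum-wise sparsity penalty. At each step the policy either acquires one more feature or emits a class label, so different datapoints may end up using different numbers of features.

```python
import numpy as np


def datumwise_predict(x, score_fn, n_classes, max_steps=None):
    """Greedily acquire features for ONE datapoint, then classify it.

    score_fn(observed, mask) is a hypothetical interface returning one score
    per action: first n_features "acquire feature j" actions, then n_classes
    "predict class c" actions. Returns (predicted_class, mask_of_used_features).
    """
    n_features = x.shape[0]
    mask = np.zeros(n_features, dtype=bool)  # which features have been read
    if max_steps is None:
        max_steps = n_features
    for _ in range(max_steps):
        observed = np.where(mask, x, 0.0)  # unread features are hidden as 0
        scores = np.asarray(score_fn(observed, mask), dtype=float).copy()
        scores[:n_features][mask] = -np.inf  # a feature cannot be read twice
        action = int(np.argmax(scores))
        if action >= n_features:  # a classification action ends the episode
            return action - n_features, mask
        mask[action] = True
    # Acquisition budget exhausted: force a classification action.
    scores = np.asarray(score_fn(np.where(mask, x, 0.0), mask), dtype=float)
    return int(np.argmax(scores[n_features:])), mask


def datumwise_risk(X, y, score_fn, n_classes, lam):
    """Assumed risk: mean of (0/1 loss + lam * number of features used)."""
    total = 0.0
    for x_i, y_i in zip(X, y):
        pred, mask = datumwise_predict(x_i, score_fn, n_classes)
        total += float(pred != y_i) + lam * int(mask.sum())
    return total / len(X)


def toy_score_fn(observed, mask):
    """Hand-written toy policy for 2 features and 2 classes (illustration only)."""
    scores = np.zeros(4)
    if not mask[0]:
        scores[0] = 1.0  # always read feature 0 first
    elif observed[0] > 0:
        scores[3] = 1.0  # feature 0 is decisive: predict class 1 immediately
    elif not mask[1]:
        scores[1] = 1.0  # otherwise read feature 1 as well
    else:
        scores[2 + int(observed[1] > 0)] = 1.0  # decide from the sign of feature 1
    return scores


X = np.array([[2.0, -1.0], [-1.0, 3.0], [-1.0, -3.0]])
y = np.array([1, 1, 0])
for x_i in X:
    pred, mask = datumwise_predict(x_i, toy_score_fn, n_classes=2)
    print(pred, int(mask.sum()))  # predicted class, number of features used
risk = datumwise_risk(X, y, toy_score_fn, n_classes=2, lam=0.1)
```

In this toy run the first datapoint is classified after reading a single feature, while the other two require both, which is the kind of per-datapoint sparsity the abstract refers to. A learned policy would replace the hand-written `toy_score_fn`.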

This work was partially supported by the French National Agency of Research (Lampada ANR-09-EMER-007).

Author information

Authors and Affiliations

Université Pierre et Marie Curie - UPMC, LIP6 Case 169, 4 Place Jussieu, 75005, Paris, France
Gabriel Dulac-Arnold, Ludovic Denoyer & Patrick Gallinari
LIFL (UMR CNRS) & INRIA Lille Nord-Europe Université de Lille, Villeneuve d’Ascq, France
Philippe Preux

Editor information

Editors and Affiliations

Department of Informatics and Telecommunications, University of Athens, Panepistimioupolis, Ilisia, 15784, Athens, Greece
Dimitrios Gunopulos
Google Switzerland GmbH, Brandschenkestrasse 110, 8002, Zurich, Switzerland
Thomas Hofmann
Department of Computer Science, University of Bari “Aldo Moro”, via Orabona 4, 70125, Bari, Italy
Donato Malerba
Department of Informatics, Athens University of Economics and Business, Patision 76, 10434, Athens, Greece
Michalis Vazirgiannis

About this paper

Cite this paper

Dulac-Arnold, G., Denoyer, L., Preux, P., Gallinari, P. (2011). Datum-Wise Classification: A Sequential Approach to Sparsity. In: Gunopulos, D., Hofmann, T., Malerba, D., Vazirgiannis, M. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2011. Lecture Notes in Computer Science, vol 6911. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23780-5_34

DOI: https://doi.org/10.1007/978-3-642-23780-5_34
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23779-9
Online ISBN: 978-3-642-23780-5
eBook Packages: Computer Science, Computer Science (R0)
