Skip to main content

Regularized Classifiers for Information Retrieval

  • Conference paper
Advances in Artificial Intelligence (Canadian AI 2005)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3501))

  • 1172 Accesses

Abstract

We study a class of binary regularized least-squares classifiers (RLSC) for information retrieval tasks whose training involve the solution of a unique linear system of equations. Any implementation of RLSC algorithms face two major difficulties: the large size and the density of the Gram matrix. In this paper, we present a numerical investigation of an implementation based on the preconditioned conjugate gradient and introduce a novel reduced RBF kernel which is shown to improve the sparseness of the system.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Hoerl, A.E., Kennard, R.W.: Ridge regression: biased estimation for nonorthogonal problems. Technometrics 12(1), 55–67 (1970)

    Article  MATH  MathSciNet  Google Scholar 

  2. Joachims, T.: Text categorization with support vector machines: Learning with many relevant features. In: Nédellec, C., Rouveiol, C. (eds.) Proceedings of the European Conference on Machine Learning, pp. 137–142. Springer, Berlin (1998)

    Google Scholar 

  3. Lin, C.-J., Saigal, R.: An Incomplete Cholesky Factorization for Dense Matrices. BIT 40, 536–558 (2000)

    Article  MATH  MathSciNet  Google Scholar 

  4. Mukherjee, S., Niyogi, P., Rifkin, R., Poggio, T.: Statistical Learning: CVloostability is sufficient for generalization and necessary and sufficient for consistency of Empirical Risk Minimization, AI memo (2002)

    Google Scholar 

  5. Platt, J.C.: Sequential minimal optimization: A fast algorithm for training support vector machines. Technical Report MSR-TR-98-14, Microsoft Research (1998)

    Google Scholar 

  6. Poggio, T., Smale, S.: The Mathematics of Learning: Dealing with Data. Notices of the AMS (May 2003)

    Google Scholar 

  7. Rennie, J.D.M.: Using Part-of-Speech Information for Transfer in Text Classification (December 2003)

    Google Scholar 

  8. Rifkin, R.: Everything Old Is New Again: A Fresh Look at Historical Approaches in Machine Learning. PhD thesis, Massachusetts Institute of Technology (2002)

    Google Scholar 

  9. Rifkin, R., Klautau, A.: Defense of One-Vs-All Classification. Journal of Machine Learning Research 5, 101–141 (2004)

    MathSciNet  Google Scholar 

  10. Vapnik, V.: The Nature of Statistical Learning Theory. Springer, New York (1995)

    MATH  Google Scholar 

  11. Lapack library, http://www.csit.fsu.edu/~burkardt/f_src/lapack/lapack.html

  12. MNIST-Yann LeCun, http://yann.lecun.com/exdb/mnist/index.html

  13. Reuters-21578 Corpus, http://www.att.research.com/~lewis/reuters21578.html

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Brahmi, A., Ech-Cherif, A. (2005). Regularized Classifiers for Information Retrieval. In: Kégl, B., Lapalme, G. (eds) Advances in Artificial Intelligence. Canadian AI 2005. Lecture Notes in Computer Science(), vol 3501. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11424918_46

Download citation

  • DOI: https://doi.org/10.1007/11424918_46

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-25864-3

  • Online ISBN: 978-3-540-31952-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics