Advertisement

Enhancing SVMs for Text Classification

  • Catarina Silva
  • Bernardete Ribeiro
Part of the Studies in Computational Intelligence book series (SCI, volume 255)

Abstract

The previous chapter introduced kernel-based techniques and their baseline application to text classification. In this chapter we develop and explore learning techniques that integrate knowledge in the classification task to improve the performance of support vector machines (SVMs) in text classification applications.

The introduction of unlabeled data in the learning stage is investigated. With the deluge of digital text data, unlabeled texts are ubiquitous. Whether it is the Internet, email servers, database files or plain file systems, the sources for digital texts are countless. However, such texts are usually unlabeled, and their labeling is mostly manual and costly. Therefore, a research field on the study and use of these unlabeled texts has been emerging. It is further exploited the potential of using several learning machines organized in a committee. Knowing that there is no unique classifier that suits all situations, the focus is on using the diversity of classifiers to enhance performance.

Keywords

Active Learning Background Knowledge Unlabeled Data Small Split Digital Text 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Catarina Silva
    • Bernardete Ribeiro

      There are no affiliations available

      Personalised recommendations