A Two-View CoTraining Rule Induction System for Information Extraction

  • Jing Xiao
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4114)


Information extraction is becoming an important task due to the vast growth of online text. Pattern rule induction is one of the main approaches to information extraction, but manually constructing pattern rules is tedious and error-prone. In this paper, we present GRID_CoTrain, a weakly supervised paradigm that bootstraps GRID (a supervised rule induction system) with co-training and active learning. We also use external knowledge resources, such as WordNet and existing ontology knowledge, to optimize the learned pattern rules.
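The co-training idea named in the abstract can be illustrated with a generic two-view loop. The sketch below is not GRID_CoTrain and does not reproduce GRID's rule induction; it only assumes the standard co-training setup, in which each example has two views (here, hypothetically, the tokens of a candidate phrase and the tokens of its context), and a learner trained on one view labels unlabeled examples for the shared pool. The toy learner `TokenVoteClassifier`, the function `co_train`, the threshold, and the sample data are all illustrative assumptions.

```python
from collections import Counter, defaultdict


class TokenVoteClassifier:
    """Toy stand-in for a rule learner: each token votes for the labels it was seen with."""

    def fit(self, examples, labels):
        self.votes = defaultdict(Counter)
        for tokens, label in zip(examples, labels):
            for tok in tokens:
                self.votes[tok][label] += 1
        return self

    def predict(self, tokens):
        """Return (label, confidence), or (None, 0.0) if no token is known."""
        tally = Counter()
        for tok in tokens:
            tally.update(self.votes.get(tok, {}))
        if not tally:
            return None, 0.0
        label, count = tally.most_common(1)[0]
        return label, count / sum(tally.values())


def co_train(labeled, unlabeled, rounds=5, per_round=1, threshold=0.9):
    """Generic two-view co-training (assumed setup, not the paper's algorithm).

    labeled:   list of ((view_a, view_b), label)
    unlabeled: list of (view_a, view_b)
    Returns the grown labeled set and the examples that stayed unlabeled.
    """
    labeled, pool = list(labeled), list(unlabeled)
    for _ in range(rounds):
        if not pool:
            break
        newly = {}  # pool index -> label proposed by one of the views
        for view in (0, 1):
            clf = TokenVoteClassifier().fit(
                [x[view] for x, _ in labeled], [y for _, y in labeled])
            scored = []
            for i, x in enumerate(pool):
                label, conf = clf.predict(x[view])
                if label is not None and conf >= threshold:
                    scored.append((conf, i, label))
            scored.sort(key=lambda s: (-s[0], s[1]))
            for _, i, label in scored[:per_round]:
                newly.setdefault(i, label)  # first view to claim an example wins
        if not newly:
            break  # neither view is confident about anything left
        labeled.extend((pool[i], lab) for i, lab in newly.items())
        pool = [x for i, x in enumerate(pool) if i not in newly]
    return labeled, pool


# Hypothetical seed data: view A = candidate phrase, view B = surrounding context.
seed = [
    (({"dr", "smith"}, {"speaker", "is"}), "speaker"),
    (({"room", "101"}, {"held", "in"}), "location"),
]
raw = [
    ({"dr", "jones"}, {"talk", "by"}),   # view A recognizes "dr"
    ({"hall", "b"}, {"held", "in"}),     # view B recognizes "held in"
    ({"prof", "lee"}, {"talk", "by"}),   # learnable only after "talk by" is labeled
    ({"xyz"}, {"qqq"}),                  # no evidence in either view
]
grown, leftover = co_train(seed, raw)
```

In this sketch, the examples left in `leftover` are the ones neither view could label with confidence. In a setup that adds active learning, such examples would be natural candidates to hand to a human annotator, although how GRID_CoTrain selects its queries is not stated in the abstract.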


Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Jing Xiao (1)
  1. Department of Computer Science, Sun Yat-Sen University, Guangzhou 510275, China
