Knowledge-Based Partial Matching: An Efficient Form Classification Method

  • Yungcheol Byun
  • Joongbae Kim
  • Yeongwoo Choi
  • Gyeonghwan Kim
  • Yillbyung Lee
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2390)


An efficient method of classifying form is proposed in this paper. Our method identifies a small number of matching areas by their distinctive images with respect to their layout structure and then form classification is performed by matching only these local regions. The process is summarized as follows. First, the form is partitioned into rectangular regions along the locations of lines of the forms. The disparity in each partitioned region of the comparing form images is measured. The penalty for each partitioned area is computed by using the pre-printed text, filled-in data, and the size of a partitioned area. The disparity and penalty are considered to compute the score to select final matching areas. By using our approach, the redundant matching areas are not processed and a feature vector of good quality can be extracted.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. [1]
    T. Watanabe, Document Analysis and Recogntion, IEICE Trans. Inf. & Syst., Vol. E82-D, No. 3, pp. 601–610, 1999.Google Scholar
  2. [2]
    T. Sobue and T. Watanabe, Identification of Item Fields in Table-form Documents with/without Line Segments, MVA, pp. 522–525, 1996.Google Scholar
  3. [3]
    R. G. Casey, D. R. Ferguson, K. Mohiuddin and E. Walach, Intelligent forms processing system, MVA, Vol. 5, pp. 511–529, 1992.Google Scholar
  4. [4]
    J. Mao, M. Abayan, and K. Mohiuddin, A Model-Based Form Processing Sub-System, ICDAR, pp. 691–695, 1996.Google Scholar
  5. [5]
    S. W. Lam, L. Javanbakht, and S. N. Srihari, Anatomy of a form reader, ICDAR, pp. 506–509, 1993.Google Scholar
  6. [6]
    A. Ting, M. K. Leung, S.-C. H, and K.-Y. Chan, A Syntactic Business Form Classifier, ICDAR, pp. 301–304, 1995.Google Scholar
  7. [7]
    S. L. Taylor, R. Fritzson, and J. A. Pastor, Extraction of data from preprinted forms, MVA, Vol. 5, pp. 211–222, 1992.CrossRefGoogle Scholar
  8. [8]
    Y. Ishitani, Model Matching Based on Association Graph for Form Image Understanding, ICDAR, pp. 287–292, 1995.Google Scholar
  9. [9]
    P. Heroux, S. Diana. A. Ribert, and E. Trupin, Classification Method Study for Automatic Form Class Identification, IWFHR, pp. 926–928, 1998.Google Scholar
  10. [10]
    C. L. Yu, Y. Y. Tang, and C. Y. Suen, Document Architecture Language Approach to Document Processing, ICDAR, pp. 103–106, 1993.Google Scholar
  11. [11]
    R. Lorie, A System for exploiting Syntactic and Semantic Knowledge, DAS, pp. 277–294, 1994.Google Scholar
  12. [12]
    Y. Byun and Y. Lee, Efficient Form Processing Methods for Various Kinds of Form Documents, DAS, pp. 153–156, 1998.Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2002

Authors and Affiliations

  • Yungcheol Byun
    • 1
  • Joongbae Kim
    • 1
  • Yeongwoo Choi
    • 2
  • Gyeonghwan Kim
    • 3
  • Yillbyung Lee
    • 4
  1. 1.Dept. of ECElectronics and Telecommunications Research InstituteKorea
  2. 2.Dept. of Computer ScienceSookmyung Women’s UniversityKorea
  3. 3.Dept. of Electronic EngineeringSogang UniversityKorea
  4. 4.Dept. of Computer ScienceYonsei UniversityKorea

Personalised recommendations