Knowledge-Based Partial Matching: An Efficient Form Classification Method

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2390)

Abstract

This paper proposes an efficient method for classifying forms. The method identifies a small number of matching areas whose images are distinctive with respect to the layout structure, and then classifies a form by matching only these local regions. The process is as follows. First, each form is partitioned into rectangular regions along the locations of its lines. Next, the disparity between the form images being compared is measured in each partitioned region. A penalty is then computed for each partitioned area from its pre-printed text, its filled-in data, and its size. Finally, the disparity and the penalty are combined into a score that is used to select the final matching areas. With this approach, redundant matching areas are not processed and a good-quality feature vector can be extracted.
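The selection step described in the abstract can be illustrated with a minimal sketch. The abstract does not give the actual formulas, so everything below is an assumption made for illustration: the `Region` fields, the weighted-sum penalty, the weights, the `score = disparity - penalty` combination, and the top-k selection are hypothetical and are not the authors' definitions.

```python
from dataclasses import dataclass


@dataclass
class Region:
    """One rectangular region of a partitioned form (hypothetical fields)."""
    name: str
    disparity: float     # how much this region differs across form types, 0..1
    text_ratio: float    # share of the region covered by pre-printed text, 0..1
    filled_ratio: float  # share of the region expected to hold filled-in data, 0..1
    area: float          # region area as a fraction of the whole form, 0..1


def penalty(r: Region, w_text=0.2, w_filled=0.5, w_area=0.3) -> float:
    # Assumed form: a weighted sum of the three factors named in the abstract.
    # The weights and the direction of each term are illustrative only.
    return w_text * r.text_ratio + w_filled * r.filled_ratio + w_area * r.area


def score(r: Region) -> float:
    # Assumed combination: reward disparity, subtract the penalty.
    return r.disparity - penalty(r)


def select_matching_areas(regions, k):
    # Keep the k highest-scoring regions as the final matching areas.
    return sorted(regions, key=score, reverse=True)[:k]


regions = [
    Region("header", disparity=0.9, text_ratio=0.3, filled_ratio=0.0, area=0.1),
    Region("body", disparity=0.5, text_ratio=0.1, filled_ratio=0.8, area=0.5),
    Region("footer", disparity=0.6, text_ratio=0.2, filled_ratio=0.1, area=0.1),
]

chosen = select_matching_areas(regions, k=2)
print([r.name for r in chosen])  # ['header', 'footer']
```

In this toy input the large, mostly filled-in "body" region scores lowest and is skipped, which mirrors the stated goal of not processing redundant matching areas.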

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Byun, Y., Kim, J., Choi, Y., Kim, G., Lee, Y. (2002). Knowledge-Based Partial Matching: An Efficient Form Classification Method. In: Blostein, D., Kwon, YB. (eds) Graphics Recognition Algorithms and Applications. GREC 2001. Lecture Notes in Computer Science, vol 2390. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45868-9_3

  • DOI: https://doi.org/10.1007/3-540-45868-9_3

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-44066-6

  • Online ISBN: 978-3-540-45868-5

  • eBook Packages: Springer Book Archive
