Skip to main content

Efficient Index for Handwritten Text

  • Conference paper
  • 480 Accesses

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 60))

Abstract

This paper deals with one of the new emerging multimedia data types, namely, handwritten cursive text. The paper presents two indexing methods for searching a collection of cursive handwriting. The first index, word-level index, treats word as pictogram and uses global features for retrieval. The word-level index is suitable for large collection of cursive text. While the second one, called stroke-level index, treats the word as a set of strokes. The stroke-level index is more accurate, but more costly than the word level index. Each word (or stroke) can be described with a set of features and, thus, can be stored as points in the feature space. The Karhunen-Loeve transform is then used to minimize the number of features used (data dimensionality) and thus the index size. Feature vectors are stored in an R-tree. We implemented both indexes and carried many simulation experiments to measure the effectiveness and the cost of the search algorithm. The proposed indexes achieve substantial saving in the search time over the sequential search. Moreover, the proposed indexes improve the matching rate up to 46% over the sequential search.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Beckmann, N., et al.: The R*-tree: an efficient and robust access method for points and rectangles. In: Proc. of ACM SIGMOD (1990)

    Google Scholar 

  2. Jawahar, C.V., Balasubramanian, A., Meshesha, M., Namboodiri, A.: Retrieval of online handwriting by synthesis and matching. Pattern Recognition 42(7) (2009)

    Google Scholar 

  3. Roussopoulos, N., Leifker, D.: Direct spatial search on pictorial databases using packed r-trees. In: ACM SIGMOD, pp. 17–31 (1985)

    Google Scholar 

  4. Wang, J., Wu, C., Xu, Y., Shum, H.: Combining shape and physical models for on-line cursive handwriting synthesis. International Journal of Document Analysis and Recognition 7(4), 219–227 (2005)

    Article  Google Scholar 

  5. Gersha, A., Gray, R.: Vector Quantization and Signal Compression. Kluwer Academic, Dordrecht (1992)

    Google Scholar 

  6. Guttman, A.: R-trees: a dynamic index structure for spatial searching. In: Proc. of ACM SIGMOD (1984)

    Google Scholar 

  7. Jain, A., Namboodiri, A.: Indexing and Retrieval of On-line Handwritten Documents. In: Proc. of the 7th International Conference on Document Analysis and Recognition, p. 655 (2003)

    Google Scholar 

  8. Oda, H., Kitadai, A., Onuma, M., Nakagawa, M.: A Search Method for On-Line Handwritten Text Employing Writing-Box-Free Handwriting Recognition. In: Proc. of the 9th International Workshop on Frontiers in Handwriting Recognition, pp. 545–550 (2004)

    Google Scholar 

  9. Kamel, I., Faloutsos, C.: On packing Rtrees. In: Proc. of CIKM (1993)

    Google Scholar 

  10. Kamel, I., Faloutsos, C.: Hilbert R-tree: an improved r-tree using fractals. In: VLDB (1994)

    Google Scholar 

  11. Zheng, Y., Doermann, D.: Handwriting Matching and Its Application to Handwriting Synthesis. In: Proceedings of the 8th International Conference on Document Analysis and Recognition, pp. 861–865 (2005)

    Google Scholar 

  12. Ma, Y., Zhang, C.: Retrieval of cursive Chinese handwritten annotations based on radical model United States Patent 6681044 (2004)

    Google Scholar 

  13. Lopresti, D., Tomkins, A.: Pictographic naming. In: Tech. Rep. MITL- TR-21-92, Matsushita Information Technology Lab

    Google Scholar 

  14. Lopresti, D., Tomkins, A.: On the searchability of electronic ink. In: Tech. Rep. MITL- TR-114-94, Matsushita Information Technology Lab

    Google Scholar 

  15. Rubine, D.: The automatic recognition of gestures. In: PhD thesis, Carnegie Mellon University (1991)

    Google Scholar 

  16. Gatos, B., Pratikakis, I., Perantonis, S.J.: Hybrid Off-Line Cursive Handwriting Word Recognition. Pattern Recognition 2, 998–1002 (2006)

    Google Scholar 

  17. Al Aghbari, Z., Brook, S.: HAH manuscripts: A holistic paradigm for classifying and retrieving historical Arabic handwritten documents. Journal of Expert Systems with Applications (2009)

    Google Scholar 

  18. Wagner, R., Fisher, M.: The string-to-string correction problem. Journal of ACM 21, 168–173 (1974)

    Article  MATH  Google Scholar 

  19. Varga, T., Bunke, H.: Generation of Synthetic Training Data for an HMM-based Handwriting Recognition System. In: Proc. of the 7th International Conference on Document Analysis and Recognition, p. 618 (2003)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kamel, I. (2009). Efficient Index for Handwritten Text. In: Ślęzak, D., Grosky, W.I., Pissinou, N., Shih, T.K., Kim, Th., Kang, BH. (eds) Multimedia, Computer Graphics and Broadcasting. MulGraB 2009. Communications in Computer and Information Science, vol 60. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-10512-8_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-10512-8_4

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-10511-1

  • Online ISBN: 978-3-642-10512-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics