Fast Recognition of Asian Characters Based on Database Methodologies

Loh, Woong-Kee; Park, Young-Ho; Yoon, Yong-Ik

doi:10.1007/978-3-540-73390-4_5

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 4587)

Included in the following conference series:

British National Conference on Databases

Abstract

Character recognition has long been an active research area in pattern recognition. Existing character recognition algorithms focus mainly on increasing the recognition rate. However, as the recent Google Library Project shows, there is a growing need to speed up the recognition of enormous volumes of documents. Moreover, existing algorithms do not pay enough attention to Asian characters. In this paper, we propose an algorithm for fast recognition of Asian characters based on database methodologies. Because the number of Asian characters is very large and their shapes are complicated, recognizing them takes much more time than recognizing numerals and Roman characters. The proposed algorithm extracts a feature from each Asian character using the Discrete Fourier Transform (DFT) and optimizes recognition speed by storing and retrieving the features with a multidimensional index. We further improve the recognition speed using association rules, a widely adopted data mining technique. The proposed algorithm has the advantage that it can be applied regardless of the language, size, and font of the characters to be recognized.
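
The abstract does not spell out how features are computed or which index is used, so the following is only a minimal sketch of the general idea (DFT-based features stored in a multidimensional index and retrieved by nearest-neighbor search), not the authors' implementation. Everything in it is an assumption made for illustration: the toy 5x5 bitmaps and their labels, the hypothetical `glyph_feature` function (DFT magnitudes of row and column ink profiles), and the use of a small k-d tree as a stand-in for whatever multidimensional index the paper adopts.

```python
import cmath
import math

def dft_magnitudes(signal, k):
    """Magnitudes of DFT coefficients 1..k of a real sequence (DC term skipped)."""
    n = len(signal)
    return [abs(sum(x * cmath.exp(-2j * math.pi * f * t / n)
                    for t, x in enumerate(signal)))
            for f in range(1, k + 1)]

def glyph_feature(bitmap, k=2):
    """Hypothetical feature: DFT magnitudes of the row and column ink
    profiles, divided by the total ink count (an assumed normalization)."""
    rows = [sum(r) for r in bitmap]
    cols = [sum(c) for c in zip(*bitmap)]
    total = sum(rows) or 1
    return tuple(m / total
                 for m in dft_magnitudes(rows, k) + dft_magnitudes(cols, k))

def build_kdtree(items, depth=0):
    """Build a k-d tree from (feature, label) pairs; returns nested tuples."""
    if not items:
        return None
    axis = depth % len(items[0][0])
    items = sorted(items, key=lambda it: it[0][axis])
    mid = len(items) // 2
    return (items[mid], axis,
            build_kdtree(items[:mid], depth + 1),
            build_kdtree(items[mid + 1:], depth + 1))

def nearest(node, query, best=None):
    """Return (squared_distance, label) of the stored feature closest to query."""
    if node is None:
        return best
    (point, label), axis, left, right = node
    dist = sum((a - b) ** 2 for a, b in zip(point, query))
    if best is None or dist < best[0]:
        best = (dist, label)
    diff = query[axis] - point[axis]
    near, far = (left, right) if diff < 0 else (right, left)
    best = nearest(near, query, best)
    if diff ** 2 < best[0]:  # the other side may still hold a closer point
        best = nearest(far, query, best)
    return best

# Toy 5x5 glyphs (assumed data): a horizontal bar, a cross, and a box.
GLYPHS = {
    "bar":   [[0,0,0,0,0],[0,0,0,0,0],[1,1,1,1,1],[0,0,0,0,0],[0,0,0,0,0]],
    "cross": [[0,0,1,0,0],[0,0,1,0,0],[1,1,1,1,1],[0,0,1,0,0],[0,0,1,0,0]],
    "box":   [[1,1,1,1,1],[1,0,0,0,1],[1,0,0,0,1],[1,0,0,0,1],[1,1,1,1,1]],
}
tree = build_kdtree([(glyph_feature(b), name) for name, b in GLYPHS.items()])

# A cross with one stray pixel in the top-left corner, used as a noisy query.
noisy_cross = [[1,0,1,0,0],[0,0,1,0,0],[1,1,1,1,1],[0,0,1,0,0],[0,0,1,0,0]]
print(nearest(tree, glyph_feature(noisy_cross))[1])
```

The index is built once over all known glyphs, so each recognition becomes a single nearest-neighbor query rather than a comparison against every stored character, which is where a speed-up over a linear scan would come from as the character set grows.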

Author information

Authors and Affiliations

Department of Computer Science & Engineering, University of Minnesota, 200 Union Street SE, Minneapolis, MN 55455, USA
Woong-Kee Loh
Department of Multimedia Science, Sookmyung Women’s University, 53-12 Chungpa-Dong, Yongsan-gu, Seoul 140-742, Korea
Young-Ho Park & Yong-Ik Yoon

Editor information

Richard Cooper, Jessie Kennedy

About this paper

Cite this paper

Loh, WK., Park, YH., Yoon, YI. (2007). Fast Recognition of Asian Characters Based on Database Methodologies. In: Cooper, R., Kennedy, J. (eds) Data Management. Data, Data Everywhere. BNCOD 2007. Lecture Notes in Computer Science, vol 4587. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73390-4_5

DOI: https://doi.org/10.1007/978-3-540-73390-4_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73389-8
Online ISBN: 978-3-540-73390-4
eBook Packages: Computer Science (R0)
