A New Text Detection Algorithm for Content-Oriented Line Drawing Image Retrieval

Zhang, Zhenyu; Lu, Tong; Su, Feng; Yang, Ruoyu

doi:10.1007/978-3-642-15702-8_31

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 6297)

Included in the following conference series:

Pacific-Rim Conference on Multimedia

Abstract

Content retrieval from scanned line drawing images is a difficult problem, especially for real-life large-scale databases. Existing algorithms are inefficient because they first recognize various types of graphical primitives and only then the content-oriented texts. This paper proposes a new method for detecting text directly in line drawing images. We first decompose a drawing image into a set of Local Consecutive Segments (LCSs). An LCS is defined as a minimum meaningful structural unit that imitates a stroke in the human drawing process. Next, we identify candidate character LCSs by statistical analysis and merge them into character LCS blocks by geometrical analysis. Finally, Hough transforms are applied to calculate the orientations of the character LCS blocks and generate candidate strings. Experimental results show that our algorithm detects strings in any orientation. The method is robust to text-graphic touching, scanning degradation and drawing noise, providing an efficient approach to content retrieval of document images.
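
The abstract does not give the details of the Hough step, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes each character LCS block has already been reduced to a centroid point and uses a standard point-based Hough vote, with rho = x·cos(theta) + y·sin(theta), to find the direction of the line supported by the most centroids. The function name `dominant_orientation` and the parameters `angle_step_deg` and `rho_step` are hypothetical and chosen for illustration.

```python
import math
from collections import Counter


def dominant_orientation(points, angle_step_deg=1, rho_step=2.0):
    """Return (direction_deg, votes) for the line supported by the most points.

    points          -- iterable of (x, y) block centroids (assumed input)
    angle_step_deg  -- resolution of the normal angle theta, in degrees
    rho_step        -- width of a distance bin, in pixels
    """
    votes = Counter()
    for theta_deg in range(0, 180, angle_step_deg):
        theta = math.radians(theta_deg)
        cos_t, sin_t = math.cos(theta), math.sin(theta)
        for x, y in points:
            # Signed distance from the origin to the line through (x, y)
            # whose normal makes the angle theta with the x-axis.
            rho = x * cos_t + y * sin_t
            votes[(theta_deg, round(rho / rho_step))] += 1

    # The fullest accumulator cell identifies the best-supported line.
    (best_theta, _), count = max(votes.items(), key=lambda kv: kv[1])

    # theta is the angle of the line's normal; the line itself is rotated 90 degrees.
    return (best_theta + 90) % 180, count


if __name__ == "__main__":
    # Four centroids on a 45-degree diagonal plus one unrelated outlier.
    centroids = [(0, 0), (50, 50), (100, 100), (150, 150), (20, 90)]
    print(dominant_orientation(centroids))
```

With the sample centroids above the call returns `(45, 4)`: four blocks vote for a string running at 45 degrees and the outlier is ignored. A coarser `rho_step` tolerates more jitter in the centroids, at the cost of merging nearby parallel strings into one cell.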

Author information

Authors and Affiliations

State Key Lab. for Novel Software Technology, Nanjing University, China
Zhenyu Zhang, Tong Lu, Feng Su & Ruoyu Yang
Jiangyin Institute of Information Technology of Nanjing University, China
Tong Lu

Editor information

Editors and Affiliations

School of Computer Science, University of Nottingham, Jubilee Campus, NG8 1BB, Nottingham, UK
Guoping Qiu
The Centre for Multimedia Signal Processing, The Hong Kong Polytechnic University, Hong Kong, China
Kin Man Lam
Faculty of System Design, Tokyo Metropolitan University, 6-6, Asahigaoka, 191-0065, Hino-city, Tokyo
Hitoshi Kiya
Shanghai Key Laboratory of Intelligent Information Processing, Department of Computer Science & Engineering, Fudan University, Shanghai, China
Xiang-Yang Xue
Department of Electrical Engineering, University of Southern California, 90089-2564, Los Angeles, CA
C.-C. Jay Kuo
LIACS Media Lab, Leiden University
Michael S. Lew

About this paper

Cite this paper

Zhang, Z., Lu, T., Su, F., Yang, R. (2010). A New Text Detection Algorithm for Content-Oriented Line Drawing Image Retrieval. In: Qiu, G., Lam, K.M., Kiya, H., Xue, XY., Kuo, CC.J., Lew, M.S. (eds) Advances in Multimedia Information Processing - PCM 2010. PCM 2010. Lecture Notes in Computer Science, vol 6297. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15702-8_31

DOI: https://doi.org/10.1007/978-3-642-15702-8_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15701-1
Online ISBN: 978-3-642-15702-8
eBook Packages: Computer Science (R0)
