A New Text Detection Algorithm for Content-Oriented Line Drawing Image Retrieval

  • Zhenyu Zhang
  • Tong Lu
  • Feng Su
  • Ruoyu Yang
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6297)


Content retrieval of scanned line drawing images is a difficult problem, especially from real-life large scale databases. Existing algorithms don’t work well due to their low efficiency by first recognizing various types of graphical primitives and then content-oriented texts. A new method for directly detecting texts from line drawing images is proposed in this paper. We first decompose a drawing image into a set of Local Consecutive Segments (LCSs). A LCS is defined as a minimum meaningful structural unit to imitate a stroke during human-drawing process. Next, we identify candidate character LCSs by statistical analysis and merge them into character LCS blocks by geometrical analysis. Finally, Hough transforms are applied to calculate the orientations of character LCS blocks and generate candidate strings. Experimental results show that our algorithm can well detect strings in any orientation. Our method is robust to text-graphic touching, scanning degradation and drawing noises, providing an efficient approach for content retrieval of document images.


text detection content-oriented line drawings image retrieval 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Zhang, C., Chai, J.Y., Jin, R.: User term feedback in interactive text-based image retrieval. In: SIGIR 2005: Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 51–58. ACM, New York (2005)CrossRefGoogle Scholar
  2. 2.
    Zhao, R., Grosky, W.I.: Narrowing the semantic gap - improved text-based web document retrieval using visual features. IEEE Transactions on Multimedia 4, 189–200 (2002)CrossRefGoogle Scholar
  3. 3.
    Stehling, R.O., Falcão, A.X., Nascimento, M.A.: An adaptive and efficient clustering-based approach for content-based image retrieval in image databases. In: Database Engineering and Applications Symposium, International, p. 356 (2001)Google Scholar
  4. 4.
    Heczko, M., Hinneburg, A., Keim, D., Wawryniuk, M.: Multiresolution similarity search in image databases. Multimedia Systems 10, 28–40 (2004)CrossRefGoogle Scholar
  5. 5.
    Cho, S.B., Lee, J.Y.: A human-oriented image retrieval system using interactive genetic algorithm. In: IEEE Transactions on Systems, Man and Cybernetics, Part A: Systems and Humans, vol. 32(3), pp. 452–458Google Scholar
  6. 6.
    Xu, X., Zhang, L., Yu, Z., Zhou, C.: Image retrieval using multi-granularity color features. In: International Conference on Audio, Language and Image Processing, ICALIP 2008, Shanghai, pp. 1584–1589 (2008)Google Scholar
  7. 7.
    Dori, D., Wenyin, L.: Vector-based segmentation of text connected to graphics in engineering drawings. In: Perner, P., Rosenfeld, A., Wang, P. (eds.) SSPR 1996. LNCS, vol. 1121, pp. 322–331. Springer, Heidelberg (1996)Google Scholar
  8. 8.
    Su, F., Lu, T., Cai, S., Yang, R.: A character segmentation method for engineering drawings based on holistic and contextual constraints. In: GREC 2009: Proceddings of the 8th IAPR International Workshop on Graphics RECognition, France, pp. 280–290 (2009)Google Scholar
  9. 9.
    Song, J., Su, F., Tai, C.L., Cai, S.: An object-oriented progressive-simplification-based vectorization system for engineering drawings: model, algorithm, and performance. IEEE Transactions on Pattern Analysis and Machine Intelligence 24(8), 1048–1060Google Scholar
  10. 10.
    Fletcher, L.A., Kasturi, R.: A robust algorithm for text string separation from mixed text/graphics images. IEEE Transactions on Pattern Analysis and Machine Intelligence 10(6), 910–918Google Scholar
  11. 11.
    Lai, C.P., Kasturi, R.: Detection of dimension sets in engineering drawings. IEEE Transactions on Pattern Analysis and Machine Intelligence 16(8), 848–855Google Scholar
  12. 12.
    Lu, Z.: Detection of text regions from digital engineering drawings. IEEE Transactions on Pattern Analysis and Machine Intelligence 20(4), 431–439Google Scholar
  13. 13.
    Strouthopoulos, C., Nikolaidis, A.: A robust technique for text extraction in mixed-type binary documents. In: 19th International Conference on Pattern Recognition, ICPR 2008, Tampa, FL, pp. 1–4 (2008)Google Scholar
  14. 14.
    Roy, P.P., Pal, U., Lladós, J.: Touching text character localization in graphical documents using SIFT. In: Ogier, J.-M., Liu, W., Lladós, J. (eds.) GREC 2009. LNCS, vol. 6020, pp. 271–279. Springer, Heidelberg (2010)CrossRefGoogle Scholar
  15. 15.
    Lu, T., Tai, C.L., Yang, H., Cai, S.: A novel knowledge-based system for interpreting complex engineering drawings: Theory, representation, and implementation. IEEE Transactions on Pattern Analysis and Machine Intelligence 31(8), 1444–1457Google Scholar
  16. 16.
    Lu, T., Yang, H., Yang, R., Cai, S.: Automatic analysis and integration of architectural drawings. Int. J. Doc. Anal. Recognit. 9(1), 31–47 (2007)CrossRefGoogle Scholar
  17. 17.
    Lu, T., Tai, C.L., Su, F., Cai, S.: A new recognition model for electronic architectural drawings. Comput. Aided Des. 37(10), 1053–1069 (2005)CrossRefGoogle Scholar
  18. 18.
    Song, J., Li, Z., Lyu, M.R., Cai, S.: Recognition of merged characters based on forepart prediction, necessity-sufficiency matching, and character-adaptive masking. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics 35(1), 2–11Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Zhenyu Zhang
    • 1
  • Tong Lu
    • 1
    • 2
  • Feng Su
    • 1
  • Ruoyu Yang
    • 1
  1. 1.State Key Lab. for Novel Software TechnologyNanjing UniversityChina
  2. 2.Jiangyin Institute of Information Technology of Nanjing UniversityChina

Personalised recommendations