Abstract
Content retrieval of scanned line drawing images is a difficult problem, especially from real-life large scale databases. Existing algorithms don’t work well due to their low efficiency by first recognizing various types of graphical primitives and then content-oriented texts. A new method for directly detecting texts from line drawing images is proposed in this paper. We first decompose a drawing image into a set of Local Consecutive Segments (LCSs). A LCS is defined as a minimum meaningful structural unit to imitate a stroke during human-drawing process. Next, we identify candidate character LCSs by statistical analysis and merge them into character LCS blocks by geometrical analysis. Finally, Hough transforms are applied to calculate the orientations of character LCS blocks and generate candidate strings. Experimental results show that our algorithm can well detect strings in any orientation. Our method is robust to text-graphic touching, scanning degradation and drawing noises, providing an efficient approach for content retrieval of document images.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Zhang, C., Chai, J.Y., Jin, R.: User term feedback in interactive text-based image retrieval. In: SIGIR 2005: Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 51–58. ACM, New York (2005)
Zhao, R., Grosky, W.I.: Narrowing the semantic gap - improved text-based web document retrieval using visual features. IEEE Transactions on Multimedia 4, 189–200 (2002)
Stehling, R.O., Falcão, A.X., Nascimento, M.A.: An adaptive and efficient clustering-based approach for content-based image retrieval in image databases. In: Database Engineering and Applications Symposium, International, p. 356 (2001)
Heczko, M., Hinneburg, A., Keim, D., Wawryniuk, M.: Multiresolution similarity search in image databases. Multimedia Systems 10, 28–40 (2004)
Cho, S.B., Lee, J.Y.: A human-oriented image retrieval system using interactive genetic algorithm. In: IEEE Transactions on Systems, Man and Cybernetics, Part A: Systems and Humans, vol. 32(3), pp. 452–458
Xu, X., Zhang, L., Yu, Z., Zhou, C.: Image retrieval using multi-granularity color features. In: International Conference on Audio, Language and Image Processing, ICALIP 2008, Shanghai, pp. 1584–1589 (2008)
Dori, D., Wenyin, L.: Vector-based segmentation of text connected to graphics in engineering drawings. In: Perner, P., Rosenfeld, A., Wang, P. (eds.) SSPR 1996. LNCS, vol. 1121, pp. 322–331. Springer, Heidelberg (1996)
Su, F., Lu, T., Cai, S., Yang, R.: A character segmentation method for engineering drawings based on holistic and contextual constraints. In: GREC 2009: Proceddings of the 8th IAPR International Workshop on Graphics RECognition, France, pp. 280–290 (2009)
Song, J., Su, F., Tai, C.L., Cai, S.: An object-oriented progressive-simplification-based vectorization system for engineering drawings: model, algorithm, and performance. IEEE Transactions on Pattern Analysis and Machine Intelligence 24(8), 1048–1060
Fletcher, L.A., Kasturi, R.: A robust algorithm for text string separation from mixed text/graphics images. IEEE Transactions on Pattern Analysis and Machine Intelligence 10(6), 910–918
Lai, C.P., Kasturi, R.: Detection of dimension sets in engineering drawings. IEEE Transactions on Pattern Analysis and Machine Intelligence 16(8), 848–855
Lu, Z.: Detection of text regions from digital engineering drawings. IEEE Transactions on Pattern Analysis and Machine Intelligence 20(4), 431–439
Strouthopoulos, C., Nikolaidis, A.: A robust technique for text extraction in mixed-type binary documents. In: 19th International Conference on Pattern Recognition, ICPR 2008, Tampa, FL, pp. 1–4 (2008)
Roy, P.P., Pal, U., Lladós, J.: Touching text character localization in graphical documents using SIFT. In: Ogier, J.-M., Liu, W., Lladós, J. (eds.) GREC 2009. LNCS, vol. 6020, pp. 271–279. Springer, Heidelberg (2010)
Lu, T., Tai, C.L., Yang, H., Cai, S.: A novel knowledge-based system for interpreting complex engineering drawings: Theory, representation, and implementation. IEEE Transactions on Pattern Analysis and Machine Intelligence 31(8), 1444–1457
Lu, T., Yang, H., Yang, R., Cai, S.: Automatic analysis and integration of architectural drawings. Int. J. Doc. Anal. Recognit. 9(1), 31–47 (2007)
Lu, T., Tai, C.L., Su, F., Cai, S.: A new recognition model for electronic architectural drawings. Comput. Aided Des. 37(10), 1053–1069 (2005)
Song, J., Li, Z., Lyu, M.R., Cai, S.: Recognition of merged characters based on forepart prediction, necessity-sufficiency matching, and character-adaptive masking. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics 35(1), 2–11
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhang, Z., Lu, T., Su, F., Yang, R. (2010). A New Text Detection Algorithm for Content-Oriented Line Drawing Image Retrieval. In: Qiu, G., Lam, K.M., Kiya, H., Xue, XY., Kuo, CC.J., Lew, M.S. (eds) Advances in Multimedia Information Processing - PCM 2010. PCM 2010. Lecture Notes in Computer Science, vol 6297. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15702-8_31
Download citation
DOI: https://doi.org/10.1007/978-3-642-15702-8_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15701-1
Online ISBN: 978-3-642-15702-8
eBook Packages: Computer ScienceComputer Science (R0)