Unsupervised Segmentation of Text Fragments in Real Scenes

  • Leonardo M. B. Claudino
  • Antônio de P. Braga
  • Arnaldo de A. Araújo
  • André F. Oliveira
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3617)


This paper proposes a method that aims to reduce a real scene to a set of regions that contain text fragments and keep small number of false positives. Text is modeled and characterized as a texture pattern, by employing the QMF wavelet decomposition as a texture feature extractor. Processing includes segmentation and spatial selection of regions and then content-based selection of fragments. Unlike many previous works, text fragments in different scales and resolutions laid against complex backgrounds are segmented without supervision. Tested in four image databases, the method is able to reduce visual noise to 4.69% and reaches 96.5% of coherency between the localized fragments and those generated by manual segmentation.


Manual Segmentation Decomposition Level License Plate Real Scene Visual Noise 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. 1.
    Barroso, J., Rafael, A., Dagless, E.L., Bulas-Cruz, J.: Number plate reading using computer vision. In: IEEE-International Symposium on Industrial Electronics ISIE 1997, pp. 761–766 (1997)Google Scholar
  2. 2.
    Setchell, C.J.: Applications of Computer Vision to Road-traffic Monitoring. PhD thesis, Faculty of Engineering, Department of Electrical and Electronic Engineering of the Univesity of Bristol (1997)Google Scholar
  3. 3.
    Mariano, V.Y., Kasturi, R.: Detection of text marks on moving vehicles. In: 7th International Conference on Document Analysis and Recognition (ICDAR 2003), 2-Volume Set, August 3-6, Edinburgh, Scotland, UK, pp. 393–397 (2003)Google Scholar
  4. 4.
    Clark, P., Mirmehdi, M.: Finding text regions using localised measures. In: Mirmehdi, M., Thomas, B. (eds.) Proceedings of the 11th British Machine Vision Conference, pp. 675–684. BMVA Press (2000)Google Scholar
  5. 5.
    Wu, V., Manmatha, R., Riseman, E.M.: Textfinder: An automatic system to detect and recognize text in images. IEEE Transactions on Pattern Analysis and Machine Intelligence 21, 1224–1229 (1999)CrossRefGoogle Scholar
  6. 6.
    Etemad, K., Doermann, D., Chellapa, R.: Multiscale segmentation of unstructured document pages using soft decision integration. IEEE Transactions on Pattern Analysis and Machine Intelligence 19, 92–96 (1997)CrossRefGoogle Scholar
  7. 7.
    Jain, A.K., Yu, B.: Automatic text location in images and video frames. Pattern Recognition 31, 2055–2076 (1998)CrossRefGoogle Scholar
  8. 8.
    Smith, J.R.: Integrated Spatial and Feature Image Systems: Retrieval, Analysis and Compression. Phd. thesis, Graduate School of Arts and Sciences, Columbia University, New York, NY (1997)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2005

Authors and Affiliations

  • Leonardo M. B. Claudino
    • 1
  • Antônio de P. Braga
    • 1
  • Arnaldo de A. Araújo
    • 2
  • André F. Oliveira
    • 2
  1. 1.Centro de Pesquisa e Desenvolvimento em Engenharia ElétricaUniversidade Federal de Minas GeraisBelo Horizonte, Minas GeraisBrazil
  2. 2.Depto. de Ciência da ComputaçãoUniversidade Federal de Minas GeraisBelo Horizonte, Minas GeraisBrazil

Personalised recommendations