A Technique for Segmentation of Gurmukhi Text

  • G. S. Lehal
  • Chandan Singh
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2124)


This paper describes a technique for text segmentation of machine-printed Gurmukhi script documents. Segmentation of Gurmukhi script is difficult mainly because of characteristics peculiar to the script: characters are connected along the headline, two or more characters in a word can have intersecting minimum bounding rectangles, some characters consist of multiple components, and touching characters occur even in clean documents. The segmentation problems unique to the Gurmukhi script, such as horizontally overlapping text segments and touching characters in various zonal positions in a word, are discussed in detail, and a solution is proposed.
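The abstract does not give the algorithm itself, so the following is only an illustrative sketch of two of the difficulties it names, not the authors' method. It assumes a binarized word image stored as a list of rows (1 = ink), locates the headline as the row with the most ink, splits the region below it at blank columns (a common baseline for headline scripts), and tests whether two bounding rectangles intersect. All function names and the toy image are hypothetical.

```python
from typing import List, Tuple

Box = Tuple[int, int, int, int]  # (left, top, right, bottom), inclusive pixels
Image = List[List[int]]          # rows of 0/1 values, 1 = ink


def headline_row(word: Image) -> int:
    """Return the index of the row with the most ink pixels."""
    row_sums = [sum(row) for row in word]
    return row_sums.index(max(row_sums))


def column_runs(word: Image, headline: int) -> List[Tuple[int, int]]:
    """Return inclusive column ranges that contain ink below the headline.

    Ranges are separated by columns that are blank below the headline.
    """
    body = word[headline + 1:]
    width = len(word[0])
    has_ink = [any(row[x] for row in body) for x in range(width)]
    runs: List[Tuple[int, int]] = []
    start = None
    for x, ink in enumerate(has_ink):
        if ink and start is None:
            start = x                    # a new run begins
        elif not ink and start is not None:
            runs.append((start, x - 1))  # the run ended at the previous column
            start = None
    if start is not None:
        runs.append((start, width - 1))
    return runs


def boxes_intersect(a: Box, b: Box) -> bool:
    """Return True if two inclusive rectangles share at least one pixel."""
    return a[0] <= b[2] and b[0] <= a[2] and a[1] <= b[3] and b[1] <= a[3]


# Toy word: a full-width headline joining two simple shapes.
WORD: Image = [
    [0, 0, 0, 0, 0, 0, 0],
    [1, 1, 1, 1, 1, 1, 1],  # headline
    [0, 1, 0, 0, 0, 1, 0],
    [0, 1, 1, 0, 1, 1, 0],
    [0, 1, 0, 0, 0, 1, 0],
]

if __name__ == "__main__":
    top = headline_row(WORD)
    print(top, column_runs(WORD, top))
```

A blank-column split of this kind works only when neighbouring characters are separated by an empty column below the headline. When their bounding rectangles intersect horizontally, or when the characters touch, no such column exists, which is the kind of case the abstract singles out as needing special treatment.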


Keywords: text segmentation, Gurmukhi script




Copyright information

© Springer-Verlag Berlin Heidelberg 2001

Authors and Affiliations

  • G. S. Lehal (1)
  • Chandan Singh (2)
  1. Department of Computer Science and Engineering, Thapar Institute of Engineering & Technology, Patiala, India
  2. Department of Computer Science and Engineering, Punjabi University, Patiala, India
