Anveshak - A Groundtruth Generation Tool for Foreground Regions of Document Images

  • Soumyadeep DeyEmail author
  • Jayanta Mukherjee
  • Shamik Sural
  • Amit Vijay Nandedkar
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10481)


We propose a graphical user interface based groundtruth generation tool in this paper. Here, annotation of an input document image is done based on the foreground pixels. Foreground pixels are grouped together with user interaction to form labeling units. These units are then labeled by the user with the user defined labels. The output produced by the tool is an image with an XML file containing its metadata information. This annotated data can be further used in different applications of document image analysis.



This work is partly funded by TCS research scholar program and partly by Ministry of Communications and Information Technology, Government of India; MCIT 11(19)/2010-HCC (TDIL) dt. 28-12-2010.


  1. 1.
    Bradski, G.: The OpenCV Library. Dr. Dobb’s J. Softw. Tools 25(11) (2000)Google Scholar
  2. 2.
    Dey, S., Mukherjee, J., Sural, S.: Stamp and logo detection from document images by finding outliers. In: 2015 Fifth National Conference on Computer Vision, Pattern Recognition, Image Processing and Graphics (NCVPRIPG), pp. 1–4, December 2015Google Scholar
  3. 3.
    Dey, S., Mukherjee, J., Sural, S.: Consensus-based clustering for document image segmentation. Int. J. Doc. Anal. Recogn. (IJDAR) 19(4), 351–368 (2016)CrossRefGoogle Scholar
  4. 4.
    Dey, S., Mukhopadhyay, J., Sural, S., Bhowmick, P.: Margin noise removal from printed document images. In: Workshop on Document Analysis and Recognition, pp. 86–93 (2012)Google Scholar
  5. 5.
    Doermann, D., Zotkina, E., Li, H.: GEDI - A Groundtruthing Environment for Document Images (2010).
  6. 6.
    Douglas, D.H., Peucker, T.M.: Algorithm for the reduction of the number of points required to represent a digitized line or its caricature. Cartographica: Int. J. Geogr. Inf. Geovisualization 10(2), 112–122 (1973)CrossRefGoogle Scholar
  7. 7.
    Ford, G., Thoma, G.R.: Ground truth data for document image analysis. In: Proceedings of 2003 Symposium on Document Image Understanding and Technology, pp. 199–205, 9-11 April 2003Google Scholar
  8. 8.
    Gonzalez, R.C., Woods, R.E.: Digital Image Processing, 3rd edn. Prentice-Hall Inc., Upper Saddle River (2009)Google Scholar
  9. 9.
    Ha, L.C., Kanungo, T.: The architecture of trueviz: a groundtruth/metadata editing and VIsualiZing toolkit. Pattern Recogn. 36(3), 811–825 (2003)CrossRefGoogle Scholar
  10. 10.
    Micenkova, B., Beusekom, J.V.: Stamp detection in color document images. In: 11th International Conference on Document Analysis and Recognition, pp. 1125–1129 (2011)Google Scholar
  11. 11.
    Moll, M., Baird, H., An, C.: Truthing for pixel-accurate segmentation. In: The Eighth IAPR International Workshop on Document Analysis Systems, DAS 2008, pp. 379–385, September 2008Google Scholar
  12. 12.
    Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man Cybern. 9(1), 62–66 (1979)CrossRefMathSciNetGoogle Scholar
  13. 13.
    Saund, E., Lin, J., Sarkar, P.: Pixlabeler: user interface for pixel-level labeling of elements in document images. In: 10th International Conference on Document Analysis and Recognition, pp. 646–650 (2009)Google Scholar
  14. 14.
    Shafait, F., Keysers, D., Breuel, T.: Pixel-accurate representation and evaluation of page segmentation in document images. In: 18th International Conference on Pattern Recognition, ICPR 2006, vol. 1, pp. 872–875 (2006)Google Scholar
  15. 15.
    Strecker, T., van Beusekom, J., Albayrak, S., Breuel, T.: Automated ground truth data generation for newspaper document images. In: 10th International Conference on Document Analysis and Recognition, pp. 1275–1279, July 2009Google Scholar
  16. 16.
    Suzuki, S., Abe, K.: Topological structural analysis of digitized binary images by border following. Comput. Vis. Graph. Image Process. 30(1), 32–46 (1985)CrossRefzbMATHGoogle Scholar
  17. 17.
    Wahl, F.M., Wong, K.Y., Casey, R.G.: Block segmentation and text extraction in mixed text/image documents. Comput. Graph. Image Process. 20(4), 375–390 (1982)CrossRefGoogle Scholar
  18. 18.
    Wenyin, L., Dori, D.: A protocol for performance evaluation of line detection algorithms. Mach. Vis. Appl. 9(5–6), 240–250 (1997)CrossRefGoogle Scholar
  19. 19.
    Yacoub, S., Saxena, V., Sami, S.: Perfectdoc: a ground truthing environment for complex documents. In: 8th International Conference on Document Analysis and Recognition, pp. 452–456 (2005)Google Scholar
  20. 20.
    Yang, L., Huang, W., Tan, C.L.: Semi-automatic ground truth generation for chart image recognition. In: Bunke, H., Spitz, A.L. (eds.) DAS 2006. LNCS, vol. 3872, pp. 324–335. Springer, Heidelberg (2006). doi: 10.1007/11669487_29 CrossRefGoogle Scholar
  21. 21.
    Yanikoglu, B., Vincent, L.: Pink panther: a complete environment for ground-truthing and benchmarking document page segmentation. Pattern Recogn. 31(9), 1191–1204 (1998)CrossRefGoogle Scholar

Copyright information

© Springer International Publishing AG 2017

Authors and Affiliations

  • Soumyadeep Dey
    • 1
    Email author
  • Jayanta Mukherjee
    • 1
  • Shamik Sural
    • 1
  • Amit Vijay Nandedkar
    • 1
  1. 1.Department of Computer Science and EngineeringIndian Institute of Technology KharagpurKharagpurIndia

Personalised recommendations