Multimedia Tools and Applications

, Volume 78, Issue 22, pp 31757–31791 | Cite as

Optimal feature and classifier selection for text region classification in natural scene images using Weka tool

  • Rituraj SoniEmail author
  • Bijendra Kumar
  • Satish Chand


The problem of text detection and localization in scene images has always been challenging for the researchers over the years due to diversities present in these images. This diversity includes variation in fonts, size, color, different backgrounds, etc. The textual content in such images can be helpful for humans in many different domains like visually impaired people, scene understanding, intelligent navigation, etc. The natural scene contains some non-text objects along with relevant text objects, and it is necessary to classify them appropriately & accurately to increase the performance of the detection and localization method. The classification of text regions in scene images depends on the selection of optimal features and optimal classifier. This work contributes to finding both the optimal feature set and the optimal classifier with the help of weka tool. In this paper, first, we detect the possible text regions with the help of the improved MSER algorithm; then, we extract 11 features on these potential text regions. From these 11 features, we choose an optimal feature set for discrimination between text and non-text components with the help of the CfsSubsetEval and BFS parameter of the Weka Tool. We trained several classifiers using these optimal features with the help of Weka tool on the ICDAR 2013 training set. The performance of these classifiers is compared empirically based on the classification accuracy obtained using Weka tool. Based on this empirical estimation, Naive Bayes Classifier with the highest accuracy of 92.5% is proposed as an optimal choice for classification purpose.


Extraction of text regions MSER Feature selection and extraction Classification Weka tool 



This work is supported by UPE-II, Jawaharlal University, New Delhi, India.


  1. 1.
    Amancio DR, Comin CH, Casanova D, Travieso G, Bruno OM, Rodrigues FA, da Fontoura Costa L (2014) A systematic comparison of supervised classifiers. PloS one 9(4):e94137CrossRefGoogle Scholar
  2. 2.
    Ansari GJ, Shah JH, Yasmin M, Sharif M, Fernandes SL (2018) A novel machine learning approach for scene text extraction. Future Generation Computer SystemsGoogle Scholar
  3. 3.
    Baran R, Partila P, Wilk R (2018) Automated text detection and character recognition in natural scenes based on local image features and contour processing techniques. In: International conference on intelligent human systems integration. Springer, pp 42–48Google Scholar
  4. 4.
    Bušta M, Neumann L, Matas J (2017) Deep textspotter: an end-to-end trainable scene text localization and recognition framework. In: IEEE International conference on computer vision (ICCV). Venice, pp 22–29Google Scholar
  5. 5.
    Chang CC, Lin CJ (2011) Libsvm: a library for support vector machines. ACM Trans Intell Syst Technol (TIST) 2(3):27Google Scholar
  6. 6.
    Chen X, Yuille AL (2004) Detecting and reading text in natural scenes. In: Proceedings of the 2004 IEEE computer society conference on computer vision and pattern recognition, 2004. CVPR 2004, vol 2. IEEE, pp II–IIGoogle Scholar
  7. 7.
    Chen H, Tsai SS, Schroth G, Chen DM, Grzeszczuk R, Girod B (2011) Robust text detection in natural images with edge-enhanced maximally stable extremal regions. In: 2011 18th IEEE International conference on image processing (ICIP). IEEE, pp 2609–2612Google Scholar
  8. 8.
    da Silva BLS, Ciarelli PM (2016) Edge detection and confidence map applied to identify textual elements in imagesGoogle Scholar
  9. 9.
    Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: IEEE Computer society conference on computer vision and pattern recognition, 2005. CVPR 2005, vol 1. IEEE, pp 886–893Google Scholar
  10. 10.
    Dash JK, Mukhopadhyay S (2018) Similarity learning for texture image retrieval using multiple classifier system. Multimed Tools Appl 77(1):459–483CrossRefGoogle Scholar
  11. 11.
    Dash JK, Mukhopadhyay S, Gupta RD (2017) Multiple classifier system using classification confidence for texture classification. Multimed Tools Appl 76(2):2535–2556CrossRefGoogle Scholar
  12. 12.
    Epshtein B, Ofek E, Wexler Y (2010) Detecting text in natural scenes with stroke width transform. In: 2010 IEEE Conference on computer vision and pattern recognition (CVPR). IEEE, pp 2963–2970Google Scholar
  13. 13.
    Extremal MS, Matas J, Chum O, Urban M, Pajdla T (2002) Robust wide baseline stereo from in. In: British machine vision conference. CiteseerGoogle Scholar
  14. 14.
    Fabrizio J, Robert-Seidowsky M, Dubuisson S, Calarasanu S, Boissel R (2016) Textcatcher: a method to detect curved and challenging text in natural scenes. Int J Doc Anal Recogn (IJDAR) 19(2):99–117CrossRefGoogle Scholar
  15. 15.
    Feng Y, Song Y, Zhang Y (2016) Scene text detection based on multi-scale swt and edge filtering. In: 2016 23rd International conference on pattern recognition (ICPR). IEEE, pp 645–650Google Scholar
  16. 16.
    Frank E (2014) Fully supervised training of Gaussian radial basis function networks in wekaGoogle Scholar
  17. 17.
    Freund Y, Schapire RE (1999) Large margin classification using the perceptron algorithm. Mach Learn 37(3):277–296zbMATHCrossRefGoogle Scholar
  18. 18.
    Friedman N, Geiger D, Goldszmidt M (1997) Bayesian network classifiers. Mach Learn 29(2–3):131–163zbMATHCrossRefGoogle Scholar
  19. 19.
    Genkin A, Lewis DD, Madigan D (2007) Large-scale bayesian logistic regression for text categorization. Technometrics 49(3):291–304MathSciNetCrossRefGoogle Scholar
  20. 20.
    Ghanei S, Faez K (2017) A robust approach for scene text localization using rule-based confidence map and grouping. Int J Pattern Recogn Artif Intell 31(03):1753002CrossRefGoogle Scholar
  21. 21.
    Ghoshal R, Roy A, Dhara BC, Parui SK (2015) Text extraction from scene images through local binary pattern and business features based color image segmentation. In: Information systems design and intelligent applications. Springer, pp 475–482Google Scholar
  22. 22.
    Gllavata J, Ewerth R, Freisleben B (2004) Text detection in images based on unsupervised classification of high-frequency wavelet coefficients. In: Proceedings of the 17th International conference on pattern recognition, 2004. ICPR 2004, vol 1. IEEE, pp 425–428Google Scholar
  23. 23.
    Gonzalez A, Bergasa LM, Yebes JJ, Bronte S (2012) Text location in complex images. In: 2012 21st International conference on pattern recognition (ICPR). IEEE, pp 617–620Google Scholar
  24. 24.
    Guan L, Chu J (2017) Natural scene text detection based on swt, mser and candidate classification, IEEEGoogle Scholar
  25. 25.
    Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH (2009) The weka data mining software: an update. ACM SIGKDD Explor Newslett 11(1):10–18CrossRefGoogle Scholar
  26. 26.
    Hanif SM, Prevost L (2009) Text detection and localization in complex scene images using constrained adaboost algorithm. In: 10th International conference on document analysis and recognition, 2009. ICDAR’09. IEEE, pp 1–5Google Scholar
  27. 27.
    He T, Huang W, Qiao Y, Yao J (2016) Text-attentional convolutional neural network for scene text detection. IEEE Trans Image Process 25(6):2529–2541MathSciNetzbMATHCrossRefGoogle Scholar
  28. 28.
    Heckerman D, Geiger D, Chickering DM (1995) Learning bayesian networks: the combination of knowledge and statistical data. Mach Learn 20(3):197–243zbMATHGoogle Scholar
  29. 29.
    Iqbal K, Yin XC, Yin X, Ali H, Hao HW (2013) Classifier comparison for mser-based text classification in scene images. In: The 2013 international joint conference on neural networks (IJCNN). IEEE, pp 1–6Google Scholar
  30. 30.
    Jaderberg M, Simonyan K, Vedaldi A, Zisserman A (2014) Deep structured output learning for unconstrained text recognition. arXiv:14125903
  31. 31.
    Jian H, Xiaopei L, Qian Z (2016) A swt verified method of natural scene text location. In: 2016 International symposium on computer, consumer and control (IS3C). IEEE, pp 980–984Google Scholar
  32. 32.
    Jiang R, Qi F, Xu L, Wu G (2006) Using connected-components’ features to detect and segment text. J Image Graph 11:1653–1656Google Scholar
  33. 33.
    Jiang M, Cheng J, Chen M, Ku X (2018) An improved text localization method for natural scene images. In: Journal of physics: conference series, vol 960. IOP Publishing, p 012027Google Scholar
  34. 34.
    Joan SF, Valli S (2017) An enhanced text detection technique for the visually impaired to read text. Inf Syst Front 19(5):1039–1056CrossRefGoogle Scholar
  35. 35.
    Jung K, Kim KI, Jain AK (2004) Text information extraction in images and video: a survey. Pattern Recog 37(5):977–997CrossRefGoogle Scholar
  36. 36.
    Karatzas D, Shafait F, Uchida S, Iwamura M, i Bigorda LG, Mestre SR, Mas J, Mota DF, Almazan JA, de las Heras LP (2013) Icdar 2013 robust reading competition. In: 2013 12th International conference on document analysis and recognition (ICDAR). IEEE, pp 1484–1493Google Scholar
  37. 37.
    Kim HK, Kim M (2016) Model-induced term-weighting schemes for text classification. Appl Intell 45(1):30–43CrossRefGoogle Scholar
  38. 38.
    Kim KI, Jung K, Kim JH (2003) Texture-based approach for text detection in images using support vector machines and continuously adaptive mean shift algorithm. IEEE Trans Pattern Anal Mach Intell 25(12):1631–1639CrossRefGoogle Scholar
  39. 39.
    Klein DA, Frintrop S (2011) Center-surround divergence of feature statistics for salient object detection. In: 2011 IEEE International conference on computer vision (ICCV). IEEE, pp 2214–2219Google Scholar
  40. 40.
    Koo HI, Kim DH (2013) Scene text detection via connected component clustering and nontext filtering. IEEE Trans Image Process 22(6):2296–2305MathSciNetzbMATHCrossRefGoogle Scholar
  41. 41.
    Lee S, Cho MS, Jung K, Kim JH (2010) Scene text extraction with edge constraint and text collinearity. In: 2010 20th International conference on pattern recognition (ICPR). IEEE, pp 3983–3986Google Scholar
  42. 42.
    Lee JJ, Lee PH, Lee SW, Yuille A, Koch C (2011) Adaboost for text detection in natural scene. In: 2011 International conference on document analysis and recognition (ICDAR). IEEE, pp 429–434Google Scholar
  43. 43.
    Li M, Wang C (2008) An adaptive text detection approach in images and video frames. In: IEEE International joint conference on neural networks 2008 IJCNN 2008 IEEE World congress on computational intelligence. IEEE, pp 72–77Google Scholar
  44. 44.
    Li Y, Lu H (2012) Scene text detection via stroke width. In: 2012 21st International conference on pattern recognition (ICPR). IEEE, pp 681–684Google Scholar
  45. 45.
    Li Y, Jia W, Shen C, van den Hengel A (2014) Characterness: an indicator of text in the wild. IEEE Trans Image Process 23(4):1666–1677MathSciNetzbMATHCrossRefGoogle Scholar
  46. 46.
    Liu X, Samarabandu J (2006) Multiscale edge-based text extraction from complex images. In: 2006 IEEE International conference on multimedia and expo. IEEE, pp 1721–1724Google Scholar
  47. 47.
    Liu J, Su H, Yi Y, Hu W (2016) Robust text detection via multi-degree of sharpening and blurring. Signal Process 124:259–265CrossRefGoogle Scholar
  48. 48.
    Lucas SM (2005) Icdar 2005 text locating competition results. In: 2005 Proceedings. Eighth international conference on document analysis and recognition. IEEE, pp 80–84Google Scholar
  49. 49.
    Lucas SM, Panaretos A, Sosa L, Tang A, Wong S, Young R (2003) Icdar 2003 robust reading competitions. In: ICDAR, vol 2003. Citeseer, p 682Google Scholar
  50. 50.
    Ma L, Wang C, Xiao B (2010) Text detection in natural images based on multi-scale edge detetion and classification. In: 2010 3rd International congress on image and signal processing (CISP), vol 4. IEEE, pp 1961–1965Google Scholar
  51. 51.
    Majtey A, Lamberti P, Prato D (2005) Jensen-shannon divergence as a measure of distinguishability between mixed quantum states. Phys Rev A 72(5):052310CrossRefGoogle Scholar
  52. 52.
    Maruyama M, Yamaguchi T (2009) Extraction of characters on signboards in natural scene images by stump classifiers. In: 10th International conference on document analysis and recognition, 2009. ICDAR’09. IEEE, pp 1365–1369Google Scholar
  53. 53.
    McCallum A, Nigam K, et al. (1998) A comparison of event models for naive bayes text classification. In: AAAI-98 workshop on learning for text categorization, vol 752. Citeseer, pp 41–48Google Scholar
  54. 54.
    Misra C, Swain P, Mantri J (2012) Text extraction and recognition from image using neural network. Int J Comput Appl 40(2):13–19Google Scholar
  55. 55.
    Mukhopadhyay A, Kumar S, Chowdhury SR, Chakraborty N, Mollah AF, Basu S, Sarkar R (2019) Multi-lingual scene text detection using one-class classifier. Int J Comput Vis Image Process (IJCVIP) 9(2):48–65CrossRefGoogle Scholar
  56. 56.
    Neumann L, Matas J (2010) A method for text localization and recognition in real-world images. In: Asian conference on computer vision. Springer, pp 770–783Google Scholar
  57. 57.
    Neumann L, Matas J (2012) Real-time scene text localization and recognition. In: 2012 IEEE Conference on computer vision and pattern recognition (CVPR). IEEE, pp 3538–3545Google Scholar
  58. 58.
    Nguyen K, Thanh ND (2016) Scene text detection based on structural features. In: 2016 International conference on computer, control, informatics and its applications (IC3INA). IEEE, pp 48–53Google Scholar
  59. 59.
    Ou Ww, Zhu J, Liu C (2004) Text location in natural scene. J Chin Inf Process 5:006Google Scholar
  60. 60.
    Pan YF, Hou X, Liu CL (2011) A hybrid approach to detect and localize texts in natural scene images. IEEE Trans Image Process 20(3):800–813MathSciNetzbMATHCrossRefGoogle Scholar
  61. 61.
    Pan YF, Liu CL, Hou X (2010) Fast scene text localization by learning-based filtering and verification. In: 2010 17th IEEE International conference on image processing (ICIP). IEEE, pp 2269–2272Google Scholar
  62. 62.
    Panda M, Abraham A, Patra MR (2010) Discriminative multinomial naive bayes for network intrusion detection. In: 2010 Sixth International conference on information assurance and security (IAS). IEEE, pp 5–10Google Scholar
  63. 63.
    Pietikainen M, Okun O (2001) Edge-based method for text detection from complex document images. In: 2001 Proceedings. Sixth international conference on document analysis and recognition. IEEE, pp 286–291Google Scholar
  64. 64.
    Platt JC (1999) 12 fast training of support vector machines using sequential minimal optimization. Adv Kernel Methods, 185–208Google Scholar
  65. 65.
    Rish I, et al. (2001) An empirical study of the naive bayes classifier. In: IJCAI 2001 workshop on empirical methods in artificial intelligence, vol 3, pp 41–46Google Scholar
  66. 66.
    Salvithal NN, Kulkarni R (2013) Evaluating performance of data mining classification algorithm in wekaGoogle Scholar
  67. 67.
    Shahab A, Shafait F, Dengel A (2011) Icdar 2011 robust reading competition challenge 2: reading text in scene images. In: 2011 International conference on document analysis and recognition (ICDAR). IEEE, pp 1491–1496Google Scholar
  68. 68.
    Shi H, Liu Y (2011) Naïve bayes vs. support vector machine: resilience to missing data. In: International conference on artificial intelligence and computational intelligence. Springer, pp 680–687Google Scholar
  69. 69.
    Shi C, Wang C, Xiao B, Zhang Y, Gao S (2013) Scene text detection using graph model built upon maximally stable extremal regions. Pattern Recogn Lett 34(2):107–116CrossRefGoogle Scholar
  70. 70.
    Singh A, Halgamuge MN, Lakshmiganthan R (2017) Impact of different data types on classifier performance of random forest, naive bayes, and k-nearest neighbors algorithms. Int J Adv Comput Sci Appl 8(12):1–10Google Scholar
  71. 71.
    Soni R, Kumar B, Chand S (2018) Text detection and localization in natural scene images based on text awareness score. Appl Intell, 1–30Google Scholar
  72. 72.
    Sun L, Huo Q, Jia W, Chen K (2015) A robust approach for text detection from natural scene images. Pattern Recogn 48(9):2906–2920CrossRefGoogle Scholar
  73. 73.
    Šarić M (2017) Scene text segmentation using low variation extremal regions and sorting based character grouping. Neurocomputing 266:56–65CrossRefGoogle Scholar
  74. 74.
    Tang B, Kay S, He H (2016) Toward optimal feature selection in naive bayes for text categorization. arXiv:160202850
  75. 75.
    Unar S, Hussain A, Shaikh M, Memon KH, Ansari MA, Memon Z (2018) A study on text detection and localization techniques for natural scene images. IJCSNS 18(1):100Google Scholar
  76. 76.
    Wang K, Belongie S (2010) Word spotting in the wild. In: European conference on computer vision. Springer, pp 591–604Google Scholar
  77. 77.
    Wang Q, Lu Y, Sun S (2015) Text detection in nature scene images using two-stage nontext filtering. In: 2015 13th International conference on document analysis and recognition (ICDAR). IEEE, pp 106–110Google Scholar
  78. 78.
    Wei Y, Zhang Z, Shen W, Zeng D, Fang M, Zhou S (2017) Text detection in scene images based on exhaustive segmentation. Signal Process Image Commun 50:1–8CrossRefGoogle Scholar
  79. 79.
    Wu H, Zou B, Yq Zhao, Guo J (2017) Scene text detection using adaptive color reduction, adjacent character model and hybrid verification strategy. Vis Comput 33(1):113–126CrossRefGoogle Scholar
  80. 80.
    Xiang ZL, Yu XR, Kang DK (2016) Experimental analysis of naïve bayes classifier based on an attribute weighting framework with smooth kernel density estimations. Appl Intell 44(3):611–620CrossRefGoogle Scholar
  81. 81.
    Xu S, Krauthammer M (2010) A new pivoting and iterative text detection algorithm for biomedical images. J Biomed Inform 43(6):924–931CrossRefGoogle Scholar
  82. 82.
    Xu S, Li Y, Wang Z (2017) Bayesian multinomial naïve bayes classifier to text classification. In: Advanced multimedia and ubiquitous engineering. Springer, pp 347–352Google Scholar
  83. 83.
    Yao C, Bai X, Liu W, Ma Y, Tu Z (2012) Detecting texts of arbitrary orientations in natural images. In: 2012 IEEE Conference on computer vision and pattern recognition (CVPR). IEEE, pp 1083–1090Google Scholar
  84. 84.
    Ye Q, Doermann D (2015) Text detection and recognition in imagery: a survey. IEEE Trans Pattern Anal Mach Intell 37(7):1480–1500CrossRefGoogle Scholar
  85. 85.
    Ye Q, Jiao J, Huang J, Yu H (2007) Text detection and restoration in natural scene images. J Vis Commun Image Represent 18(6):504–513CrossRefGoogle Scholar
  86. 86.
    Yi C, Tian Y (2011) Text string detection from natural scenes by structure-based partition and grouping. IEEE Trans Image Process 20(9):2594–2605MathSciNetzbMATHCrossRefGoogle Scholar
  87. 87.
    Yu C, Song Y, Zhang Y (2016) Scene text localization using edge analysis and feature pool. Neurocomputing 175:652–661CrossRefGoogle Scholar
  88. 88.
    Zhang J, Kasturi R (2010) Text detection using edge gradient and graph spectrum. In: 2010 20th International conference on pattern recognition (ICPR). IEEE, pp 3979–3982Google Scholar
  89. 89.
    Zhang H, Zhao K, Song YZ, Guo J (2013) Text extraction from natural scene image: a survey. Neurocomputing 122:310–323CrossRefGoogle Scholar
  90. 90.
    Zhang G, Kai H, Zhang B, Fu H, ZHAO J (2017) A natural scene text extraction method based on the maximum stable extremal region and stroke width transform. J Xi’an Jiaotong Univ 1:021Google Scholar
  91. 91.
    Zhou G, Liu Y, Meng Q, Zhang Y (2011) Detecting multilingual text in natural scene. In: 2011 1st International symposium on access spaces (ISAS). IEEE, pp 116–120Google Scholar
  92. 92.
    Zhu A, Wang G, Dong Y (2015) Detecting natural scenes text via auto image partition, two-stage grouping and two-layer classification. Pattern Recogn Lett 67:153–162CrossRefGoogle Scholar
  93. 93.
    Zhu Y, Huang X, Huang Q, Tian Q (2016) Large-scale video copy retrieval with temporal-concentration sift. Neurocomputing 187:83–91CrossRefGoogle Scholar
  94. 94.
    Zhu Y, Yao C, Bai X (2016) Scene text detection and recognition: recent advances and future trends. Front Comput Sci 10(1):19–36CrossRefGoogle Scholar

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2019

Authors and Affiliations

  1. 1.Department of Computer EngineeringNSITNew DelhiIndia
  2. 2.School of Computer and Systems ScienceJNUNew DelhiIndia

Personalised recommendations