Advertisement

A Scalable Heuristic Classifier for Huge Datasets: A Theoretical Approach

  • Hamid Parvin
  • Behrouz Minaei-Bidgoli
  • Sajad Parvin
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7042)

Abstract

This paper proposes a heuristic classifier ensemble to improve the performance of learning in multiclass problems. Although the more accurate classifier leads to a better performance, there is another approach to use many inaccurate classifiers while each one is specialized for a few data in the problem space and using their consensus vote as the classifier. In this paper, some ensembles of classifiers are first created. The classifiers of each of these ensembles jointly work using majority weighting votes. The results of these ensembles are combined to decide the final vote in a weighted manner. Finally the outputs of these ensembles are heuristically aggregated. The proposed framework is evaluated on a very large scale Persian digit handwritten dataset and the experimental results show the effectiveness of the algorithm.

Keywords

Genetic Algorithm Optical Character Recognition Pairwise Classifier Multiclass Classification 

References

  1. 1.
    Breiman, L.: Bagging Predictors. Journal of Machine Learning 24(2), 123–140 (1996)zbMATHGoogle Scholar
  2. 2.
    Gunter, S., Bunke, H.: Creation of classifier ensembles for handwritten word recognition using feature selection algorithms. In: IWFHR 2002 (January 15, 2002)Google Scholar
  3. 3.
    Haykin, S.: Neural Networks, a comprehensive foundation, 2nd edn. Prentice Hall International, Inc. (1999) ISBN: 0-13-908385-5Google Scholar
  4. 4.
    Khosravi, H., Kabir, E.: Introducing a very large dataset of handwritten Farsi digits and a study on the variety of handwriting styles. Pattern Recognition Letters 28(10), 1133–1141 (2007)CrossRefGoogle Scholar
  5. 5.
    Kuncheva, L.I.: Combining Pattern Classifiers, Methods and Algorithms. Wiley, New York (2005)zbMATHGoogle Scholar
  6. 6.
    Minaei-Bidgoli, B., Punch, W.F.: Using Genetic Algorithms for Data Mining Optimization in an Educational Web-based System. In: GECCO (2003) Google Scholar
  7. 7.
    Parvin, H., Alizadeh, H., Minaei-Bidgoli, B.: A New Approach to Improve the Vote-Based Classifier Selection. In: International Conference on Networked Computing and Advanced Information Management (NCM 2008), Korea (2008)Google Scholar
  8. 8.
    Parvin, H., Alizadeh, H., Fathi, M., Minaei-Bidgoli, B.: Improved Face Detection Using Spatial Histogram Features. In: The 2008 Int. Conf. on Image Processing, Computer Vision, and Pattern Recognition (IPCV 2008), Las Vegas, Nevada, USA (July 14-17, 2008)Google Scholar
  9. 9.
    Parvin, H., Alizadeh, H., Minaei-Bidgoli, B., Analoui, M.: An Scalable Method for Improving the Performance of Classifiers in Multiclass Applications by Pairwise Classifiers and GA. In: International Conference on Networked Computing and advanced Information Management (NCM 2008), Korea (2008)Google Scholar
  10. 10.
    Saberi, A., Vahidi, M., Minaei-Bidgoli, B.: Learn to Detect Phishing Scams Using Learning and Ensemble Methods. In: IEEE/WIC/ACM International Conference on Intelligent Agent Technology, Workshops (IAT 2007), Silicon Valley, USA, November 2-5, pp. 311–314 (2007)Google Scholar
  11. 11.
    Yang, T.: Computational Verb Decision Trees. International Journal of Computational Cognition, 34–46 (2006)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Hamid Parvin
    • 1
  • Behrouz Minaei-Bidgoli
    • 1
  • Sajad Parvin
    • 1
  1. 1.Nourabad Mamasani BranchIslamic Azad UniversityNourabadIran

Personalised recommendations