Improving the Accuracy of the Optimum-Path Forest Supervised Classifier for Large Datasets
In this work, a new approach for supervised pattern recognition is presented which improves the learning algorithm of the Optimum-Path Forest classifier (OPF), centered on detection and elimination of outliers in the training set. Identification of outliers is based on a penalty computed for each sample in the training set from the corresponding number of imputable false positive and false negative classification of samples. This approach enhances the accuracy of OPF while still gaining in classification time, at the expense of a slight increase in training time.
KeywordsOptimum-Path Forest Classifier Outlier Detection Supervised Classification Learning Algorithm
- 1.MPEG-7: The generic multimedia content description standard, part 1. IEEE MultiMedia 9, 78–87 (2002)Google Scholar
- 3.Asuncion, A., Newman, D.J.: UCI machine learning repository (2007)Google Scholar
- 4.Boser, B.E., Guyon, I.M., Vapnik, V.N.: A training algorithm for optimal margin classifiers. In: Proceedings of the 5th Workshop on Computational Learning Theory, pp. 144–152. ACM Press, New York (1992)Google Scholar
- 7.Papa, J.P., Cappabianco, F.A.M., Falcão, A.X.: Optimizing optimum-path forest classification for huge datasets. In: Proceedings of The 20th International Conference on Pattern Recognition (to appear, 2010)Google Scholar