A Rejection Option for the Multilayer Perceptron Using Hyperplanes
Currently, a growing quantity of the Artificial Intelligence tasks demand a high efficiency of the classification systems (classifiers); making an error in the classification of an object or event can cause serious problems. This is worrying when the classifiers confront tasks where the classes are not linearly separable, the classifiers efficiency diminishes considerably. One solution for decreasing this complication is the Rejection Option. In several circumstances it is advantageous to not have a decision be taken and wait to obtain additional information instead of making an error.
This work contains the description of a novel reject procedure whose purpose is to identify elements with a high risk of being misclassified; like those in an overlap zone. For this, the location of the object in evaluation is calculated with regard to two hyperplanes that emulate the classifiers decision boundary. The area between these hyperplanes is named an overlap region. If the element is localized in this area, it is rejected.
Experiments conducted with the artificial neural network Multilayer Perceptron, trained with the Backpropagation algorithm, show between 12.0%- 91.4%of the objects in question would have been misclassified if they had not been rejected.
KeywordsReject option Multilayer Perceptron Backpropagation hyperplane overlap
Unable to display preview. Download preview PDF.
- 1.Burden, R.L., Douglas Faires, J., Reynolds, A.C.: Numerical Analysis. In: Prindle, Weber, Schmidt (eds.) Wadsworth International (1981)Google Scholar
- 3.Irie, B., Miyake, S.: Capabilities of three-layered perceptrons. In: Proc. of the IEEE Conference on Neural Networks, vol. I, pp. 641–648 (1988)Google Scholar
- 4.Lippmann, R.P.: An introduction to computing with neural nets. IEEE Acoust. Speech Signal Process. Mag. 4, 4–22 (1987)Google Scholar
- 6.Looney, C.G.: Pattern Recognition Using Neural Networks: Theory and Algorithms for Engineers and Scientists. Oxford University Press, Oxford (1997)Google Scholar
- 8.Merz, C.J., Murphy, P.M.: Repository of Machine Learning Databases, University of California at Irvine (1998), http://www.csi.uci.edu/mlearn
- 9.Reister, D.B.: The least Squares Fit of a Hyperplane to Uncertain Data. ORNL (1996)Google Scholar