Pattern Synthesis Using Fuzzy Partitions of the Feature Set for Nearest Neighbor Classifier Design
- 701 Downloads
Nearest neighbor classifiers require a larger training set in order to achieve a better classification accuracy. For a higher dimensional data, if the training set size is small, it suffers from the curse of dimensionality effect and performance gets degraded. Partition based pattern synthesis is an existing technique of generating a larger set of artificial training patterns based on a chosen partition of the feature set. If the blocks of the partition are statistically independent then the quality of synthetic patterns generated is high. But, such a partition, often does not exist for real world problems. So, approximate ways of generating a partition based on correlation coefficient values between pairs of features were used earlier in some studies. That is, an approximate hard partition, where each feature belongs to exactly one cluster (block) of the partition was used for doing the synthesis. The current paper proposes an improvement over this. Instead of having a hard approximate partition, a soft approximate partition based on fuzzy set theory could be beneficial. The present paper proposes such a fuzzy partitioning method of the feature set called fuzzy partition around medoids (fuzzy-PAM). Experimentally, using some standard data-sets, it is demonstrated that the fuzzy partition based synthetic patters are better as for as the classification accuracy is concerned.
KeywordsPattern synthesis fuzzy partition nearest neighbor classifier partition around medoids
Unable to display preview. Download preview PDF.
- 6.Dasarathy, B.V.: Nearest neighbor (NN) norms: NN pattern classification techniques. IEEE Computer Society Press, Los Alamitos (1991)Google Scholar
- 7.Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification, 2nd edn. A Wiley-interscience Publication, John Wiley & Sons (2000)Google Scholar
- 9.Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Academic Press (2001)Google Scholar
- 10.Murphy, P.M.: UCI Repository of Machine Learning Databases. Department of Information and Computer Science, University of California, Irvine, CA (2000), http://www.ics.uci.edu/mlearn/MLRepository.html
- 12.Viswanath, P., Murty, M., Bhatnagar, S.: A pattern synthesis technique with an efficient nearest neighbor classifier for binary pattern recognition. In: Proceedings of the 17 th International Conference on Pattern Recognition (ICPR 2004), Cambridge, UK, vol. 4, pp. 416–419 (2004)Google Scholar