Abstract
This work introduces a novel algorithm, called Condensation rule based on Fuzzy Rough Sets (FRSC), based on the FCNN rule together with fuzzy rough sets theory, to compute training-set-consistent subset for the nearest neighbor decision rule. In combination with fuzzy rough set theory, the FRSC rule improves the performance of FCNN rule. Two variants, named as FRSC1 and FRSC2, of the FRSC rule, are presented. The FRSC1 rule is suitable for small data set and the FRSC2 adapts to larger data sets. Compared with the FCNN rule, the FRSC1 rule requires much less time cost and gets smaller subset for small data set. For medium-size data set, less than 5000 samples, the FRSC2 rule has better time performance than FCNN rule.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Pawlak, Z.: Rough sets. Int. J. Comput. Inf. Sci. 11(5), 341–356 (1982)
Angiulli, F.: Fast nearest neighbor condensation for large data sets classification. IEEE Trans. Knowledge and Data Engineering 19(11), 1450–1464 (2007)
Kryszkiewicz, M.: Rough set approach to incomplete information systems. Inf. Sci. 112, 39–49 (1998)
Pawlak, Z.: Rough Sets: Theoretical Aspects of Reasoning About Data. Kluwer, Norwell (1991)
Tsumoto, S.: Automated extraction of medical expert system rules from clinical databases based on rough set theory. Inf. Sci. 112, 67–84 (1998)
Skowron, A., Polkowski, L.: Rough Sets in Knowledge Discovery, vol. 1, 2. Springer, Berlin (1998)
Jensen, R., Shen, Q.: Fuzzy-rough attribute reduction with application to web categorization. Fuzzy Sets and Systems 141, 469–485 (2004)
Yeung, D.S., Chen, D., Tsang, C.C., Lee, J.W.T., Xizhao, W.: On the generalization of fuzzy rough sets. IEEE Trans. Fuzzy Systems 13(3) (July 2005)
Aha, D.W.: “Editorial,” Artificial Intelligence Rev. Special Issue on Lazy Learning 11(5-5), 7–10 (1997)
Aha, D.W., Kibler, D., Albert, M.K.: Instance-Based Learning Algorithms. Machine Learning 6, 37–66 (1991)
Alpaydin, E.: Voting over Multiple Condensed Nearest Neighbors. Artificial Intelligence Rev. 11, 115–132 (1997)
Angiulli, F.: Fast Condensed Nearest Neighbor Rule. In: Proc. 22nd Int’l Conf. Machine Learning (ICML 2005), pp. 25–32 (2005)
Bay, S.: Combining Nearest Neighbor Classifiers through Multiple Feature Subsets. In: Proc. 15th Int’l Conf. Machine Learning, ICML 1998 (1998)
Bay, S.: Nearest Neighbor Classification from Multiple Feature Sets. Intelligent Data Analysis 3, 191–209 (1999)
Bhattacharya, B., Kaller, D.: Reference Set Thinning for the k-Nearest Neighbor Decision Rule. In: Proc. 14th Int’l Conf. Pattern Recognition, ICPR 1998 (1998)
Brighton, H., Mellish, C.: Advances in Instance Selection for Instance-Based Learning Algorithms. Data Mining and Knowledge Discovery 6(2), 153–172 (2002)
Cover, T.M., Hart, P.E.: Nearest Neighbor Pattern Classification. IEEE Trans. Information Theory 13(1), 21–27 (1967)
Dasarathy, B.: Minimal Consistent Subset (MCS) Identification for Optimal Nearest Neighbor Decision Systems Design. IEEE Trans. Systems, Man, and Cybernetics 24(3), 511–517 (1994)
Dasarathy, B.: Nearest Unlike Neighbor (NUN): An Aid to Decision Confidence Estimation. Optical Eng. 34, 2785–2792 (1995)
Devijver, P., Kittler, J.: On the Edited Nearest Neighbor Rule. In: Proc. Fifth Int’l Conf. Pattern Recognition (ICPR 1980), pp. 72–80 (1980)
Devroye, L.: On the Inequality of Cover and Hart in Nearest Neighbor Discrimination. IEEE Trans. Pattern Analysis and Machine Intelligence 3, 75–78 (1981)
Fukunaga, K., Hostetler, L.D.: k-Nearest-Neighbor Bayes-Risk Estimation. IEEE Trans. Information Theory 21, 285–293 (1975)
Gates, W.: The Reduced Nearest Neighbor Rule. IEEE Trans. Information Theory 18(3), 431–433 (1972)
Hart, P.E.: The Condensed Nearest Neighbor Rule. IEEE Trans. Information Theory 14(3), 515–516 (1968)
Karac̨ali, B., Krim, H.: Fast Minimization of Structural Risk by Nearest Neighbor Rule. IEEE Trans. Neural Networks 14(1), 127–134 (2003)
Ritter, G.L., Woodruff, H.B., Lowry, S.R., Isenhour, T.L.: An Algorithm for a Selective Nearest Neighbor Decision Rule. IEEE Trans. Information Theory 21, 665–669 (1975)
Wilfong, G.: Nearest Neighbor Problems. Int’l J. Computational Geometry & Applications 2(4), 383–416 (1992)
Wilson, D.L.: Asymptotic Properties of Nearest Neighbor Rules Using Edited Data. IEEE Trans. Systems, Man, and Cybernetics 2, 408–420 (1972)
Wilson, D.R., Martinez, T.R.: Reduction Techniques for Instance-Based Learning Algorithms. Machine Learning 38(3), 257–286 (2000)
Ziarko, W.P.: Rough sets, fuzzy sets and knowledge discovery. In: Workshop in Computing, London, U.K. (1994)
Chanas, S., Kuchta, D.: Further remarks on the relation between rough and fuzzy sets. Fuzzy Sets Syst. 47, 391–394 (1992)
Dubois, D., Prade, H.: Rough fuzzy sets and fuzzy rough sets. Int. J. Gen. Syst. 17(2-3), 191–209 (1990)
Yao, Y.Y.: A comparative study of fuzzy sets and rough sets. J. Inf. Sci. 109, 227–242 (1998)
Radzikowska, A.M., Kerre, E.E.: A comparative study of fuzzy rough sets. Fuzzy Sets Syst. 126, 137–155 (2002)
Yao, Y.Y.: Constructive and Algebraic methods of the theory of rough sets. Inf. Sci. 109, 21–47 (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Pan, W., She, K., Wei, P., Zeng, K. (2014). Nearest Neighbor Condensation Based on Fuzzy Rough Set for Classification. In: Miao, D., Pedrycz, W., Ślȩzak, D., Peters, G., Hu, Q., Wang, R. (eds) Rough Sets and Knowledge Technology. RSKT 2014. Lecture Notes in Computer Science(), vol 8818. Springer, Cham. https://doi.org/10.1007/978-3-319-11740-9_40
Download citation
DOI: https://doi.org/10.1007/978-3-319-11740-9_40
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11739-3
Online ISBN: 978-3-319-11740-9
eBook Packages: Computer ScienceComputer Science (R0)