Abstract
In recent years, dimension of datasets has increased rapidly in many applications which bring great difficulty to data mining and pattern recognition. Also, all the measured variables of these high-dimensional datasets are not relevant for understanding the underlying phenomena of interest. In this paper, firstly, similarities among the attributes are measured by computing similarity factors based on relative indiscernibility relation, a concept of rough set theory. Based on the similarity factors, attribute similarity set AS = {(A \(\overset{k}{\rightarrow}\) B) / A, B are attributes and B similar to A with similarity factor k} is formed which helps to construct a directed weighted graph with weights as the inverse of similarity factor k. Then a minimal spanning tree of the graph is generated, from which iteratively most important vertex is selected in reduct set. The iteration completes when the edge set is empty. Thus the selected attributes, from which edges emanate, are the most relevant attributes and are known as reduct. The proposed method has been applied on some benchmark datasets and the classification accuracy is calculated by various classifiers to demonstrate the effectiveness of the method.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Pawlak, Z.: Rough sets. International Journal of Information and Computer Sciences 11, 341–356 (1982)
Pawlak, Z.: Rough set theory and its applications to data analysis. Cybernetics and Systems 29, 661–688 (1998)
Hu, X., Lin, T.Y., Jianchao, J.: A New Rough Sets Model Based on Database Systems. Fundamental Informaticae, 1–18 (2004)
Jensen, R., Shen, Q.: Fuzzy-Rough Attribute Reduction with Application to Web Categorization. Fuzzy Sets and Systems 141(3), 469–485 (2004)
Zhong, N., Skowron, A.: A Rough Set-Based Knowledge Discovery Process. Int. Journal of Applied Mathematics and Computer Science 11(3), 603–619 (2001); BIME Journal 05(1) (2005)
Kerber, R.: ChiMerge: Discretization of Numeric Attributes. In: Proceedings of AAAI-1992, Ninth Int’l Conf. Artificial Intelligence, pp. 123–128. AAAI Press (1992)
Devroye, L., Gyorfi, L., Lugosi, G.: A Probabilistic Theory of Pattern Recognition. Springer, Newyork (1996)
Gupta, S.C., Kapoor, V.K.: Fundamental of Mathematical Statistics. Sultan Chand & Sons, A.S. Printing Press, India (1994)
Pal, S.K., Mitra, S.: Neuro-Fuzzy pattern Recognition: Methods in Soft Computing. Willey, New York (1999)
Bang-Jensen, J., Gutin, G.: Digraphs: Theory, Algorithms and Applications. Springer, Heidelberg, ISBN 1-85233-268-9
Chu, Y.J., Liu, T.H.: On the shortest arborescence of a directed graph. Science Sinica 14, 1396–1400 (1965)
Murphy, P., Aha, W.: UCI repository of machine learning databases (1996), http://www.ics.uci.edu/mlearn/MLRepository.html
WEKA: Machine Learning Software, http://www.cs.waikato.ac.nz/~ml/
Hall, M.A.: Correlation-Based Feature Selection for Machine Learning PhD thesis, Dept. of Computer Science, Univ. of Waikato, Hamilton, New Zealand (1998)
Liu, Setiono, R.: A Probabilistic Approach to Feature Selection: A Filter Solution. In: Proc. 13th Int’l Conf. Machine
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Das, A.K., Sengupta, S., Chakrabarty, S. (2012). Reduct Generation by Formation of Directed Minimal Spanning Tree Using Rough Set Theory. In: Satapathy, S.C., Avadhani, P.S., Abraham, A. (eds) Proceedings of the International Conference on Information Systems Design and Intelligent Applications 2012 (INDIA 2012) held in Visakhapatnam, India, January 2012. Advances in Intelligent and Soft Computing, vol 132. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-27443-5_15
Download citation
DOI: https://doi.org/10.1007/978-3-642-27443-5_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-27442-8
Online ISBN: 978-3-642-27443-5
eBook Packages: EngineeringEngineering (R0)