Skip to main content

Part of the book series: Advances in Intelligent and Soft Computing ((AINSC,volume 132))

Abstract

In recent years, dimension of datasets has increased rapidly in many applications which bring great difficulty to data mining and pattern recognition. Also, all the measured variables of these high-dimensional datasets are not relevant for understanding the underlying phenomena of interest. In this paper, firstly, similarities among the attributes are measured by computing similarity factors based on relative indiscernibility relation, a concept of rough set theory. Based on the similarity factors, attribute similarity set AS = {(A \(\overset{k}{\rightarrow}\) B) / A, B are attributes and B similar to A with similarity factor k} is formed which helps to construct a directed weighted graph with weights as the inverse of similarity factor k. Then a minimal spanning tree of the graph is generated, from which iteratively most important vertex is selected in reduct set. The iteration completes when the edge set is empty. Thus the selected attributes, from which edges emanate, are the most relevant attributes and are known as reduct. The proposed method has been applied on some benchmark datasets and the classification accuracy is calculated by various classifiers to demonstrate the effectiveness of the method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Pawlak, Z.: Rough sets. International Journal of Information and Computer Sciences 11, 341–356 (1982)

    Article  MATH  MathSciNet  Google Scholar 

  2. Pawlak, Z.: Rough set theory and its applications to data analysis. Cybernetics and Systems 29, 661–688 (1998)

    Article  MATH  Google Scholar 

  3. Hu, X., Lin, T.Y., Jianchao, J.: A New Rough Sets Model Based on Database Systems. Fundamental Informaticae, 1–18 (2004)

    Google Scholar 

  4. Jensen, R., Shen, Q.: Fuzzy-Rough Attribute Reduction with Application to Web Categorization. Fuzzy Sets and Systems 141(3), 469–485 (2004)

    Article  MATH  MathSciNet  Google Scholar 

  5. Zhong, N., Skowron, A.: A Rough Set-Based Knowledge Discovery Process. Int. Journal of Applied Mathematics and Computer Science 11(3), 603–619 (2001); BIME Journal 05(1) (2005)

    MATH  MathSciNet  Google Scholar 

  6. Kerber, R.: ChiMerge: Discretization of Numeric Attributes. In: Proceedings of AAAI-1992, Ninth Int’l Conf. Artificial Intelligence, pp. 123–128. AAAI Press (1992)

    Google Scholar 

  7. Devroye, L., Gyorfi, L., Lugosi, G.: A Probabilistic Theory of Pattern Recognition. Springer, Newyork (1996)

    MATH  Google Scholar 

  8. Gupta, S.C., Kapoor, V.K.: Fundamental of Mathematical Statistics. Sultan Chand & Sons, A.S. Printing Press, India (1994)

    Google Scholar 

  9. Pal, S.K., Mitra, S.: Neuro-Fuzzy pattern Recognition: Methods in Soft Computing. Willey, New York (1999)

    Google Scholar 

  10. Bang-Jensen, J., Gutin, G.: Digraphs: Theory, Algorithms and Applications. Springer, Heidelberg, ISBN 1-85233-268-9

    Google Scholar 

  11. Chu, Y.J., Liu, T.H.: On the shortest arborescence of a directed graph. Science Sinica 14, 1396–1400 (1965)

    MATH  Google Scholar 

  12. Murphy, P., Aha, W.: UCI repository of machine learning databases (1996), http://www.ics.uci.edu/mlearn/MLRepository.html

  13. WEKA: Machine Learning Software, http://www.cs.waikato.ac.nz/~ml/

  14. Hall, M.A.: Correlation-Based Feature Selection for Machine Learning PhD thesis, Dept. of Computer Science, Univ. of Waikato, Hamilton, New Zealand (1998)

    Google Scholar 

  15. Liu, Setiono, R.: A Probabilistic Approach to Feature Selection: A Filter Solution. In: Proc. 13th Int’l Conf. Machine

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Das, A.K., Sengupta, S., Chakrabarty, S. (2012). Reduct Generation by Formation of Directed Minimal Spanning Tree Using Rough Set Theory. In: Satapathy, S.C., Avadhani, P.S., Abraham, A. (eds) Proceedings of the International Conference on Information Systems Design and Intelligent Applications 2012 (INDIA 2012) held in Visakhapatnam, India, January 2012. Advances in Intelligent and Soft Computing, vol 132. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-27443-5_15

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-27443-5_15

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-27442-8

  • Online ISBN: 978-3-642-27443-5

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics