Skip to main content

DualPOS: A Semi-supervised Attribute Selection Approach for Symbolic Data Based on Rough Set Theory

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9659))

Abstract

Rough set theory, supplying an effective model for representation of uncertain knowledge, has been widely used in knowledge engineering and data mining. Especially, rough set theory has been used as an attribute selection method with much success. However, current rough set approaches for attribute reduction are unsuitable for semi-supervised learning as no enough labeled data can guarantee to calculate the dependency degree. We propose a new attribute selection strategy based on rough sets, called DualPOS. It provides mutual function mechanism of multi-attributes, and generates the most consistent one as a candidate. Experiments are carried out to test the performances of classification and clustering of the proposed algorithm. The results show that DualPOS is valid for attribute selection in semi-supervised learning.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Blum, A.L., Langley, P.: Selection of relevant features and examples in machine learning. Artif. Intell. 97(1C2), 245–271 (1997)

    Article  MathSciNet  MATH  Google Scholar 

  2. Guyon, I., Elisseeff, A.: An introduction to variable and feature selection. J. Mach. Learn. Res. 3, 1157–1182 (2003)

    MATH  Google Scholar 

  3. Bae, C., Yeh, W.C., Chung, Y.Y., Liu, S.L.: Feature selection with intelligent dynamic swarm and rough set. Expert Syst. Appl. 37(10), 7026–7032 (2010)

    Article  Google Scholar 

  4. Pawlak, Z.: Rough sets. Int. J. Comput. Inform. Sci. 11(5), 341–356 (1982)

    Article  MathSciNet  MATH  Google Scholar 

  5. Pawlak, Z.: Rough sets and fuzzy sets. Fuzzy Sets Syst. 17(1), 99–102 (1985)

    Article  MathSciNet  MATH  Google Scholar 

  6. Revett, K., Iantovics, B.: A survey of electronic fetal monitoring: a computational perspective. Stud. Comput. Intell. 486, 135–141 (2014)

    Google Scholar 

  7. Skowron, A., Rauszer, C.: The discernibility matrices and functions in information systems. In: Slowiński, R. (ed.) Intelligent Decision Support. Theory and Decision Library, vol. 11, pp. 331–362. Springer, Netherlands (1992)

    Chapter  Google Scholar 

  8. Vafaie, H., Imam, I.F.: Feature selection methods: genetic algorithms vs. greedy-like search. In: Proceedings of the International Conference on Fuzzy and Intelligent Control Systems, pp. 39–43 (1994)

    Google Scholar 

  9. Hu, X., Cercone, N.: Learning in relational databases: a rough set approach. Comput. Intell. 11(2), 323–338 (1995)

    Article  Google Scholar 

  10. Hu, X.: Knowledge discovery in databases: an attribute-oriented rough set approach. Ph.D. thesis, Citeseer (1995)

    Google Scholar 

  11. Susmaga, R.: Reducts and constructs in attribute reduction. Fundamenta Informaticae 61(2), 159–181 (2004)

    MathSciNet  MATH  Google Scholar 

  12. Dai, J., Wang, W., Xu, Q.: An uncertainty measure for incomplete decision tables and its applications. IEEE Trans. Cybern. 43(4), 1277–1289 (2013)

    Article  Google Scholar 

  13. Dai, J., Wang, W., Tian, H., Liu, L.: Attribute selection based on a new conditional entropy for incomplete decision systems. Knowl.-Based Syst. 39, 207–213 (2013)

    Article  Google Scholar 

  14. Dai, J., Xu, Q., Wang, W., Tian, H.: Conditional entropy for incomplete decision systems and its application in data mining. Int. J. Gen. Syst. 41(7), 713–728 (2012)

    Article  MathSciNet  MATH  Google Scholar 

  15. Dai, J., Xu, Q.: Attribute selection based on information gain ratio in fuzzy rough set theory with application to tumor classification. Appl. Soft Comput. 13(1), 211–221 (2013)

    Article  Google Scholar 

  16. Dai, J., Li, Y.X., Liu, Q.: Hybrid genetic algorithm for reduct of attributes in decision system based on rough set theory. Wuhan Univ. J. Nat. Sci. 7(3), 285–289 (2002)

    Article  Google Scholar 

  17. Dai, J., Chen, W., Gu, H., Pan, Y.: Particle swarm algorithm for minimal attribute reduction of decision data tables. In: Proceedings First International Multi-Symposiums on Computer and Computational Sciences (IMSCCS 2006), Hangzhou, China, I, pp. 572–575, April 2006

    Google Scholar 

  18. Bazan, J.G., Nguyen, H.S., Nguyen, S.H., Synak, P., Wróblewski, J.: Rough set algorithms in classification problem. In: Polkowski, L., Tsumoto, S., Lin, T.Y. (eds.) Rough Set Methods and Applications. Studies in Fuzziness and Soft Computing, vol. 56, pp. 49–88. Springer, Heidelberg (2000)

    Chapter  Google Scholar 

  19. Wroblewski, J.: Finding minimal reducts using genetic algorithms. In: Proccedings of the 2nd Annual Join Conference on Infromation Science, pp. 186–189 (1995)

    Google Scholar 

  20. Zhu, X., Goldberg, A.B.: Introduction to semi-supervised learning. Synth. Lect. Artif. Intell. Mach. Learn. 3(1), 1–130 (2009)

    Article  MATH  Google Scholar 

  21. Pawlak, Z., Sowinski, R.: Rough set approach to multi-attribute decision analysis. Eur. J. Oper. Res. 72(3), 443–459 (1994)

    Article  MATH  Google Scholar 

  22. Pawlak, Z., Grzymala-Busse, J., Slowinski, R., Ziarko, W.: Rough sets. Commun. ACM 38(11), 88–95 (1995)

    Article  Google Scholar 

  23. Dai, J., Xu, Q.: Approximations and uncertainty measures in incomplete information systems. Inf. Sci. 198, 62–80 (2012)

    Article  MathSciNet  MATH  Google Scholar 

  24. Dai, J., Wang, W., Xu, Q., Tian, H.: Uncertainty measurement for interval-valued decision systems based on extended conditional entropy. Knowl.-Based Syst. 27, 443–450 (2012)

    Article  Google Scholar 

  25. Fayyad, U., Irani, K.: Multi-interval discretization of continuous-valued attributes for classification learning. In: Proceedings of the 13th International Join Conference on Artificial Intelligence, pp. 1022–1027 (1993)

    Google Scholar 

  26. Jain, A., Zongker, D.: Feature selection: evaluation, application, and small sample performance. IEEE Trans. Pattern Anal. Mach. Intell. 19(2), 153–158 (1997)

    Article  Google Scholar 

  27. Zhu, H., Zhou, M.: Efficient role transfer based on kuhn-munkres algorithm. IEEE Trans. Syst. Man Cybern. Part A Syst. Hum. 42(2), 491–496 (2012)

    Article  MathSciNet  Google Scholar 

Download references

Acknowledgements

This work was partially supported by the National Natural Science Foundation of China (No. 61473259, No. 61070074, No. 60703038), the Zhejiang Provincial Natural Science Foundation (No. Y14F020118), the National Science & Technology Support Program of China (2015BAK26B00, 2015BAK26B02) and the PEIYANG Young Scholars Program of Tianjin University (2016XRX-0001).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jianhua Dai .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Dai, J., Han, H., Hu, H., Hu, Q., Zhang, J., Wang, W. (2016). DualPOS: A Semi-supervised Attribute Selection Approach for Symbolic Data Based on Rough Set Theory. In: Cui, B., Zhang, N., Xu, J., Lian, X., Liu, D. (eds) Web-Age Information Management. WAIM 2016. Lecture Notes in Computer Science(), vol 9659. Springer, Cham. https://doi.org/10.1007/978-3-319-39958-4_31

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-39958-4_31

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-39957-7

  • Online ISBN: 978-3-319-39958-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics