Rough Set-Based Feature Subset Selection Technique Using Jaccard’s Similarity Index

  • Bhawna TibrewalEmail author
  • Gargi Sur Chaudhury
  • Sanjay Chakraborty
  • Animesh Kairi
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 811)


Feature selection is the tool required to study data with high dimensions in an easy way. It involves extracting attributes from a dataset having a large number of attributes in such a way so as the reduced attribute set can describe the dataset in a manner similar to that of the entire attribute set. Reducing the features of the data and selecting only the more relevant features reduce the computational and storage requirements which are needed to process the entire dataset. Rough set is the approach of approximating a conventional set. It is used in data mining for reduction of datasets and to find hidden pattern in datasets. This paper aims to devise an algorithm which performs feature selection on a given dataset using the concepts of rough set.


Feature subset selection Machine learning Jaccard coefficient Rough set Adjacency matrix Indiscernibility relation 


  1. 1.
    Singh, B., Kushwaha, N., Vyas, O.P.: A feature subset selection technique for high dimensional data using symmetric uncertainty. J. Data Anal. Informat. Process. 2(4), 95 (2014)CrossRefGoogle Scholar
  2. 2.
    Jovic, A., Brkic, K., Bogunovic, N.: A review of feature selection methods with applications. In: 38th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO), pp. 1200–1205. IEEE (2015)Google Scholar
  3. 3.
    Rudnicki, W.R., Wrzesień, M., Paja, W.: All relevant feature selection methods and applications. In: Feature Selection for Data and Pattern Recognition, pp. 11–28. Springer (2015)Google Scholar
  4. 4.
    Ramaswami, M., Bhaskaran, R.: A Study On Feature Selection Techniques in Educational Data Mining. arXiv:0912.3924 (2009)
  5. 5.
    Mladenić, D.: Feature selection for dimensionality reduction. In: Subspace, Latent Structure and Feature Selection, pp. 84–102, Springer (2006)Google Scholar
  6. 6.
    Caballero, Y., Alvarez, D., Bello, R., Garcia, M.M.: Feature selection algorithms using rough set theory. In: ISDA Seventh International Conference on Intelligent Systems Design and Applications, pp. 407–411. IEEEGoogle Scholar
  7. 7.
    Pawlak, Z.: Rough sets. Int. J. Comput. Informat. Sci. 11(5), 341–356 (1982)CrossRefGoogle Scholar
  8. 8.
    Pawlak, Z.: Rough Sets: Theoretical Aspects of Reasoning About Data. Springer Science and Business Media, vol. 9 (2012)Google Scholar
  9. 9.
    Al-Radaideh, Q.A., Sulaiman, M.N., Selamat, M.H., Ibrahim, H.: Approximate reduct computation by rough sets based attribute weighting. In: IEEE International Conference on Granular Computing, vol. 2, pp. 383–386. IEEE (2005)Google Scholar
  10. 10.
    Zhang, M., Yao, J.: A rough sets based approach to feature selection. In: IEEE Annual Meeting of the Fuzzy Information, Processing NAFIPS’04, vol. 1, pp. 434–439 (2004)Google Scholar
  11. 11.
    Vijayabalaji, S., Balaji, P.: Rough matrix theory and its decision making. Int. J. Pure Appl. Math., 87(6), 845–853Google Scholar
  12. 12.
    Maheswari, D.U., Gunasundari, R.: User interesting navigation pattern discovery using fuzzy correlation based rule mining. Int. J. Appl. Eng. Res. 12(22), 11818–11823 (2017)Google Scholar
  13. 13.
    Chelvan, M.P., Perumal, K.: On feature selection algorithms and feature selection stability measures. Comp. Anal. 9(06), 159–168Google Scholar
  14. 14.
    Skrivanek, S.: The Use of Dummy Variables in Regression Analysis. More Steam, LLC (2009)Google Scholar
  15. 15.
    Koller, D., Sahami, M.: Toward Optimal Feature Selection. Technical Reports, Stanford InfoLab (1996)Google Scholar
  16. 16.
    Kohavi, R., John, G.H.: Wrappers for feature subset selection. Artif. Intell. 97(1–2), 273–324 (1997)CrossRefGoogle Scholar

Copyright information

© Springer Nature Singapore Pte Ltd. 2019

Authors and Affiliations

  • Bhawna Tibrewal
    • 1
    Email author
  • Gargi Sur Chaudhury
    • 1
  • Sanjay Chakraborty
    • 2
  • Animesh Kairi
    • 3
  1. 1.Computer Science and Engineering DepartmentInstitute of Engineering and ManagementKolkataIndia
  2. 2.Department of Information TechnologyTechnoIndiaSalt Lake, KolkataIndia
  3. 3.Department of Information TechnologyInstitute of Engineering and ManagementSalt Lake, KolkataIndia

Personalised recommendations