Abstract
Internet users are always being attacked by spam messages, especially spam e-mails. Due to this issue, researchers had done many research works to find alternatives against the spam attacks. Different approaches, software and methods had been proposed in order to protect the Internet users from spam. This proposed work was inspired by the rough set theory, which was proven effective in handling uncertainties and large data set and also by the soft set theory which is a new emerging parameter reduction method that could overcome the limitation of rough set and fuzzy set theories in dealing with an uncertainty problem. The objective of this work was to propose a new hybrid parameter reduction method which could solve the uncertainty problem and inefficiency of parameterization tool issues which were used in the spam e-mail classification process. The experimental work had returned significant results which proved that the hybrid rough set and soft set parameter reduction method can be applied in the spam e-mail classification process that helps the classifier to classify spam e-mails effectively. As a recommendation, enhancement works on the functionality of this hybrid method shall be considered in different application fields, especially for the fields dealing with uncertainties problem and high dimension of data set.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Allias, N., Noor, M.N.M.M., Ismail, M.N.: A hybrid gini pso-svm feature selection based on taguchi method: an evaluation on email filtering. In: Proceedings of the 8th International Conference on Ubiquitous Information Management and Communication, p. 55. ACM (2014)
Chung, W., Tseng, T.L.: Discovering business intelligence from online product reviews: a rule-induction framework. Expert Syst. Appl. 39(15), 11870–11879 (2012)
Feng, F., Li, C., Davvaz, B., Ali, M.I.: Soft sets combined with fuzzy sets and rough sets: a tentative approach. Soft Comput. 14(9), 899–911 (2010)
Feng, F., Liu, X., Leoreanu-Fotea, V., Jun, Y.B.: Soft sets and soft rough sets. Inf. Sci. 181(6), 1125–1137 (2011)
Geng, S., Li, Y., Feng, F., Wang, X.: Generalized intuitionistic fuzzy soft sets and multiattribute decision making. In: Proceedings - 2011 4th International Conference on Biomedical Engineering and Informatics, BMEI 2011, vol. 4, pp. 2206–2211 (2011)
Guan, X., Li, Y., Feng, F.: A new order relation on fuzzy soft sets and its application. Soft Comput. 17(1), 63–70 (2013)
Herawan, T., Deris, M.M.: Soft decision making for patients suspected influenza. In: Taniar, D., Gervasi, O., Murgante, B., Pardede, E., Apduhan, B.O. (eds.) ICCSA 2010, Part III. LNCS, vol. 6018, pp. 405–418. Springer, Heidelberg (2010)
Hu, Y., Guo, C., Ngai, E.W.T., Liu, M., Chen, S.: A scalable intelligent non-content-based spam-filtering framework. Expert Syst. Appl. 37(12), 8557–8565 (2010)
Idris, I., Selamat, A., Omatu, S.: Hybrid email spam detection model with negative selection algorithm and differential evolution. Eng. Appl. Artif. Intell. 28, 97–110 (2014)
Inuiguchi, M., Yoshioka, Y., Kusunoki, Y.: Variable-precision dominance-based rough set approach and attribute reduction. Int. J. Approximate Reasoning 50(8), 1199–1214 (2009)
Ali, M.I.: A note on soft sets, rough soft sets and fuzzy soft sets. Appl. Soft Comput. J. 11(4), 3329–3332 (2011)
Jain, K.: A hybrid approach for spam filtering using support vector machine and artificial immune system, pp. 5–9 (2014)
Li, P., Wu, J., Qian, H.: Groundwater quality assessment based on rough sets attribute reduction and topsis method in a semi-arid area, china. Environ. Monit. Assess. 184(8), 4841–4854 (2012)
Ma, X., Sulaiman, N., Qin, H.: Parameterization value reduction of soft sets and its algorithm. In: 2011 IEEE Colloquium on Humanities, Science and Engineering, CHUSER 2011 (Chuser), pp. 261–264 (2011)
Meng, D., Zhang, X., Qin, K.: Soft rough fuzzy sets and soft fuzzy rough sets. Comput. Math. Appl. 62(12), 4635–4645 (2011)
Rathi, M., Pareek, V.: Spam mail detection through data mining a comparative performance analysis. Int. J. Mod. Educ. Comput. Sci. 5, 31–39 (2013)
Salama, M.A., Hassanien, A.E., Revett, K.: Employment of neural network and rough set in meta-learning. Memetic Comput. 5(3), 165–177 (2013)
Senan, N., Ibrahim, R., Nawi, N.M., Yanto, I.T.R., Herawan, T.: Soft set theory for feature selection of traditional malay musical instrument sounds. In: Zhu, R., Zhang, Y., Liu, B., Liu, C. (eds.) ICICA 2010. LNCS, vol. 6377, pp. 253–260. Springer, Heidelberg (2010)
Son, C.S., Kim, Y.N., Kim, H.S., Park, H.S., Kim, M.S.: Decision-making model for early diagnosis of congestive heart failure using rough set and decision tree approaches. J. Biomed. Inform. 45(5), 999–1008 (2012)
Whissell, J.S., Clarke, C.L.: Clustering for semi-supervised spam filtering categories and subject descriptors. In: Proceedings of the 8th Annual Collaboration, Electronic messaging, Anti-Abuse and Spam Conference, pp. 125–134. ACM (2011)
Zhang, Z.: A rough set approach to intuitionistic fuzzy soft set based decision making. Appl. Math. Model. 36(10), 4605–4633 (2012)
Zhu, D., Li, Y., Nie, H., Huang, C.: A decision-making method based on soft set under incomplete information. In: 2013 25th Chinese Control and Decision Conference (CCDC), pp. 1620–1622. IEEE (2013)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Mohamad, M., Selamat, A. (2016). A New Hybrid Rough Set and Soft Set Parameter Reduction Method for Spam E-Mail Classification Task. In: Ohwada, H., Yoshida, K. (eds) Knowledge Management and Acquisition for Intelligent Systems . PKAW 2016. Lecture Notes in Computer Science(), vol 9806. Springer, Cham. https://doi.org/10.1007/978-3-319-42706-5_2
Download citation
DOI: https://doi.org/10.1007/978-3-319-42706-5_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-42705-8
Online ISBN: 978-3-319-42706-5
eBook Packages: Computer ScienceComputer Science (R0)