A Novel Approach to the Potentially Hazardous Text Identification Under Theme Uncertainty Based on Intelligent Data Analysis

Babutskiy, Vladislav; Sidorov, Igor

doi:10.1007/978-3-030-00211-4_4

Vladislav Babutskiy¹⁷ &
Igor Sidorov¹⁷

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 859))

Included in the following conference series:

Proceedings of the Computational Methods in Systems and Software

585 Accesses
1 Citations

Abstract

The problem of potentially hazardous text identification is an important one in the intelligent data analysis area. As usual, this problem is solved by methods and techniques, which are of a low efficiency in conditions of theme uncertainty.

Within this paper, a novel approach to the potentially hazardous text identification under theme uncertainty is presented. The main idea of data processing approach proposed is based on the user and automatically extracted keywords comparison. This paper contains the brief overview of the text identification methods, the description of the approach presented, some statistical experimental results, discussion and conclusion.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Nigam, K., McCallum, A.D., Thrun, S., Mitchell, T.M.: Text classification from labelled and unlabeled documents using EM. Mach. Learn. 39, 103–134 (2007)
Article Google Scholar
Omer, E.: Using machine learning to identify jihadist messages on Twitter. M.S. theses, Department of Information Technology, Uppsala University Sweden (2015)
Google Scholar
Zhang, L., Zhu, J., Yao, T.: An evaluation of statistical spam filtering techniques. ACM Trans. Asian Lang. Inf. Process. (TALIP) 3(4), 243–269 (2004)
Article Google Scholar
Rish, I.: An empirical study of the Naïve Bayes classifier. In: IJCAI 2001 Work Empire Methods Artificial Intelligence, vol. 3 (2001)
Google Scholar
Kwon, O.-W., Lee, J.-H.: Text categorization based on k-nearest neighbor approach for web site classification. Inf. Process. Manag. 39(1), 25–44 (2003)
Article Google Scholar
Tresch, M., Luniewski, A.: An extensible classifier for semi-structured documents. In: Park, E.K., Makki, K. (eds.) Proceedings of the Fourth International Conference on Information and Knowledge Management (CIKM 1995), Niki Pissinou, Avi Silber-schatz, pp. 226–233. ACM, New York (1995)
Google Scholar
Haykin, S.: Neural Networks - A Comprehensive Foundation. Canada (1999)
Google Scholar
Sebastiani, F.: Machine learning in automated text categorization. ACM Comput. Surv. 34, 1–47 (2002)
Article Google Scholar
Mahinovs, A., Tiwari, A.: Text Classification Method Review. Cranfield University, Cranfield (2007)
Google Scholar
Jordan, M.I., Bishop, C.: Neural Networks. CRC Press, Boca Raton (1997)
Google Scholar

Download references

Author information

Authors and Affiliations

South Federal University, Taganrog, Russia
Vladislav Babutskiy & Igor Sidorov

Authors

Vladislav Babutskiy
View author publications
You can also search for this author in PubMed Google Scholar
Igor Sidorov
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Vladislav Babutskiy .

Editor information

Editors and Affiliations

Faculty of Applied Informatics, Tomas Bata University in Zlin, Zlin, Czech Republic
Radek Silhavy
Faculty of Applied Informatics, Tomas Bata University in Zlin, Zlin, Czech Republic
Petr Silhavy
Faculty of Applied Informatics, Tomas Bata University in Zlin, Zlin, Czech Republic
Zdenka Prokopova

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Babutskiy, V., Sidorov, I. (2019). A Novel Approach to the Potentially Hazardous Text Identification Under Theme Uncertainty Based on Intelligent Data Analysis. In: Silhavy, R., Silhavy, P., Prokopova, Z. (eds) Computational and Statistical Methods in Intelligent Systems. CoMeSySo 2018. Advances in Intelligent Systems and Computing, vol 859. Springer, Cham. https://doi.org/10.1007/978-3-030-00211-4_4

Download citation

DOI: https://doi.org/10.1007/978-3-030-00211-4_4
Published: 30 August 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-00210-7
Online ISBN: 978-3-030-00211-4
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics