Skip to main content

A Novel Approach to the Potentially Hazardous Text Identification Under Theme Uncertainty Based on Intelligent Data Analysis

  • Conference paper
  • First Online:
Computational and Statistical Methods in Intelligent Systems (CoMeSySo 2018)

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 859))

Included in the following conference series:

Abstract

The problem of potentially hazardous text identification is an important one in the intelligent data analysis area. As usual, this problem is solved by methods and techniques, which are of a low efficiency in conditions of theme uncertainty.

Within this paper, a novel approach to the potentially hazardous text identification under theme uncertainty is presented. The main idea of data processing approach proposed is based on the user and automatically extracted keywords comparison. This paper contains the brief overview of the text identification methods, the description of the approach presented, some statistical experimental results, discussion and conclusion.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Nigam, K., McCallum, A.D., Thrun, S., Mitchell, T.M.: Text classification from labelled and unlabeled documents using EM. Mach. Learn. 39, 103–134 (2007)

    Article  Google Scholar 

  2. Omer, E.: Using machine learning to identify jihadist messages on Twitter. M.S. theses, Department of Information Technology, Uppsala University Sweden (2015)

    Google Scholar 

  3. Zhang, L., Zhu, J., Yao, T.: An evaluation of statistical spam filtering techniques. ACM Trans. Asian Lang. Inf. Process. (TALIP) 3(4), 243–269 (2004)

    Article  Google Scholar 

  4. Rish, I.: An empirical study of the Naïve Bayes classifier. In: IJCAI 2001 Work Empire Methods Artificial Intelligence, vol. 3 (2001)

    Google Scholar 

  5. Kwon, O.-W., Lee, J.-H.: Text categorization based on k-nearest neighbor approach for web site classification. Inf. Process. Manag. 39(1), 25–44 (2003)

    Article  Google Scholar 

  6. Tresch, M., Luniewski, A.: An extensible classifier for semi-structured documents. In: Park, E.K., Makki, K. (eds.) Proceedings of the Fourth International Conference on Information and Knowledge Management (CIKM 1995), Niki Pissinou, Avi Silber-schatz, pp. 226–233. ACM, New York (1995)

    Google Scholar 

  7. Haykin, S.: Neural Networks - A Comprehensive Foundation. Canada (1999)

    Google Scholar 

  8. Sebastiani, F.: Machine learning in automated text categorization. ACM Comput. Surv. 34, 1–47 (2002)

    Article  Google Scholar 

  9. Mahinovs, A., Tiwari, A.: Text Classification Method Review. Cranfield University, Cranfield (2007)

    Google Scholar 

  10. Jordan, M.I., Bishop, C.: Neural Networks. CRC Press, Boca Raton (1997)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Vladislav Babutskiy .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Babutskiy, V., Sidorov, I. (2019). A Novel Approach to the Potentially Hazardous Text Identification Under Theme Uncertainty Based on Intelligent Data Analysis. In: Silhavy, R., Silhavy, P., Prokopova, Z. (eds) Computational and Statistical Methods in Intelligent Systems. CoMeSySo 2018. Advances in Intelligent Systems and Computing, vol 859. Springer, Cham. https://doi.org/10.1007/978-3-030-00211-4_4

Download citation

Publish with us

Policies and ethics