Abstract
Knowledge acquisition from text is an important research of artificial intelligence. In this paper, we present a method of acquiring knowledge from Chinese records of events of cyber attacks based on a semantic grammar. In order to parse the sentences in the records, the method first identifies Chinese noun phrases in the records, and then use the semantic grammar of the cyber-attack domain to parse the records. Finally, knowledge is extracted from the parsing trees. Experimental results show that our method for noun phase identification has a good performance, and the precision of knowledge acquisition reaches a high level of 90 %.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Zang, L., et al.: A Chinese framework of semantic taxonomy and description: preliminary experimental evaluation using web information extraction. In: Zhang, S., et al. (eds.) KSEM 2015. LNCS, vol. 9403, pp. 275–286. Springer, Heidelberg (2015). doi:10.1007/978-3-319-25159-2_25
Fillmore, C.J., Lee-Goldman, R., Rhodes, R.: Sign-based construction grammar and the framenet constructicon. Boas/Sag (Hg.) (2012)
Banko, M., Cafarella, M.J., Soderland, S., Broadhead, M., Etzioni, O.: Open information extraction from the web. Commun. ACM 51(12), 68–74 (2008)
Carlson, A., Betteridge, J., Kisiel, B., Settles, B., Hruschka Jr., E.R., Mitchell, T.M.: Toward an architecture for never-ending language learning. In: AAAI, vol. 5, p. 3 (2010)
Lenat, D.B.: CYC: a large-scale investment in knowledge infrastructure. Commun. ACM 38(11), 33–38 (1998)
Singh, P., et al.: The public acquisition of commonsense knowledge. In: Proceedings of AAAI Spring Symposium: Acquiring (and Using) Linguistic (and World) Knowledge for Information Access (2002)
Baker, C.F., Fillmore, C.J., Lowe, J.B.: The Berkeley framenet project. In: Proceedings of the 17th International Conference on Computational Linguistics, vol. 1, pp. 86–90. Association for Computational Linguistics (1998)
Fillmore, C.J., Johnson, C.R., Petruck, M.R.L.: Background to framenet. Int. J. Lexicography 16(3), 235–250 (2003)
Chinese framenet. http://sccfn.sxu.edu.cn/portal-en/home.aspx
Guo, H., Ying, M.: Application study of hidden Markov model based on genetic algorithm in noun phrase identification. Comput. Sci. 36(10), 244–247 (2009)
Li, R.: Noun phrase identification based on genetic algorithm and hidden Markov model. Int. J. Syst. Control 2(3), 221–227 (2007)
Kong, L., Ren, F., Sun, X., Quan, C.: Word frequency statistics model for Chinese base noun phrase identification. In: Huang, D.-S., Jo, K.-H., Wang, L. (eds.) ICIC 2014. LNCS, vol. 8589, pp. 635–644. Springer, Heidelberg (2014). doi:10.1007/978-3-319-09339-0_64
Stanford Word Segmenter. http://nlp.stanford.edu/software/segmenter.shtml
ICTCLAS (Institute of Computing Technology, Chinese Lexical Analysis System). http://www.nlp.org.cn/project/project.php?proj_id=6
Yuming, W.U., Luo, X., Yang, Z.: Semantic separator learning and its applications in unsupervised Chinese text parsing. Front. Comput. Sci. 7(1), 55–68 (2013). Selected Publications from Chinese Universities
Baidu Baike. http://baike.baidu.com/
Synonym dictionary. http://ir.hit.edu.cn/demo/ltp/Sharing_Plan.htm
Witten, I.H., Bell, T.C.: The zero-frequency problem: estimating the probabilities of novel events in adaptive text compression. IEEE Trans. Inf. Theory 37(4), 1085–1094 (1991)
Earley, J.: An Efficient Context-Free Parsing Algorithm. Morgan Kaufmann Publishers Inc., San Francisco (1986)
Acknowledgments
This word is supported by the National Science Foundation of China (under grant No. 91224006 and 61173063) and the Ministry of Science and Technology (under grant No. 201307107).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Fang, F., Wang, Y., Zhang, L., Cao, C. (2016). Knowledge Extraction from Chinese Records of Cyber Attacks Based on a Semantic Grammar. In: Lehner, F., Fteimi, N. (eds) Knowledge Science, Engineering and Management. KSEM 2016. Lecture Notes in Computer Science(), vol 9983. Springer, Cham. https://doi.org/10.1007/978-3-319-47650-6_5
Download citation
DOI: https://doi.org/10.1007/978-3-319-47650-6_5
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-47649-0
Online ISBN: 978-3-319-47650-6
eBook Packages: Computer ScienceComputer Science (R0)