Abstract
The paper represents new methodology of semantic analysis for physical effects extracting. This methodology is based on the Tuzov ontology that formally describes the Russian language. In this paper, semantic patterns were described to extract structural physical information in the form of physical effects. A new algorithm of text analysis was described. The approach is applied to the database of physical effects and to the patent texts. The results of the proposed method compared with the results of the IOFEE system that is used for the same tasks. The method described in this article allowed increasing efficiency of the physical effect elements extracting. The semantic analyzer based on the Tuzov ontology was created to increase the accuracy and completeness of the method.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Korobkin, D.M., Fomenkov, S.A., Kamaev, V.A., Fomenkova, M.A.: Multi-agent model of ontology- based extraction of physical effects descriptions from natural language text. In: Information Technologies in Science, Management, Social Sphere and Medicine (ITSMSSM), pp. 498–501. Atlantis Press (2016). doi:10.2991/itsmssm-16.2016.13
Pleshko, V.V., Ermakov, A.V.: Semantic interpretation in text analysis computer systems. J. Inf. Technol. (2009)
Taylor, J.: First Look – Attensity 5.5 decision, management, solutions. In: 5 International Conference on Information Technologies in Business and Industry (2017)
Krupka, G.R., Hausman, K.I.: Description of the NetOwl TM Extractor system as used for MUC-7. In: MUC-7 (1998)
Manning, C.D., Raghavan, P., Schütze, H.: Introduction to Information Retrieval. Cambridge University Press, Cambridge (2008)
Nivre, J., Hall, J., Nilsson, J.: MaltParser: a data-driven parser-generator for dependency parsing. In: Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC), Genoa, Italy, pp. 2216–2219 (2006)
Tukeyev, U.A., Melby, A.K., Zhumanov, Z.M.: Models and algorithms of translation of the Kazakh language sentences into English language with use of link grammar and the statistical approach. In: IV Congress of the Turkic World Math. Society, Baku (2014)
Azarova, I.: The matching of AGFL subcategories to Russian lexical and grammatical groupings. In: Proceedings of the Second AGFL Workshop on Syntactic Description and Processing of Natural Language (2002)
Ogogrodnik, P.B., Serebryannaya, L.V.: Text analysis with Tomita parser. In: International Conference (BSUIR), Minsk, pp. 230–231 (2014)
Tuzo, V.A.: Computer Linguistics. St. Petersburg State University Press, St. Petersburg (1998)
Korobkin, D., Fomenkov, S., Kolesnikov, S., Lobeyko, V., Golovanchikov, A.: Modification of physical effect model for the synthesis of the physical operation principles of technical system. In: Kravets, A., Shcherbakov, M., Kultsova, M., Shabalina, O. (eds.) CIT&DS 2015. CCIS, vol. 535, pp. 368–378. Springer, Cham (2015). doi:10.1007/978-3-319-23766-4_29
Fomenkov, S., Korobkin, D., Kolesnikov, S.: Method of ontology-based extraction of physical effect description from Russian text. In: Kravets, A., Shcherbakov, M., Kultsova, M., Iijima, T. (eds.) JCKBSE 2014. CCIS, vol. 466, pp. 321–330. Springer, Cham (2014). doi:10.1007/978-3-319-11854-3_27
Fomenkov, S.A., Kolesnikov, S.G., Korobkin, D.M., Kamaev, V.A., Orlova, Y.A.: The information filling of the database by physical effects. J. Eng. Appl. Sci. 9, 422–426 (2014)
Patent US 20100051095 A1 “Hybrid Photovoltaic Cell Using Amorphous Silicon Germanium Absorbers With Wide Bandgap Dopant Layers and an Up-Converter”
Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. J. Inf. Process. Manag. 24(5), 513–523 (1988)
U.S. Patent Grant Data/XML v4.3. http://www.uspto.gov/sites/default/files/products/Patent_Grant_XML_v4.3.pdf
Zlotin, B., Zusman, A.: Directed Evolution: Philosophy Theory and Practice. Ideation International, Southfield (2001)
Fey, V., Rivin, E.: Innovation on Demand: New Product Development Using TRIZ. Cambridge University Press, Cambridge (2005)
Blei, D.M.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3(4–5), 993–1022 (2003)
Korobkin, D.M., Fomenkov, S.A., Kravets, A.G., Golovanchikov, A.B.: Patent data analysis system for information extraction tasks. In: 13th International Conference on Applied Computing (AC), pp. 215–219. IADIS (2016)
Toutanova, K., Manning, C.D.: Enriching the knowledge sources used in a maximum entropy part-of-speech tagger. In: EMNLP 2000, Hong Kong, pp. 63–70 (2000)
Hall, J.: MaltParser – An Architecture for Inductive Labeled Dependency Parsing. University of Colorado (2006)
Mel’čuk, I.A.: Dependency Syntax: Theory and Practice. SUNY Publication, New York (1988)
Stanford typed dependencies manual. https://nlp.stanford.edu/software/dependencies_manual.pdf
Korobkin, D.M., Fomenkov, S.A., Kolesnikov, S.G., Voronin, Y.F.: System of physical effects extraction from natural language text in the internet. J. World Appl. Sci. J. 24, 55–61 (2013)
Dvoryankin, A.M., Polovinkin, A.I., Sobolev, A.N.: Automating the search for operation principles of technical systems on the basis of a bank of physical phenomena. J. Cybern. 14, 79–86 (1978)
Arel, E.: Goldfire Innovator. Volume II: Patents and Innovation Trend Analysis User Guide. Invention Machine Corporation, Boston (2004)
Acknowledgement
This research was partially supported by the Russian Fund of Basic Research (grants No. 15-07-09142 A, No.15-07-06254 A, No. 16-07-00534 A).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Fomenkova, M., Korobkin, D., Fomenkov, S. (2017). Extraction of Physical Effects Based on the Semantic Analysis of the Patent Texts. In: Kravets, A., Shcherbakov, M., Kultsova, M., Groumpos, P. (eds) Creativity in Intelligent Technologies and Data Science. CIT&DS 2017. Communications in Computer and Information Science, vol 754. Springer, Cham. https://doi.org/10.1007/978-3-319-65551-2_6
Download citation
DOI: https://doi.org/10.1007/978-3-319-65551-2_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-65550-5
Online ISBN: 978-3-319-65551-2
eBook Packages: Computer ScienceComputer Science (R0)