Creation of Ontological Knowledge Bases in the Semantic Web by Analyzing Table Structures

Part of the Studies in Computational Intelligence book series (SCI, volume 941)


The active development of the Semantic Web initiative to create expressive models for representing knowledge distributed on the Web in the form of ontologies raises a number of problems associated with the development of information structures of ontological knowledge bases for automatic processing of data and knowledge. The subject of the research is the ontological knowledge base and methods of their formation within the framework of the Semantic Web. Moreover, various tabular structures are considered as sources of knowledge. The problem arises as a result of the contradiction between the wide variety of tabular structures used to organize the content of knowledge sources in a hypermedia environment and the insufficient efficiency of classical methods for analyzing sources of this type. In the course of research, this problem was decomposed into a number of tasks:
  • analysis of existing approaches to the formation of ontological knowledge bases based on the sources of tabular structures;

  • development of a formal model of ontological knowledge bases;

  • development of a method for the formation of databases of ontological knowledge based on targeted enumeration and its mathematical support;

  • development of a formal model of the sources of knowledge of table structures;

  • development of a method for analyzing the sources of knowledge of tabular structures based on targeted enumeration and its mathematical support;

  • development of a method for generating instances of objects of subject areas based on knowledge sources of tabular structures and its mathematical support;

  • application of the developed methods for the implementation of a set of software tools for the formation of ontological knowledge bases.

In the course of solving the first of the stated tasks, it was found that historically the first was the approach to the formation of ontological knowledge bases based on the methods of structural analysis. The effectiveness of methods of this kind is limited by the small number of tabular structures analyzed and the inconsistency of interpretation of the structural components of the knowledge sources of tabular structures and their visual representation. The need to solve the problem of creating ontological knowledge bases based on the sources of tabular structures, characterized by a high level of complexity of organizing the contents of these sources, has led to the emergence of a new generation of intelligent methods for forming ontological knowledge bases based on top-level ontologies. The approach to the formation of ontological knowledge bases on the basis of upper-level ontologies involves the formation of such bases in accordance with the terminology defined in the upper-level ontology. Thus, the boundaries of the presentation of the components of domain objects, in contrast to structural analysis, are found as a possible combination of terms defined in the ontology by calculating measures of semantic similarity. This approach allows you to build procedures for obtaining new knowledge, abstracting from the method and format of storing the contents of structured sources of knowledge. The methodological basis of research includes the ideas and principles of artificial intelligence, elements of the hypertext technologies of the Semantic Web, tools for knowledge engineering, in particular ontological engineering. Experimental studies were carried out on test examples and on real sources of knowledge of tabular structures in the form of documents that are widely used in the Semantic Web environment for organizing workflows. The implementation of the theoretical results of the study in the form of algorithmic, mathematical support, as well as experimental studies conducted to determine the upper bound and the nature of the growth of complexity of the method of forming the ontological knowledge bases based on targeted enumeration, confirm the validity of the hypothesis adopted at the beginning.


Semantic web Ontological knowledge bases Tabular structures Organizing workflows Hypermedia environment 


  1. 1.
    Berners-Lee, T., Handler, J., Lassila, O.: The Semantic Web. Sci. Am. 284(5), 34–43 (2001)CrossRefGoogle Scholar
  2. 2.
    Shostak, I., Volobuyeva, L., Danova, M.: Ontology based approach for green software ecosystem formalization. In: Abstracts of the DEpendable Systems, SERvices and Technologies—DESSERT’2018, IEEE Ukraine Section, Kyiv 24–27 May 2018Google Scholar
  3. 3.
    Shostak, I., et al.: Ontological approach to the construction of multi-agent systems for the maintenance supporting processes of production equipment. In: Abstracts of the IEEE International Scientific and Practical Conference «Problems of Infocommunications. Science and Technology» (PICS&T-2018), Kharkiv, 9–12 Oct 2018, pp 209–214 (2018)Google Scholar
  4. 4.
    Pavlenko, V., et al.: Information support for business processes on the virtual enterprises with the use of multi-agent technologies. In: Abstracts of the DEpendable Systems, SERvices and Technologies—DESSERT’2018, IEEE Ukraine Section, Kyiv 24–27 May 2018Google Scholar
  5. 5.
    Noy, N.F., et al.: The knowledge model of protégé-2000: combining interoperability and flexibility. In: 2th International Abstracts of the Conference Knowledge Engineering and Knowledge Management, Springer, Juan-les-Pins, pp. 17–32 (2000)Google Scholar
  6. 6.
    Intelligent, A.H.: E-Buisness: from technology to value. IEEE Intell. Syst. 16(4), 8–10 (2001)CrossRefGoogle Scholar
  7. 7.
    Bast, R.: Learning the business of business. IEEE Intell. Syst. 16(4), 4–7 (2001)Google Scholar
  8. 8.
    Hendler, J., Berners-Lee, T., et al.: Integrating applications on the semantic web. J. Inst. Electr. Eng. Jap 122(10), 676–680 (2002)Google Scholar
  9. 9.
    Noy, N., Sintek, M., Decker, S., et al.: Creating semantic web contents with protege-2000. IEEE Intell. Syst. 2(16), 60–71 (2001)CrossRefGoogle Scholar
  10. 10.
    Rector. A.L.: Modularization of domain ontologies implemented in description logics and related formalisms including OWL. In: Abstracts of the 2nd International Conference on Knowledge Capture, Sanibel Island (USA): ACM Press, pp. 51–59 (2003)Google Scholar
  11. 11.
    Embley, D.W., Campbell, D.M., Jiang, Y.S., et al.: Conceptual-model-based data extraction from multiple-record web data. Data Knowl. Eng. 31(3), 227–251 (1999)CrossRefGoogle Scholar
  12. 12.
    Lopresti, D., Nagy, G.A.: Tabular survey of automated table processing. In: Proceedings of the Third IAPR Workshop on Graphics Recognition, Jaipur (India), pp 93–120. Springer, Berlin/Heidelberg (2000)Google Scholar
  13. 13.
    Hammer, J., Garcia-Molina, H., Cho, J., et al.: Extracting semistructured information from the Web. In: Proceeding of the Workshop on Management of Semistructured Data, Tucson (USA), p 50. AIII Press/MIT Press (1997)Google Scholar
  14. 14.
    Yourdon, E.: Modern Structured Analysis. Yourdon Press/Prentice Hall, N.J. (1989)Google Scholar
  15. 15.
    Liddle, S., Embley, D.W., Yau, D.S.: Extracting data behind web forms. In: Proceeding of the Joint Workshop on Conceptual Modelling Approaches for E-business: A Web Service Perspective, Tampere (Finland), pp 38–49. Springer, Berlin (2002)Google Scholar
  16. 16.
    Cowie, J., Lehnert, W.: Information extraction. Commun. ACM 39(1), 80–91 (1996)CrossRefGoogle Scholar
  17. 17.
    Biscup, J., Embley, D.W.: Extraction information from heterogeneous information sources using ontologically specified target views. Inf. Syst. 28(3), 169–212 (2003)CrossRefGoogle Scholar
  18. 18.
    Gordijn, J., Akkermans, H.: Designing and evaluating e-business models. IEEE Intell. Syst. 16(4), 11–18 (2001)CrossRefGoogle Scholar
  19. 19.
    Hu. J., Kashi, R., Lopresti, D., et al: Why table ground-truthing is hard. In: Proceeding of the 6th International Conference on Document Analysis and Recognition, Washington (USA), pp. 129–133. IEEE Computer Society (2001)Google Scholar
  20. 20.
    Kuznetsov, A., et al.: Performance of hash algorithms on GPUs for use in blockchain. In: 2019 IEEE International Conference on Advanced Trends in Information Theory, ATIT 2019 – Proceedings Kyiv, Ukraine, pp. 166–170 (2019).
  21. 21.
    Green, E., Krishnamoorthy, M.: Model-Based analysis of printed table. In: Proceeding IAPR International Conference on Document Analysis & Recognition III. Montreal (Canada), pp. 80–91. Springer, Berlin (1995)Google Scholar

Copyright information

© Springer Nature Switzerland AG 2021

Authors and Affiliations

  1. 1.International E-Commerce and Hotel and Restaurant Business DepartmentV.N. Karazin Kharkiv National UniversityKharkivUkraine
  2. 2.Department of Software EngineeringNational Aerospace University “KhAI”KharkivUkraine
  3. 3.Department of ManagementNational Aerospace University “KhAI”KharkivUkraine

Personalised recommendations