Iterative Approach for Information Extraction and Ontology Learning from Textual Aviation Safety Reports

  • Lama SaeedaEmail author
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10250)


Textual aviation safety reports are one of the main resources that contain valuable information to understand incidents and accidents in a high-risk industry such as the aviation domain. The reporting process, hence, is essential to provide these reports. Most of the time, the reporting process is done manually, and typically, poorly structured data are provided by the reporters. Automated content analysis for these reports has attracted researchers to extract the required information to perform many tasks, and they used several techniques to achieve it. Ontologies provide formal and explicit specifications of conceptualizations and play a crucial role in the information extraction process. In this paper, we propose a novel iterative ontology-based approach of information extraction and semantic annotations for aviation safety reports and augmenting back the aviation safety ontology with new concepts and relations depending on the terms already annotated in the discovered report model.


Information extraction Domain ontology Ontology learning Safety reports 



I want to thank Dr. Petr Křemen for his support in accomplishing this proposal. Furthermore, this work was partially supported by grants No. TA04030465 Research and development of progressive methods for measuring aviation organization’s safety performance of the Technology Agency of the Czech Republic, No. SGS16/229/OHK3/3T/13 Supporting ontological data quality in information systems of the Czech Technical University in Prague.


  1. 1.
    Switzer, J., Khan, L., Bin Muhaya, F.: Subjectivity classification and analysis of the ASRS corpus. In: 2011 IEEE International Conference on Information Reuse & Integration (2011)Google Scholar
  2. 2.
    Tanguy, L., Tulechki, N., Urieli, A., Hermann, E., Raynal, C.: Natural language processing for aviation safety reports: from classification to interactive analysis. Comput. Ind. 78, 80–95 (2016)CrossRefGoogle Scholar
  3. 3.
    Williams, T., Betak, J., Findley, B.: Text mining analysis of railroad accident investigation reports. In: 2016 Joint Rail Conference (2016)Google Scholar
  4. 4.
    Tixier, A.J.-P., Hallowell, M.R., Rajagopalan, B., Bowman, D.: Automated content analysis for construction safety: a natural language processing system to extract precursors and outcomes from unstructured injury reports. Autom. Constr. 62, 45–56 (2016)CrossRefGoogle Scholar
  5. 5.
    Chokor, A., Naganathan, H., Chong, W.K., Asmar, M.E.: Analyzing Arizona OSHA injury reports using unsupervised machine learning. Procedia Eng. 145, 1588–1593 (2016)CrossRefGoogle Scholar
  6. 6.
    Dou, D., Wang, H., Liu, H.: Semantic data mining: a survey of ontology-based approaches. In: Proceedings of the 2015 IEEE 9th International Conference on Semantic Computing (IEEE ICSC 2015) (2015)Google Scholar
  7. 7.
    Persing, I., Ng, V.: Semi-supervised cause identification from aviation safety reports. In: Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, ACL-IJCNLP 2009, vol. 2 (2009)Google Scholar
  8. 8.
    Tirunagari, S.: Data mining of causal relations from text: analysing maritime accident investigation reports. arXiv preprint arXiv:1507.02447 (2015)
  9. 9.
    Fujita, K., Akiyama, M., Park, K., (Nakagami) Yamaguchi, E., Furukawa, H.: Linguistic analysis of large-scale medical incident reports for patient safety. In: MIE, pp. 250–254 (2012)Google Scholar
  10. 10.
    Sfakianaki, P., Koumakis, L., Sfakianakis, S., Iatraki, G., Zacharioudakis, G., Graf, N., Marias, K., Tsiknakis, M.: Semantic biomedical resource discovery: a natural language processing framework. BMC Med. Inform. Decis. Mak. 15 (2015)Google Scholar
  11. 11.
    Wimalasuriya, D.C., Dou, D.: Ontology-based information extraction: an introduction and a survey of current approaches. J. Inf. Sci. 36, 306–323 (2010)CrossRefGoogle Scholar
  12. 12.
    Henriksson, A., Kvist, M., Dalianis, H., Duneld, M.: Identifying adverse drug event information in clinical notes with distributional semantic representations of context. J. Biomed. Inf. 57, 333–349 (2015)CrossRefGoogle Scholar
  13. 13.
    Rios-Alvarado, A., Lopez-Arevalo, I.: Ontology learning from text: method for learning axioms. Technical report (2012)Google Scholar
  14. 14.
    Rios-Alvarado, A.B., Lopez-Arevalo, I., Tello-Leal, E., Sosa-Sosa, V.J.: An approach for learning expressive ontologies in medical domain. J. Med. Syst. 39 (2015)Google Scholar
  15. 15.
    Barbantan, I., Porumb, M., Lemnaru, C., Potolea, R.: Feature engineered relation extraction - medical documents setting. Int. J. Web Inf. Syst. 12, 336–358 (2016)CrossRefGoogle Scholar
  16. 16.
    David Sanchez, R.: Domain ontology learning from the web. Ph.D. thesis, Universitat Politecnicade Catalunya (2007)Google Scholar
  17. 17.
    Chong, M., Abraham, A., Paprzycki, M.: Traffic accident analysis using machine learning paradigms. Informatica 29(1) (2005)Google Scholar
  18. 18.
    Vittek, P., Lališ, A., Stojic, S., Plos, V.: Challenges of implementation and practical deployment of aviation safety knowledge management software. In: Communications in Computer and Information Science Knowledge Engineering and Semantic Web, pp. 316–327 (2016)Google Scholar
  19. 19.
    Kostov, B., Ahmad, J., Křemen, P.: Towards ontology-based safety information management in the aviation industry. In: 13th International Conference, IESD (2016)Google Scholar
  20. 20.
    Saeeda, L., Křemen, P.: Text analyzing of aviation safety reports. WIKT & Data a Znalosti (2016)Google Scholar
  21. 21.
    Žáková, M., Železný, F., Garcia-Sedano, J.A., Tissot, C.M., Lavrač, N., Křemen, P., Molina, J.: Relational data mining applied to virtual engineering of product designs. In: Muggleton, S., Otero, R., Tamaddoni-Nezhad, A. (eds.) ILP 2006. LNCS (LNAI), vol. 4455, pp. 439–453. Springer, Heidelberg (2007). doi: 10.1007/978-3-540-73847-3_39CrossRefGoogle Scholar
  22. 22.
    Wong, M.K., et al.: A multi-phase correlation search framework for mining non-taxonomic relations from unstructured text. Knowl. Inf. Syst. 38(3), 641–667 (2012)CrossRefGoogle Scholar
  23. 23.
    Reyes, J.A., Montes, A.: Learning discourse relations from news reports: an event-driven approach. IEEE Lat. Am. Trans. 14(1), 356–363 (2016)CrossRefGoogle Scholar

Copyright information

© Springer International Publishing AG 2017

Authors and Affiliations

  1. 1.Faculty of Electrical EngineeringCzech Technical University in PraguePragueCzech Republic

Personalised recommendations