Skip to main content

Open Information Extraction for Mongolian Language

  • Conference paper
  • First Online:

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 157))

Abstract

In this paper, we describe MongoIE, an Open Information Extraction (Open IE) system for the Mongolian language. We present the characteristic of the language and, after analyzing the available preprocessing tools, we describe the features used for building the system. We have implemented two different approaches: (1) Rule-based and (2) Classification. Here, we describe them, analyze their errors and present their results. In the best of our knowledge, this is the first attempt in building Open IE systems for Mongolian. We conclude by suggesting possible future improvements and directions.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    One of the autonomous regions of China.

  2. 2.

    http://172.104.34.197/brat//np-chunk/test2.

  3. 3.

    http://milab.num.edu.mn/.

  4. 4.

    Available at: https://bit.ly/2nClF3q.

References

  1. Michele Banko, O.E.: The tradeoffs between open and traditional relation extraction. In: Proceedings of the ACL-08: HLT (2008)

    Google Scholar 

  2. Horn, C., Zhila, A., Gelbukh, A., Kern, R., Lex, E.: Using factual density to measure informativeness of web documents. In: Proceedings of the 19th Nordic Conference on Computational Linguistics (2013)

    Google Scholar 

  3. Mausam, Schmitz, M., Soderland, S., Bart, R., Etzioni, O.: Open language learning for information extraction. In: Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (2012)

    Google Scholar 

  4. Lin, T., Mausam, Etzioni, O.: Identifying functional relations in web text. In: Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing (2010)

    Google Scholar 

  5. Bird, S., Loper, E., Klein, E.: In: Natural Language Processing with Python. O’Reilly Media Inc (2009)

    Google Scholar 

  6. Helmut, S.: In: Improvements in Part-of-Speech Tagging with an Application to German, pp. 13–25. Springer, Netherlands, Dordrecht (1999)

    Google Scholar 

  7. Sangha, N., Younggyun, N., Sejin, N., Key-Sun, C.: SRDF: Korean open information extraction using singleton property. In: Proceedings of the 14th International Semantic Web Conference (2015)

    Google Scholar 

  8. Sidorov, G., Velasquez, F., Stamatatos, E., Gelbukh, A., Chanona-Hern\(\acute{\text{a}}\)ndez, L.: Syntactic dependency-based n-grams as classification features. In: Gonzalez-Mendoza, M., Batyrshin, I. (eds.) Advances in Computational Intelligence. Proceedings of MICAI 2012 (2012)

    Google Scholar 

  9. Sidorov, G., Velasquez, F., Stamatatos, E., Gelbukh, A., Chanona-Hern\(\acute{\text{ a }}\)ndez, L.: Syntactic dependency-based n-grams: more evidence of usefulness in classification. In: Gelbukh, A. (ed.) Computational Linguistics and Intelligent Text Processing. Proceedings of International Conference on Intelligent Text Processing and Computational Linguistics, CICLing 2013 (2013)

    Chapter  Google Scholar 

  10. Bayartsatsral, C., Altangerel, C.: Annotating noun phrases for Mongolian language and using it in machine learning. In: Proceedings of the Mongolian Information Technology—2018, Ulaanbaatar, Udam Soyol, pp. 12–15 (2018)

    Google Scholar 

  11. Davidov, D., Rappoport, A.: Unsupervised discovery of generic relationships using pattern clusters and its evaluation by automatically generated sat analogy questions. In: Proceedings of the ACL-08 (2008)

    Google Scholar 

  12. Fader, A., Soderland, S., Etzioni, O.: Identifying relations for open information extraction. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP’11 (2011)

    Google Scholar 

  13. Alisa, Z., Alexander, G.: Open information extraction for Spanish language based on syntactic constraints. In: Proceedings of the ACL2014 Student Research Workshop, Baltimore, Maryland, USA, pp. 78–85 (2014)

    Google Scholar 

  14. Gamallo, P., Garcia, M., Fern\(\acute{\text{ a }}\)ndez-Lanza, S.: Dependency-based open information extraction. In: Proceedings of the Joint Workshop on Unsupervised and SemiSupervised Learning in NLP, ROBUS-UNSUP ’12 (2012)

    Google Scholar 

  15. Van Durme, B., Schubert, L.: Open knowledge extraction using compositional language processing. In: Proceedings of the STEP ’08 Proceedings of the 2008 Conference on Semantics in Text Processing (2008)

    Google Scholar 

  16. Michele, B., Michael, J.C., Stephan, S., Matt, B., Oren, E.: Open information extraction from the web. In: Proceedings of the Twentieth International Joint Conference on Artificial Intelligence (2007)

    Google Scholar 

  17. Wu, F., Weld, D.S.: Open information extraction using wikipedia. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, ACL ’10 (2010)

    Google Scholar 

Download references

Acknowledgements

This work was supported by Ernst Mach-Stipendien (Eurasia-Pacific Uninet) grant funded by The Austrian Agency for International Cooperation in Education and Research (OeAD-GmbH), and Centre for International Cooperation and Mobility (ICM).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ganchimeg Lkhagvasuren .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Lkhagvasuren, G., Rentsendorj, J. (2020). Open Information Extraction for Mongolian Language. In: Pan, JS., Li, J., Tsai, PW., Jain, L. (eds) Advances in Intelligent Information Hiding and Multimedia Signal Processing. Smart Innovation, Systems and Technologies, vol 157. Springer, Singapore. https://doi.org/10.1007/978-981-13-9710-3_31

Download citation

Publish with us

Policies and ethics