Skip to main content

Extracting Frame-Like Structures from Google Books NGram Dataset

  • Conference paper
Human-Inspired Computing and Its Applications (MICAI 2014)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8856))

Included in the following conference series:

  • 1738 Accesses

Abstract

We propose a method that facilitates a process of semi-automatic FrameNet construction. The method requires Google Books NGram dataset and WordNet or another thesaurus for a particular language. We evaluated the method for Russian ngrams. Due to a huge amount of available data the method does not require sophisticated natural language processing techniques (e.g. for word sense disambiguation), and it shows a promising result.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Baker, C.F., Fillmore, C.J., Cronin, B.: The structure of the framenet database. International Journal of Lexicography 16(3), 281–296 (2003)

    Article  Google Scholar 

  2. Castro-Sánchez, N.A., Sidorov, G.: Analysis of definitions of verbs in an explanatory dictionary for automatic extraction of actants based on detection of patterns. In: Hopfe, C.J., Rezgui, Y., Métais, E., Preece, A., Li, H. (eds.) NLDB 2010. LNCS, vol. 6177, pp. 233–239. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  3. Koeva, S.: Lexicon and grammar in bulgarian framenet. In: LREC (2010)

    Google Scholar 

  4. Lin, Y., Michel, J.-B., Aiden, E.L., Orwant, J., Brockman, W., Petrov, S.: Syntactic annotations for the google books ngram corpus. In: Proceedings of the ACL 2012 System Demonstrations, pp. 169–174. Association for Computational Linguistics (2012)

    Google Scholar 

  5. Loukachevitch, N., Dobrov, B.: Ruthes linguistic ontology vs. russian wordnets. In: Proceedings of Global WordNet Conference GWC-2014, Tartu (2014)

    Google Scholar 

  6. Lyashevskaya, O.: Dictionary of valencies meets corpus annotation: A case of russian framebank. Proceedings of EURALEX 15 (2012)

    Google Scholar 

  7. Palmer, M., Gildea, D., Kingsbury, P.: The proposition bank: An annotated corpus of semantic roles. Computational linguistics 31(1), 71–106 (2005)

    Article  Google Scholar 

  8. Kochetkova, N.A., Klyshinsky, E.S.: A method of automatic generating of russian verb subordination models. In: Proceedings of the In XII National Conference of Artificial Intelligence (2013)

    Google Scholar 

  9. Schuler, K.K.: Verbnet: A broad-coverage, comprehensive verb lexicon (2005)

    Google Scholar 

  10. Sidorov, G.: Syntactic dependency based n-grams in rule based automatic english as second language grammar correction. International Journal of Computational Linguistics and Applications 4(2), 169–188 (2013)

    Google Scholar 

  11. Sidorov, G., Velasquez, F., Stamatatos, E., Gelbukh, A., Chanona-Hernández, L.: Syntactic n-grams as machine learning features for natural language processing. Expert Systems with Applications 41(3), 853–860 (2014)

    Article  Google Scholar 

  12. Tonelli, S., Pianta, E.: Frame information transfer from english to italian. In: LREC (2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Ivanov, V. (2014). Extracting Frame-Like Structures from Google Books NGram Dataset. In: Gelbukh, A., Espinoza, F.C., Galicia-Haro, S.N. (eds) Human-Inspired Computing and Its Applications. MICAI 2014. Lecture Notes in Computer Science(), vol 8856. Springer, Cham. https://doi.org/10.1007/978-3-319-13647-9_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-13647-9_3

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-13646-2

  • Online ISBN: 978-3-319-13647-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics