Skip to main content

Universal Computational Formalisms and Developer Environment for Rule-Based NLP

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 10761))

Abstract

The paper explores the issues of universal computational formalisms and reusable developer environment as applied to rule-based NLP. It suggests a portable grammar framework and modular NLP architecture that by combining certain modules can be reused for different unilingual and multilingual applications through a universal developer environment. The developer environment includes a lexicon shell with flexible settings to define, among others, tag descriptions, entry structures, depth of knowledge, and a number of compilers with universal rule-writing formalisms. The formalisms and compilers described have been successfully used (in different combinations and with different depth of analysis) in a number of unilingual and multilingual applications that involved English, Danish, French and Russian.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    It might not always be the case as there are applications that skip phrase chunking and work directly on lexicon tags.

References

  1. Camilleri, J.J.: An IDE for the grammatical framework. In: Proceedings of a Workshop on Free/Open-Source Rule-Based Machine Translation, Gothenburg, Sweden, pp. 1–12 (2012)

    Google Scholar 

  2. Feldman, A., Jirka, H., Brew, Ch.: A cross-language approach to rapid creation of new morpho-syntactically annotated resources. In: Proceedings of LREC 2006 (2006)

    Google Scholar 

  3. Joshi, A.K.: Unification and some new grammatical formalisms. In: Proceedings of the Workshop on Theoretical Issues in Natural Language Processing TINLAP 1987, pp. 45–50. Las Cruces, New Mexico. USA (1987)

    Google Scholar 

  4. Joshi, A.K, Srinivas, B.: Disambiguation of super parts of speech (or Supertags): almost parsing (1994). http://acl.ldc.upenn.edu/C/C94/C94-1024.pdf

  5. Neumann, C.: Generating patent claims in english from a Japanese-only interface. In: Proceedings of the Workshop on Patent Translation in conjunction with the MT Summit X. Phluket, Thailand, September 2005

    Google Scholar 

  6. Paul, M.: Translation knowledge recycling for related languages. In: Proceedings of MT Summit VIII. Santiago de Compostela, Galicia, Spain, 18–22 September 2001

    Google Scholar 

  7. Petrov, S., Das, D., McDonald, R.: A universal part-of-speech tagset. In: Proceedings of the Conference on Language Resources and Evaluation (LREC 2012). Istanbul, Turkey (2012)

    Google Scholar 

  8. Pereira, F.C.N., Shieber, S.M.: The semantics of grammar formalisms seen as computer languages. In: Proceedings of the Tenth International Conference on Computational Linguistics. Stanford, CA, USA (1984)

    Google Scholar 

  9. Pinkham, J., Corston-Oliver, M., Smets, M., Pettenaro, M.: Rapid assembly of a large-scale French-English MT system. In: Proceedings of MT Summit VIII. Santiago de Compostela, Galicia. Spain, 18–22 September 2001

    Google Scholar 

  10. Przepiórkowski, A.: A preliminary formalism for simultaneous rule-based tagging and partial parsing data structures for linguistic resources and applications. In: Rehm, G., Witt, A., Lemnitzer, L. (eds.) Tübingen:Gunter Narr Verlag, pp. 81–90 (2007)

    Google Scholar 

  11. Sharoff, S., Nivre, J.: The proper place of men and machines in language technology processing Russian without any linguistic knowledge. In: Proceedings of the International Conference on Computational Linguistics “Dialogue”- 2011. Russian State Humanitarian University, Moscow (2011)

    Google Scholar 

  12. Sharoff, S., Kopotev, M., Erjavec, T., Feldman, A., Divjak, D.: Designing and evaluating a Russian tagset. In: Proceedings of the Sixth Language Resources and Evaluation Conference, LREC 2008, Marrakech (2008)

    Google Scholar 

  13. Sheremetyeva, S.: Application adaptive electronic dictionary with intelligent interface. In: Proceedings of the Workshop on Enhancing and Using Electronic Dictionaries in Conjunction with the 20th International Conference on Computational Linguistics. COLING 2004, Geneva, Switzerland, 23–28 August 2004

    Google Scholar 

  14. Sheremetyeva, S.: On portability of resources for quick ramp-up of multilingual MT for patent claims. In: Proceedings of the workshop on Patent Translation in conjunction with MT Summit XI, Copenhagen, Denmark, 10–14 September 2007

    Google Scholar 

  15. Sheremetyeva, S.: On teaching technical writing with an authoring tool. In: Proceedings of the 10th International Technology, Education and Development Conference INTED 2016, Valencia, Spain, 7–9 March 2016

    Google Scholar 

  16. Takeda, K.: Portable knowledge sources for machine translation. In: Proceedings of the 15th International Conference on Computational Linguistics, Kyoto, Japan, 5–9 August 1994

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Svetlana Sheremetyeva .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Sheremetyeva, S. (2018). Universal Computational Formalisms and Developer Environment for Rule-Based NLP. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2017. Lecture Notes in Computer Science(), vol 10761. Springer, Cham. https://doi.org/10.1007/978-3-319-77113-7_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-77113-7_5

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-77112-0

  • Online ISBN: 978-3-319-77113-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics