Skip to main content

Recognition and Acquisition of Compound Names from Corpora

  • Conference paper
  • First Online:
Natural Language Processing — NLP 2000 (NLP 2000)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1835))

Included in the following conference series:

Abstract

In this paper we will present an approach to acquisition of some classes of compound words from large corpora, as well as a method for semi-automatic generation of appropriate linguistic models, that can be further used for compound word recognition and for completion of compound word dictionaries. The approach is intended for a highly inflective language such as Serbo-Croatian. Generated linguistic models are represented by local grammars.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Burnard, L. et al: TEI Lite: An Introduction to Text Encoding for Interchange, doc. No: TEI U 5, June 1995

    Google Scholar 

  2. Coates-Stephands, S.: The Analysis and Acquisition of Proper Names for Robust Text Understanding, PhD Thesis, Department of Computer Science,City University London, 1992

    Google Scholar 

  3. Gross M., Perrin D. (eds.): Electronic Dictionaries and Automata in Computational Linguistics, Lecture Notes in Computer Science, Berlin, Springer Verlag, 110 p., 1989

    Google Scholar 

  4. Gross M.: A Bootstrap Method for Construction Local Grammars, in Monograph on 125th anniversary of the Faculty of Mathematics, University of Belgrade, pp. 231–249, 1998

    Google Scholar 

  5. Maier-Meyer P., Oesterle J.: Recognition of Noun-Phrases in German, in Actes des Premieres Journees INTEX, LADL, 1996

    Google Scholar 

  6. Nenadić, G., Vitas, D.: Using Local Grammars for Agreement Modeling in Highly Inflective Languages, in Proc. of First Workshop on Text, Speech, Dialogue-TSD 98, Brno, 1998

    Google Scholar 

  7. Nenadić, G., Vitas, D.: Formal Model of Noun Phrases in Serbo-Croatian, BULAG 23, Universite Franche-Compte, 1998

    Google Scholar 

  8. Nenadić G., Spasić I.: The Acquisition of Some Lexical Constraints from Corpora, in Text, Speech and Dialogue-TSD’ 99, Lecture Notes in Artificial Intelligence 1692, Berlin, Springer Verlag, 1999

    Google Scholar 

  9. Silberztein, M: INTEX: a Corpus Processing System, in Proc. of COLING 94, ACL, Tokyo, 1994

    Google Scholar 

  10. Silberztein, M.: Dictionnaries électroniques et analyse automatique de textes: le systéme INTEX, Masson, Paris, 1993

    Google Scholar 

  11. Spasić I.: Automatic Foreign Words Recognition in a Serbo-Croatian Scientific and Technical Texts, in Proc. of Conference on ”Terminology Standardization”, Serbian Academy of Arts and Sciences, 1996 (in Serbo-Croatian)

    Google Scholar 

  12. Spasić I.: Natural Language Interface towards Relational Databases, MSc thesis, Faculty of Mathematics, University of Belgrade, 1999 (in Serbo-Croatian)

    Google Scholar 

  13. Spasić I., Pavlović-Lažetić G.: Syntactic Structures in a Sublanguage of Serbian for Querying Relational Databases, in Proc. of Third European Conference on Formal Description of Slavic Languages FDSL-3, 1999

    Google Scholar 

  14. Vitas, D.: Mathematical Model of Serbo-Croatian Morphology (Nominal Inflection), PhD thesis, Faculty of Mathematics, University of Belgrade, 1993 (in Serbo-Croatian)

    Google Scholar 

  15. Vitas D., Krstev C.: Tuning the Text with an Electronic Dictionary, in Proc. of COMPLEX 96, Budapest, Hungarian Academy of Sciences, 1996

    Google Scholar 

  16. Wakao T., Gaizauskas R., Wilks Y.: Evaluation of an Algorithm for the Recognition and Classification of Proper Names, in Proc. of the 16th International Conference on Computational Linguistics (COLING96), Copenhagen, pp. 418–423, 1996

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2000 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Nenadić, G., Spasić, I. (2000). Recognition and Acquisition of Compound Names from Corpora. In: Christodoulakis, D.N. (eds) Natural Language Processing — NLP 2000. NLP 2000. Lecture Notes in Computer Science(), vol 1835. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45154-4_4

Download citation

  • DOI: https://doi.org/10.1007/3-540-45154-4_4

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-67605-8

  • Online ISBN: 978-3-540-45154-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics