Recognition and Acquisition of Compound Names from Corpora

Nenadić, Goran; Spasić, Irena

doi:10.1007/3-540-45154-4_4

Goran Nenadić² &
Irena Spasić³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1835))

Included in the following conference series:

International Conference on Natural Language Processing

935 Accesses
1 Citations

Abstract

In this paper we will present an approach to acquisition of some classes of compound words from large corpora, as well as a method for semi-automatic generation of appropriate linguistic models, that can be further used for compound word recognition and for completion of compound word dictionaries. The approach is intended for a highly inflective language such as Serbo-Croatian. Generated linguistic models are represented by local grammars.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Burnard, L. et al: TEI Lite: An Introduction to Text Encoding for Interchange, doc. No: TEI U 5, June 1995
Google Scholar
Coates-Stephands, S.: The Analysis and Acquisition of Proper Names for Robust Text Understanding, PhD Thesis, Department of Computer Science,City University London, 1992
Google Scholar
Gross M., Perrin D. (eds.): Electronic Dictionaries and Automata in Computational Linguistics, Lecture Notes in Computer Science, Berlin, Springer Verlag, 110 p., 1989
Google Scholar
Gross M.: A Bootstrap Method for Construction Local Grammars, in Monograph on 125th anniversary of the Faculty of Mathematics, University of Belgrade, pp. 231–249, 1998
Google Scholar
Maier-Meyer P., Oesterle J.: Recognition of Noun-Phrases in German, in Actes des Premieres Journees INTEX, LADL, 1996
Google Scholar
Nenadić, G., Vitas, D.: Using Local Grammars for Agreement Modeling in Highly Inflective Languages, in Proc. of First Workshop on Text, Speech, Dialogue-TSD 98, Brno, 1998
Google Scholar
Nenadić, G., Vitas, D.: Formal Model of Noun Phrases in Serbo-Croatian, BULAG 23, Universite Franche-Compte, 1998
Google Scholar
Nenadić G., Spasić I.: The Acquisition of Some Lexical Constraints from Corpora, in Text, Speech and Dialogue-TSD’ 99, Lecture Notes in Artificial Intelligence 1692, Berlin, Springer Verlag, 1999
Google Scholar
Silberztein, M: INTEX: a Corpus Processing System, in Proc. of COLING 94, ACL, Tokyo, 1994
Google Scholar
Silberztein, M.: Dictionnaries électroniques et analyse automatique de textes: le systéme INTEX, Masson, Paris, 1993
Google Scholar
Spasić I.: Automatic Foreign Words Recognition in a Serbo-Croatian Scientific and Technical Texts, in Proc. of Conference on ”Terminology Standardization”, Serbian Academy of Arts and Sciences, 1996 (in Serbo-Croatian)
Google Scholar
Spasić I.: Natural Language Interface towards Relational Databases, MSc thesis, Faculty of Mathematics, University of Belgrade, 1999 (in Serbo-Croatian)
Google Scholar
Spasić I., Pavlović-Lažetić G.: Syntactic Structures in a Sublanguage of Serbian for Querying Relational Databases, in Proc. of Third European Conference on Formal Description of Slavic Languages FDSL-3, 1999
Google Scholar
Vitas, D.: Mathematical Model of Serbo-Croatian Morphology (Nominal Inflection), PhD thesis, Faculty of Mathematics, University of Belgrade, 1993 (in Serbo-Croatian)
Google Scholar
Vitas D., Krstev C.: Tuning the Text with an Electronic Dictionary, in Proc. of COMPLEX 96, Budapest, Hungarian Academy of Sciences, 1996
Google Scholar
Wakao T., Gaizauskas R., Wilks Y.: Evaluation of an Algorithm for the Recognition and Classification of Proper Names, in Proc. of the 16th International Conference on Computational Linguistics (COLING96), Copenhagen, pp. 418–423, 1996
Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Mathematics, University of Belgrade, Yugoslavia
Goran Nenadić
Faculty of Economics, University of Belgrade, Yugoslavia
Irena Spasić

Authors

Goran Nenadić
View author publications
You can also search for this author in PubMed Google Scholar
Irena Spasić
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Computer Engineering Department and Computer Technology Institute, University of Patras, 26500, Patras, Greece
Dimitris N. Christodoulakis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Nenadić, G., Spasić, I. (2000). Recognition and Acquisition of Compound Names from Corpora. In: Christodoulakis, D.N. (eds) Natural Language Processing — NLP 2000. NLP 2000. Lecture Notes in Computer Science(), vol 1835. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45154-4_4

Download citation

DOI: https://doi.org/10.1007/3-540-45154-4_4
Published: 25 May 2000
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67605-8
Online ISBN: 978-3-540-45154-9
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics