Advertisement

Outline of a Model for Lexical Databases

  • Nancy Ide
  • Jacques Le Maitre
  • Jean Véronis
Chapter
Part of the Linguistica Computazionale book series (LICO, volume 9)

Abstract

In this paper we show that previously applied data models are inadequate for lexical databases. In particular, we show that relational data models, including unnormalized models which allow the nesting of relations, cannot fully capture the structural properties of lexical information. We propose an alternative feature-based model which allows for a full representation of sense nesting and defines a factoring mechanism that enables the elimination of redundant information. We then demonstrate that feature structures map readily to an object-oriented data model and show how our model can be implemented in an object-oriented DBMS.

Keywords

Dictionary Entry Disjunctive Normal Form Computational Linguistics Oxford English Dictionary Lexical Database 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Abiteboul, S., Bidoit, N. (1984). Non first normal form relations to represent hierarchically organized data. Proceedings of the ACM SIGACT/SIGMOD Symposium on Principles of Database Systems. Waterloo, Ontario, 191–200.Google Scholar
  2. Amsler, R. A. (1980). The structure of the Merriam-Webster Pocket Dictionary. Doctoral dissertation, University of Texas at Austin.Google Scholar
  3. Amsler, R. A., Tompa, F. W. (1988). An SGML-based standard for English monolingual dictionaries. Proceedings of the 4th Annual Conference of the UW Centre for the New Oxford English Dictionary. Waterloo, Ontario, 61–80.Google Scholar
  4. Bancilhon, F., Fortin, D., Gamersan, S., D., Verroust, A. (1983). VERSO: A relational backend database machine. In D. K. Hsiao (Ed.), Advanced Database Machine Architecture. Englewood Cliffs: Prentice Hall.Google Scholar
  5. Boguraev, B., Neff, M. S. (in press). From machine readable dictionaries to lexical databases. International Journal of Lexicography. Google Scholar
  6. Boucelma, O., Le Maitre, J. (1991). An extensible functional query language for an object-oriented database system. In C. Delobel, M. Kifer, Y. Masunaga (Eds.), Deductive and Object-Oriented Databases. Lecture Notes in Computer Science, Berlin: Springer Verlag.Google Scholar
  7. Brustkern, J., Hess K. (1982). The BONNLEX lexicon system. In J. Goetschalckx L. Rolling (Eds.), Lexicography in the Electronic Age. Amsterdam: North-Holland.Google Scholar
  8. Byrd, R. J., Calzolari, N., Chodorow, M. S., Klavans, J. L., Neff, M. S., Rizk, O. Tools and methods for computational linguistics. Computational Linguistics,13(3/4), 219–240.Google Scholar
  9. Calzolari, N. (1984). Detecting patterns in a lexical data base. Proceedings of the 10th International Conference on Computational Linguistics, COLING’84. Stanford, California, 170–173.Google Scholar
  10. Calzolari, N., Peters, C., Roventini, A. (1990). Computational Model of the Dictionary Entry. (ACQUILEX Preliminary Report, Esprit Basic Research Action No. 3030 ). Pisa, Italy: Istituto di Linguistica Computazionale.Google Scholar
  11. Coombs, J. H., Renear, A. H., DeRose, S. J. (1987). Markup systems and the future of scholarly text processing. Communications of the ACM, 30 (11), 933–934CrossRefGoogle Scholar
  12. Deux, O. et al. (1991). The 02 System. Communications of the ACM, 34 (10), 34–35CrossRefGoogle Scholar
  13. Gardarin, G., Valduriez, P. (1990). SGBD Avancés, Bases de Données Objets, Déductives, Réparties. Paris: Eyrolles.Google Scholar
  14. Gonnet, G., Tompa, F. W. (1987). Mind your grammar: a new approach to modelling text. Proceedings of the 13th Conference on Very Large Data Bases, VLDB’87. Brighton, England, 339–346.Google Scholar
  15. Gonnet, G., Baeza-Yates, R. A., Snider, T. (1991). Lexicographical indices for text: Inverted files vs. PAT trees (Technical report OED-91–01). Waterloo, Ontario: UW Centre for the New Oxford English Dictionary and Text Research.Google Scholar
  16. Ide, N., Véronis, J., Warwick-Armstrong, S., Calzolari, N. (in press). Principles for Encoding machine readable dictionaries. Proceedings of the Fifth EURALEX International Congress, EURALEX 92. Tempere, Finland.Google Scholar
  17. Kaplan, R. and Bresnan, J. (1982). Lexical-functional grammar: A formal system for grammatical representation. In J. Bresnan (Ed.), The Mental Representation of Grammatical Relations. Cambridge, Massachussets: MIT Press.Google Scholar
  18. Karttunen, L. (1984). Features and values. Proceedings of the 10th International Conference on Computational Linguistics, COLING’84. Stanford, California, 28–33.Google Scholar
  19. Kay, M. (1985). Parsing in functional unification grammar. In D.R. Dowty, L. Karttunen, A. M. Zwicky (eds.). Natural ty Press.Google Scholar
  20. Kipfer, B. A. (1983). Computer applications in lexicography: a bibliography. Dictionaries: Journal of Dictionary Society of North America, 4, 202–237.CrossRefGoogle Scholar
  21. Klavans, J., Chodorow, M., Wacholder, N. (1990). From dictionary to knowledge base via taxonomy. Proceedings of the 6th Annual Conference of the UW Centre for the New Oxford English Dictionary. Waterloo, Ontario, 110–132.Google Scholar
  22. Landau, S. I. (1984). Dictionaries: The Art and Craft of Lexicography. New York: Scribner Press.Google Scholar
  23. Le Maitre, J., Boucelma, O. (in press). LIFOO, un langage fonctionnel de requêtes pour bases de données avancées. Technique et Science Info r matiques. Google Scholar
  24. Lecluse, C., Richard, P. (1989). The 02 database programming language. Proceedings of the 15th Conference on Very Large Data Bases, VLDB’87. Amsterdam, 411–422.Google Scholar
  25. Markowitz, J., Ahlswede, T., Evens, M. (1986). Semantically significant patterns in dictionary definitions. Proceedings of the 24th Annual Conference of the Association for Computational Linguistics. New York, 112–119.Google Scholar
  26. Nakamura, J., Nagao, M. (1988). Extraction of semantic information from an ordinary English dictionary and its evaluation. Proceedings of the 12 th International Conference on Computational Linguistics, COLING’88. Budapest, Hungary, 459–464.Google Scholar
  27. Neff, M. S., Byrd, R. J., Rizk, O. A. (1988). Creating and querying lexical databases. Proceedings of the Association for Computational Linguistics Second Applied Conference on Natural Language Processing. Austin, Texas, 84–92.Google Scholar
  28. Pistor, P., Traunmueller, R. (1986). A database language for sets, lists and tables. Information Systems, 11 (4), 323–336.CrossRefGoogle Scholar
  29. Pollard, C., Sag, I. A. (1987). Information-based Syntax and Semantics. CSLI Lecture Notes Series, Chicago: University of Chicago Press.Google Scholar
  30. Roth, M. A., Korth, H. F., Silberschatz, A. (1988). Extended algebra and calculus for nested relational databases. ACM Transactions on Database Systems, 13 (4), 389–417.CrossRefGoogle Scholar
  31. Schek, H.-J., Paul, H.-B., Scholl, M.H., Weikum, G. (1990). The DASDBS project: objectives, experiences, and future prospects. IEEE Transactions on Knowledge and Data Engineering, 2 (1), 25–42.CrossRefGoogle Scholar
  32. Shieber, S.M. (1986). An Introduction to Unification-based Approaches to Grammar. CSLI Lecture Notes Series, Chicago: University of Chicago Press.Google Scholar
  33. Sinclair, J. M. (1987). An Account of the COBUILD Project. London: Collins ELT.Google Scholar
  34. Sperberg-McQueen, M., Burnard, L. (1990). Guidelines for the encoding and interchange of machine-readable texts, Draft, Version 0.0. ACH, ACL, and ALLC.Google Scholar
  35. The DANLEX Group (1987). Descriptive tools for electronic processing of dictionary data. Lexicographica, Series Maior. Tübingen: Niemeyer.Google Scholar
  36. Tompa, F. W. (1989). What is tagged text? Proceedings of the 5th Annual Conference of the UW Centre for the New Oxford English Dictionary. Oxford, England, 81–93.Google Scholar
  37. Véronis, J., Ide, N., M. (1990). Word Sense Disambiguation with Very Large Neural Networks Extracted from Machine Readable Dictionaries. Proceedings of the 13th International Conference on Computational Linguistics, COLING’90. Helsinki, Finland, 2, 389–394.Google Scholar
  38. Wilks, Y., Fass D., Guo, C., MacDonald, J., Plate, T., Slator, B. (1990). Providing Machine Tractable Dictionary Tools. Machine Translation, 5, 99–154.CrossRefGoogle Scholar

Copyright information

© Springer Science+Business Media Dordrecht 1994

Authors and Affiliations

  • Nancy Ide
    • 1
  • Jacques Le Maitre
    • 2
  • Jean Véronis
    • 1
    • 2
  1. 1.Department of Computer ScienceVassar CollegePoughkeepsieUSA
  2. 2.Groupe Représentation et Traitement des ConnaissancesCentre National de la Recherche Scientifique,31Marseille Cedex 09France

Personalised recommendations