Idioms Modeling in a Computer Ontology as a Morphosyntactic Disambiguation Strategy
This article describes the development of a computer ontology as a tool for processing Tibetan idioms. The ontology contains a consistent specification of the meanings of lexical units and of the relations between them. It thus serves as a model of lexical semantics and of both syntactic and semantic valencies, reflecting the Tibetan linguistic picture of the world. The article presents an attempt to classify Tibetan idioms, including compounds, which are idiomatized clips of syntactic groups that have frozen inner syntactic relations and are often characterized by the omission of grammatical morphemes. It then applies this classification to idiom processing in the computer ontology. The article also proposes methods of using the ontology to avoid ambiguity in idiom processing.
Keywords: Tibetan language, Idioms, Compounds, Computer ontology, Tibetan corpus, Natural language processing, Corpus linguistics, Immediate constituents
This work was supported by the Russian Foundation for Basic Research, Grant No. 16-06-00578, "Morphosyntactical analyser of texts in the Tibetan language".