Idioms Modeling in a Computer Ontology as a Morphosyntactic Disambiguation Strategy

The Case of Tibetan Corpus of Grammar Treatises
  • Alexei Dobrov
  • Anastasia Dobrova
  • Pavel Grokhovskiy
  • Maria Smirnova
  • Nikolay Soms
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 11107)

Abstract

The article presents the experience of developing a computer ontology as one of the tools for processing Tibetan idioms. A computer ontology that contains a consistent specification of the meanings of lexical units, with different relations between them, represents a model of lexical semantics and of both syntactic and semantic valencies, reflecting the Tibetan linguistic picture of the world. The article presents an attempt to classify Tibetan idioms, including compounds, which are idiomatized clips of syntactic groups that have frozen inner syntactic relations and are often characterized by the omission of grammatical morphemes, and applies this classification to idiom processing in the computer ontology. The article also proposes methods of using the computer ontology to avoid ambiguity in idiom processing.

Keywords

Tibetan language; Idioms; Compounds; Computer ontology; Tibetan corpus; Natural language processing; Corpus linguistics; Immediate constituents

Notes

Acknowledgment

This work was supported by the Russian Foundation for Basic Research, Grant No. 16-06-00578, "Morphosyntactical analyser of texts in the Tibetan language".

Copyright information

© Springer Nature Switzerland AG 2018

Authors and Affiliations

  • Alexei Dobrov (1)
  • Anastasia Dobrova (2)
  • Pavel Grokhovskiy (1)
  • Maria Smirnova (1)
  • Nikolay Soms (2)
  1. Saint-Petersburg State University, Saint-Petersburg, Russia
  2. LLC "AIIRE", Saint-Petersburg, Russia
