Abstract
The identification of semantic relations from a raw text is an important problem in Natural Language Processing. This paper provides semi-automatic pattern-based extraction of part–whole relations. We utilized and adopted some lexico-syntactic patterns to disclose meronymy relation from a Turkish corpus. We applied two different approaches to prepare patterns; one is based on pre-defined patterns that are taken from the literature, second automatically produces patterns by means of bootstrapping method. While pre-defined patterns are directly applied to corpus, other patterns need to be discovered first by taking manually prepared unambiguous seeds. Then, word pairs are extracted by their occurrence in those patterns. In addition, we used statistical selection on global data that is obtaining from all results of entire patterns. It is a whole-by-part matrix on which several association metrics such as information gain, T-score, etc., are applied. We examined how all these approaches improve the system accuracy especially within corpus-based approach and distributional feature of words. Finally, we conducted a variety of experiments with a comparison analysis and showed advantage and disadvantage of the approaches with promising results.
Notes
Türk Dil Kurumu (The Turkish Language Association).
Vikisözlük: Özgür Sözlük.
References
Cruse AD (2003) The lexicon. In: Aronoff M, Ress-Miller J (eds) The handbook of linguistics. Blackwell Publisher Ltd., Oxford, pp 238–264
Keet CM, Artale A (2008) Representing and reasoning over a taxonomy of part–whole relations. Appl Ontol 3(1–2):91–110
Pribbenow S (2002) Meronymic relationships: from classical mereology to complex part–whole relations. In: Green R, Bean CA, Myaeng SH (eds) The semantics of relationships. Springer, Netherlands, pp 35–50
Croft W, Cruse D (2004) Cognitive linguistics. Cambridge University Press, Cambridge
Simons P (1987) Parts: a study in ontology. Oxford University Press, UK
Gerstl P, Pribbenow S (1995) Midwinters, end games, and body parts: a classification of part–whole relations. Int J Hum–Comput Stud 43(5–6):865–889
Iris MA, Litowitz BE, Evens M (1988) Problems of the part–whole relation. In: Evens M (ed) Relational models of the lexicon. Cambridge University Press, Cambridge, pp 261–288
Winston ME, Chaffin R, Herrmann D (1987) A Taxonomy of part–whole relations. Cogn Sci 11(4):417–444
Miller GA et al (1990) Introduction to WordNet: an on-line lexical database. Int J Lexicogr 3(4):235–244
Murphy ML (2003) Semantic relations and the lexicon: antonymy, synonymy, and other paradigms. Cambridge University Press, UK
Artale A, Franconi E, Guarino N, Pazzi L (1996) Part–whole relations in object-centered systems: an overview. Data Knowl Eng 20(3):347–383
Girju R, Badulescu A, Moldovan D (2006) Automatic discovery of part–whole relations. Comput Linguist 32(1):83–135
Hamon T, Natalia G (2008) How can the term compositionality be useful for acquiring elementary semantic relations? In: Nordström B, Ranta A (eds) Advances in natural language processing, LNCS 5221. Springer, Berlin, Heidelberg, pp 181–192
Roberts A (2005) Learning meronyms from biomedical text. In: Proceedings of the ACL student research workshop (ACLstudent ’05). Association for Computational Linguistics, Stroudsburg, PA, USA, pp 49–54
Ling X, Clark P, Weld DS (2013) Extracting meronyms for a biology knowledge base. In: Proceedings of the 2013 workshop on automated knowledge base construction (AKBC ’13). ACM, USA, pp 7–12
Ittoo A, Bouma G, Maruster L, Wortmann H (2010) Extracting meronymy relationships from domain specific, textual corporate databases. In: Hopfe CJ, Rezgui Y, Metais E, Preece AD, Li H (eds) Natural language processing and information system, LNCS 6177. Springer, Berlin, pp 48–59
Pantel P, Pennacchiotti M (2006) Espresso: leveraging generic patterns for automatically harvesting semantic relations. In: Proceeding of the 21st international conference on computational linguistics and 44th annual meeting of the Association for Computational Linguistics. Australia, Sydney, pp 113–120
Vor der Bruck T, Helbig H (2010) Meronymy extraction using an automated theorem prover. J Lang Technol Comput Linguist 25(1):57–82
Vor der Bruck T, Helbig H (2010) Validating meronymy hypotheses with support vector machines and graph kernels. In: Proceedings of the 2010 Ninth International Conference on Machine Learning and Applications (ICMLA '10). IEEE Computer Society, Washington, DC, USA, pp 243–250
Xia F, Cungen C (2014) Extracting part–whole relations from online encyclopedia. In: Shi Z, Wu Z, Leake D, Sattler U (eds) Intelligent information processing VII. IFIP advances in information and communication technology, vol 432. Springer, Berlin, Heidelberg, pp 57–66
Hearst MA (1992) Automatic acquisition of hyponyms from large text corpora. In: Proceedings of the 14th international conference on computational linguistics, COLING 1992. Nantes, France, pp 539–545
Berland M, Charniak E (1999) Finding parts in very large corpora. In: Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics, USA, pp 57–64
Girju R, Badulescu A, Moldovan D (2003) Learning semantic constraints for the automatic discovery of part–whole relations. In: Proceedings of the human language technology conference of the North American Chapter of the Association for Computational Linguistics. Edmonton, Canada, pp 1–8
Van HWR, Kolb H, Schreiber G (2006) A method for learning part–whole relations. In: Cruz IF, Decker S, Allemang D, Preist C, Schwabe D, Mika P, Uschold M, Aroyo L (eds) International semantic web, LNCS 4273. Springer, Berlin, pp 723–735
Ittoo A, Bouma G (2010) On learning subtypes of the part–whole relation: do not mix your seeds. In: Proceedings of the 48th annual meeting of the Association for Computational Linguistics, ACL’10. Association for Computational Linguistics, Uppsala, Sweden, pp 1328–1336
Cao X, Cao C, Wang S, Lu H (2008) Extracting part–whole relations from unstructured Chinese Corpus. In: Proceedings of the 2008 5th international conference on fuzzy systems and knowledge discovery, pp 175–179
Yao T, Uszkoreit H (2005) Identifying semantic relations between named entities from Chinese texts. In: Lu R, Siekmann JH, Ullrich C (eds) Proceedings of the 2005 joint Chinese-German conference on cognitive systems, LNCS 4429. Springer-Verlag, Berlin, Heidelberg, pp 70–83
Orhan Z, Pehlivan I, Uslan V, Onder P (2011) Automated extraction of semantic word relations in Turkish lexicon. Math Comput Appl 16(1):13–22
Serbetçi A, Orhan Z, Pehlivan I (2011) Extraction of semantic word relations in Turkish from dictionary definitions. In: Proceedings of the ACL 2011 workshop on relational models of semantics, RELMS 2011. Portland, Oregon, USA, pp 11–18
Yazıcı E, Amasyalı MF (2011) Automatic extraction of semantic relationships using Turkish dictionary definitions. EMO Bilimsel Dergi, İstanbul
Yıldız T, Yıldırım S, Diri B (2013) Extraction of part–whole relations from Turkish corpora. In: Gelbukh A (ed) Computational linguistics and intelligent text processing, LNCS 7816. Springer, Berlin, Heidelberg, pp 126–138
Yıldız T, Diri B, Yıldırım S (2014) Analysis of lexico-syntactic patterns for meronym extraction from a Turkish corpus. 6th Language and technology conference. Human language technologies as a challenge for computer science and linguistics, LTC, Poland, pp 429–433
Sak H, Güngör T, Saraçlar M (2008) Turkish language resources: morphological parser, morphological disambiguator and web corpus. In: Nordström B, Ranta A (eds) Advances in natural language processing, LNCS 5221. Springer-Verlag, Berlin, Heidelberg, pp 417–427
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Yıldız, T., Diri, B. & Yıldırım, S. Acquisition of Turkish meronym based on classification of patterns. Pattern Anal Applic 19, 495–507 (2016). https://doi.org/10.1007/s10044-015-0516-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10044-015-0516-9