Abstract
Japanese relative clause constructions (RCCs) are defined as NPs of the structure ‘S NP’; they lack a relative pronoun or any other explicit form of noun-clause demarcation. Japanese relative clause modification should be classified into at least two major semantic types: case-slot gapping and head restrictive. However, these two types cannot be distinguished by their surface form. In this paper, we propose a method of identifying an RCC’s type with a machine learning technique. The features used in our approach not only represent the characteristics of RCCs but are also obtained automatically from large corpora. Our evaluation showed that the method outperformed the traditional case frame-based method and that the features we presented were effective in identifying RCC types.
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Abekawa, T., Okumura, M. (2005). Corpus-Based Analysis of Japanese Relative Clause Constructions. In: Dale, R., Wong, KF., Su, J., Kwong, O.Y. (eds) Natural Language Processing – IJCNLP 2005. IJCNLP 2005. Lecture Notes in Computer Science, vol 3651. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11562214_5
DOI: https://doi.org/10.1007/11562214_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29172-5
Online ISBN: 978-3-540-31724-1
eBook Packages: Computer Science, Computer Science (R0)