Abstract
Patent MT is emerging as a killer application domain for MT, as there is a strong need for fast and cheap translations for patent documents. Most of the Korean-English MT engines hitherto have suffered from the poor syntactic analysis performance and the lack of linguistic resources. We customized a Korean syntactic parser based on the linguistic characteristics of the patent documents, especially focusing on minimizing the noise in acquiring the co-occurrence data for Korean syntactic parsing. We will show that the improvement of the quality of co-occurrence data and other customization processes can lead to the im prove ment of the parsing accuracy and thus the improvement of the translation quality.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Shinmori, A., Okumura, M., Marukawa, Y., Iwayama, M.: Patent Claim Processing for Readability. In: Proceedings of ACL 2003 Workshop on Patent Corpus Processing Workshop (2003)
Kobayashi, A.: Machine Translation and Japio’s role in disseminating Japanese information (2004), http://www.european-patent-office.org/epidos/conf/jpinfo/2004/_pdf/pres/japio_kobayashi_machine_translation_and_japio_role.pdf
Kim, Y., Hong, M., Kim, C., Park, S.: Word Sense Disambiguation Using Lexical and Semantic Information within Local Syntactic Relations. In: Proceedings of the 30th Annual Conference of the IEEE Industrial Electronics Society (2004)
Hong, M., Kim, Y., Kim, C., Yang, S., Seo, Y., Ryu, C., Park, S.: Customizing a Korean-English MT System for Patent Translation. In: Proceedings of the tenth MT Summit (2005)
Sheremetyeva, S.: Natural Language Analysis of Patent Claims. In: Proceedings of ACL 2003 Workshop on Patent Corpus Processing Workshop (2003)
Lee, H., Kang, I., Lee, J.: Determination of Unknown Syntactic Relation in Korean using Concept patterns and Statistical Information. In: Conference on Korean Information Processing, pp. 261–266 (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kim, CH., Hong, M. (2006). A Korean Syntactic Parser Customized for Korean-English Patent MT System. In: Salakoski, T., Ginter, F., Pyysalo, S., Pahikkala, T. (eds) Advances in Natural Language Processing. FinTAL 2006. Lecture Notes in Computer Science(), vol 4139. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11816508_7
Download citation
DOI: https://doi.org/10.1007/11816508_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-37334-6
Online ISBN: 978-3-540-37336-0
eBook Packages: Computer ScienceComputer Science (R0)