Skip to main content

Klex: A Finite-State Transducer Lexicon of Korean

  • Conference paper
  • 693 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4002))

Abstract

This paper describes the implementation and system details of Klex, a finite-state transducer lexicon for the Korean language, developed using XRCE’s Xerox Finite State Tool (XFST). Klex is essentially a transducer network representing the lexicon of the Korean language with the lexical string on the upper side and the inflected surface string on the lower side. Two major applications for Klex are morphological analysis and generation: given a well-formed inflected lower string, a language-independent algorithm derives the upper lexical string from the network and vice versa. Klex was written to conform to the part-of-speech tagging standards of the Korean Treebank Project, and is currently operating as the morphological analysis engine for the project.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Back, D.H., Lee, H., Rim, H.C.: A structure of korean electronic dictionary using the finite state transducer. In: Proceedings of the 7th Symposium for Information Processing of Hangul and Korean (1995) (in Korean)

    Google Scholar 

  2. Beesley, K.R., Karttunen, L.: Finite-State Morphology: Xerox Tools and Techniques. CSLI Publications, Stanford, California (2003)

    Google Scholar 

  3. Han, C.H., Han, N.R.: Part of speech tagging guidelines for penn korean treebank. Technical report, IRCS, University of Pennsylvania (2001)

    Google Scholar 

  4. Han, N.R.: Klex: Finite-state lexical transducer for korean. Linguistic Data Consortium (LDC) (2004), catalog number LDC2004L01 and ISBN 1-58563-283-x

    Google Scholar 

  5. Han, N.R.: Morphologically annotated korean text. Linguistic Data Consortium (LDC) (2004), catalog number LDC2004T03 and ISBN 1-58563-284-8

    Google Scholar 

  6. Kim, S.: Korean Morphology. Tap Publishing, Seoul, Korea (1992) (in Korean)

    Google Scholar 

  7. Ko, Y.: A Study of Korean Morphology. Seoul National University Press, Seoul, Korea (1989) (in Korean)

    Google Scholar 

  8. Koskenniemi, K.: Two-level morphology: A general computational model for word form recognition and production. Publication No: 11, Department of General Linguistics, University of Helsinki (1983)

    Google Scholar 

  9. Minjungseorim (ed.): Minjung Eutteum Korean Dictionary for Elementary School Students. Minjungseorim, Seoul, Korea (1998) (in Korean)

    Google Scholar 

  10. Palmer, M., Han, C.H., Han, N.R., Ko, E.S., Yi, H.J., Lee, A., Walker, C., Duda, J., Xue, N.: Korean english treebank annotations. Linguistic Data Consortium (LDC) catalog number LDC2002T26 and ISBN 1-58563-236-8 (2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Han, NR. (2006). Klex: A Finite-State Transducer Lexicon of Korean. In: Yli-Jyrä, A., Karttunen, L., Karhumäki, J. (eds) Finite-State Methods and Natural Language Processing. FSMNLP 2005. Lecture Notes in Computer Science(), vol 4002. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11780885_8

Download citation

  • DOI: https://doi.org/10.1007/11780885_8

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-35467-3

  • Online ISBN: 978-3-540-35469-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics