Skip to main content

JMorpher: A Finite-State Morphological Parser in Java for Android

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8775))

Abstract

This paper presents JMorpher, a morphological parsing utility that is implemented in pure Java. It is apparently the first tool of this type that natively runs on Android mobile devices. JMorpher compiles a lexical transducer definition in the AT&T raw text format, of the type generated by Foma and other open source finite-state packages, into an internal Java representation which is drawn upon to parse input strings. Besides the API, JMorpher comprises of a simple graphical interface that allows the user to load a transducer file, type in some text and parse it. Results of an evaluation based on large Portuguese lexical transducers of different complexity degrees are provided. The implementation was shown to be very efficient on a desktop PC. Although, on an Android smartphone, JMorpher’s performance is much lower, it is still suited to the needs of NLP tasks in this environment.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Beesley, K.R., Karttunen, L.: Finite State Morphology. CSLI, Stanford (2003)

    Google Scholar 

  2. Bird, S., Klein, E., Loper, E.: Natural Language Processing with Python: Analyzing Text with the Natural Language Toolkit. O’Reilly, Sebastopol (2009)

    Google Scholar 

  3. Branco, A., Silva, J.: Evaluating Solutions for the Rapid Development of State-of-the-Art POS Taggers for Portuguese. In: Lino, M.T., Xavier, M.F., Ferreira, F., Costa, R., Silva, R. (eds.) Proceedings of the 4th International Conference on Language Resources and Evaluation (LREC 2004), pp. 507–510. ELRA, Paris (2004)

    Google Scholar 

  4. Dale, R.: Classical Approaches to Natural Language Processing. In: Indurkhya, N., Damerau, F.J. (eds.) Handbook of Natural Language Processing, 2nd edn., pp. 3–7. Chapman & Hall/CRC, Boca Raton (2009)

    Google Scholar 

  5. Fradin, B.: Abbréviation des gloses morphologiques. Laboratoire de Linguistique Formelle, Paris (2013), http://www.llf.cnrs.fr/gloses-fr.php

    Google Scholar 

  6. Hardwick, S.: HFST: Optimized Lookup Format (2009), https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstOptimizedLookupFormat

  7. Hippisley, A.: Lexical Analysis. In: Indurkhya, N., Damerau, F.J. (eds.) Handbook of Natural Language Processing, 2nd edn., pp. 31–58. Chapman & Hall/CRC, Boca Raton (2009)

    Google Scholar 

  8. Hulden, M.: Foma: A Finite-State Compiler and Library. In: EACL (Demos), pp. 29–32 (2009)

    Google Scholar 

  9. Jurafsky, D., Martin, J.H.: Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition. Pearson, London (2009)

    Google Scholar 

  10. Leipzig Glossing Rules, http://www.eva.mpg.de/lingua/resources/glossing-rules.php

  11. Michaelis Moderno Dicionário da Língua Portuguesa. Melhoramentos, São Paulo (2009), http://michaelis.uol.com.br/moderno/portugues/index.php

  12. Lindén, K., Silfverberg, M., Pirinen, T.: HFST Tools for Morphology: An Efficient Open-Source Package for Construction of Morphological Analyzers. In: Mahlow, C., Piotrowski, M. (eds.) SFCM 2009. CCIS, vol. 41, pp. 28–47. Springer, Heidelberg (2009)

    Chapter  Google Scholar 

  13. Muniz, M.C.M.: Projeto Unitex-PB. NILC, São Paulo (2004), http://www.nilc.icmc.usp.br/nilc/projects/unitex-pb/web/

  14. Silfverberg, M., Lindén, K.: HFST Runtime Format: A Compacted Transducer Format Allowing for Fast Lookup. In: FSMNLP (2009), http://www.ling.helsinki.fi/klinden/pubs/fsmnlp2009runtime.pdf

  15. Xerox: Linguistic Tools: Morphological Analysis. Morphology, http://open.xerox.com/Services/fst-nlp-tools/Pages/

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

de Alencar, L.F., França, M.J.C., Andrade, K.M., Costa, P.B., Vasconcelos, H.S., Madeira, F.P. (2014). JMorpher: A Finite-State Morphological Parser in Java for Android. In: Baptista, J., Mamede, N., Candeias, S., Paraboni, I., Pardo, T.A.S., Volpe Nunes, M.d.G. (eds) Computational Processing of the Portuguese Language. PROPOR 2014. Lecture Notes in Computer Science(), vol 8775. Springer, Cham. https://doi.org/10.1007/978-3-319-09761-9_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-09761-9_6

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-09760-2

  • Online ISBN: 978-3-319-09761-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics