Abstract
This paper presents JMorpher, a morphological parsing utility that is implemented in pure Java. It is apparently the first tool of this type that natively runs on Android mobile devices. JMorpher compiles a lexical transducer definition in the AT&T raw text format, of the type generated by Foma and other open source finite-state packages, into an internal Java representation which is drawn upon to parse input strings. Besides the API, JMorpher comprises of a simple graphical interface that allows the user to load a transducer file, type in some text and parse it. Results of an evaluation based on large Portuguese lexical transducers of different complexity degrees are provided. The implementation was shown to be very efficient on a desktop PC. Although, on an Android smartphone, JMorpher’s performance is much lower, it is still suited to the needs of NLP tasks in this environment.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Beesley, K.R., Karttunen, L.: Finite State Morphology. CSLI, Stanford (2003)
Bird, S., Klein, E., Loper, E.: Natural Language Processing with Python: Analyzing Text with the Natural Language Toolkit. O’Reilly, Sebastopol (2009)
Branco, A., Silva, J.: Evaluating Solutions for the Rapid Development of State-of-the-Art POS Taggers for Portuguese. In: Lino, M.T., Xavier, M.F., Ferreira, F., Costa, R., Silva, R. (eds.) Proceedings of the 4th International Conference on Language Resources and Evaluation (LREC 2004), pp. 507–510. ELRA, Paris (2004)
Dale, R.: Classical Approaches to Natural Language Processing. In: Indurkhya, N., Damerau, F.J. (eds.) Handbook of Natural Language Processing, 2nd edn., pp. 3–7. Chapman & Hall/CRC, Boca Raton (2009)
Fradin, B.: Abbréviation des gloses morphologiques. Laboratoire de Linguistique Formelle, Paris (2013), http://www.llf.cnrs.fr/gloses-fr.php
Hardwick, S.: HFST: Optimized Lookup Format (2009), https://kitwiki.csc.fi/twiki/bin/view/KitWiki/HfstOptimizedLookupFormat
Hippisley, A.: Lexical Analysis. In: Indurkhya, N., Damerau, F.J. (eds.) Handbook of Natural Language Processing, 2nd edn., pp. 31–58. Chapman & Hall/CRC, Boca Raton (2009)
Hulden, M.: Foma: A Finite-State Compiler and Library. In: EACL (Demos), pp. 29–32 (2009)
Jurafsky, D., Martin, J.H.: Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition. Pearson, London (2009)
Leipzig Glossing Rules, http://www.eva.mpg.de/lingua/resources/glossing-rules.php
Michaelis Moderno Dicionário da Língua Portuguesa. Melhoramentos, São Paulo (2009), http://michaelis.uol.com.br/moderno/portugues/index.php
Lindén, K., Silfverberg, M., Pirinen, T.: HFST Tools for Morphology: An Efficient Open-Source Package for Construction of Morphological Analyzers. In: Mahlow, C., Piotrowski, M. (eds.) SFCM 2009. CCIS, vol. 41, pp. 28–47. Springer, Heidelberg (2009)
Muniz, M.C.M.: Projeto Unitex-PB. NILC, São Paulo (2004), http://www.nilc.icmc.usp.br/nilc/projects/unitex-pb/web/
Silfverberg, M., Lindén, K.: HFST Runtime Format: A Compacted Transducer Format Allowing for Fast Lookup. In: FSMNLP (2009), http://www.ling.helsinki.fi/klinden/pubs/fsmnlp2009runtime.pdf
Xerox: Linguistic Tools: Morphological Analysis. Morphology, http://open.xerox.com/Services/fst-nlp-tools/Pages/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
de Alencar, L.F., França, M.J.C., Andrade, K.M., Costa, P.B., Vasconcelos, H.S., Madeira, F.P. (2014). JMorpher: A Finite-State Morphological Parser in Java for Android. In: Baptista, J., Mamede, N., Candeias, S., Paraboni, I., Pardo, T.A.S., Volpe Nunes, M.d.G. (eds) Computational Processing of the Portuguese Language. PROPOR 2014. Lecture Notes in Computer Science(), vol 8775. Springer, Cham. https://doi.org/10.1007/978-3-319-09761-9_6
Download citation
DOI: https://doi.org/10.1007/978-3-319-09761-9_6
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-09760-2
Online ISBN: 978-3-319-09761-9
eBook Packages: Computer ScienceComputer Science (R0)