Abstract
We are building a web-based system to support the reading and comprehension of Indonesian text. The system helps users understand difficult words by displaying their dictionary information in a window. Many Indonesian words are formed by combining root words with affixes and other combining forms, so finding the relevant dictionary entry requires a stemming program that extracts the root word. We developed our own Indonesian stemmer for this purpose. Because it is used only within a text reading system, the stemmer does not need to be perfect. In this paper, we describe the stemmer and report the results of a preliminary evaluation. We also describe the design of the text reading support system that uses it.
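The idea of stripping affixes to reach a dictionary entry can be illustrated with a minimal Python sketch. It is not the stemmer described in the paper: the affix lists, the toy `DICTIONARY`, and the function names `candidate_roots` and `lookup` are all assumptions made for illustration. The sketch also ignores spelling changes at the prefix boundary (for example, the dropped initial consonant in some meN- forms), which a real stemmer would have to handle.

```python
# Illustrative affix lists (an assumption, not the paper's rule set).
# Longer prefixes come first so that "meng" is tried before "me".
PREFIXES = ("meng", "meny", "mem", "men", "me",
            "ber", "per", "ter", "di", "ke", "se", "pe")
SUFFIXES = ("kan", "nya", "lah", "kah", "an", "i")

# Hypothetical toy dictionary mapping root words to English glosses.
DICTIONARY = {
    "baca": "to read",
    "makan": "to eat",
    "ajar": "to teach",
    "main": "to play",
}

MIN_ROOT_LENGTH = 3  # Skip candidates too short to be plausible roots.


def candidate_roots(word):
    """Yield the word itself, then forms with a suffix and/or prefix removed."""
    word = word.lower()
    stems = [word]
    # First pass: remove at most one suffix.
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= MIN_ROOT_LENGTH:
            stems.append(word[: -len(suffix)])
    # Second pass: remove at most one prefix from every form collected so far.
    for stem in list(stems):
        for prefix in PREFIXES:
            if stem.startswith(prefix) and len(stem) - len(prefix) >= MIN_ROOT_LENGTH:
                stems.append(stem[len(prefix):])
    # Emit candidates in order, without duplicates.
    seen = set()
    for stem in stems:
        if stem not in seen:
            seen.add(stem)
            yield stem


def lookup(word):
    """Return (root, gloss) for the first candidate found in the dictionary."""
    for root in candidate_roots(word):
        if root in DICTIONARY:
            return root, DICTIONARY[root]
    return None


if __name__ == "__main__":
    for token in ("membaca", "dimakan", "bermain", "mengajar"):
        print(token, lookup(token))
```

Because every candidate is checked against the dictionary, an unaffixed word such as "makan" is matched as-is before any stripping is attempted. This fits a reading-support setting, where a missed or extra candidate costs little as long as a plausible entry is shown.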
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Mochizuki, H., Nakamura, Y., Shibano, K. (2012). Indonesian Shallow Stemmer for Text Reading Support System. In: Gaol, F. (eds) Recent Progress in Data Engineering and Internet Technology. Lecture Notes in Electrical Engineering, vol 157. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28798-5_33
DOI: https://doi.org/10.1007/978-3-642-28798-5_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-28797-8
Online ISBN: 978-3-642-28798-5
eBook Packages: Engineering, Engineering (R0)