Abstract
Question classification plays an important role in question answering systems. This paper presents the Conditional Random field (CRF) model based on Morpheme features for Tamil question classification. It is a process that analyzes a question and labels it based on its question type and expected answer type (EAT). The selected features are the morpheme parts of the question terms and its dependent terms. The main contribution in this work is in the way of selection of features for constructing CRF Model. They discriminates the position of expected answer type information with respect to question term’s position. The CRF model to find out the phrase which contains the information about EAT is trained with tagged question corpus. The EAT is semantically derived by analyzing the phrase obtained from CRF engine using WordNet. The performance of this morpheme based CRF model is compared with the generic CRF engine.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Anandan, P., Saravanan, K., Parthasarathi, R., Geetha, T.V.: Morphological Analyzer for Tamil. In: International Conference On Natural language processing (2002)
Lee, C., Wang, J.-H., Kim, H.-J., Jang, M.-G.: Extracting Template for Knowledge-based Question- Answering Using Conditional Random Fields. In: ACM SIGIR Workshop on Mathematical/Formal Methods in Information Retrieval MF/IR 2005 Salvador, Brazil, August 15-19 (2005)
Zhang, D., Lee, W.S.: Question Classification using Support Vector Machines. In: 26th annual international ACM SIGIR conference on Research and development in information retrieval, Toronto, Canada (2003)
Devi Poongulhali, P., Kavitha Noel, N., Preeda Lakshmi, R., Manavazhahan, A., Geetha, T.V.: Tamil WordNet 1st International Global Wordnet Conference, Jan. 21-25, 2002. CIIL, Mysore (2002)
Graesser, A.C., Person, N.K.: Question asking during tutoring. American Educational Research Journal 31, 104–137 (1994)
Lafferty, J., McCallum, A., Pereira, F.: Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In: Proc. of the International Conference on Machine Learning (ICML) (2001)
Xin, L., Xuan-Jing, H., Li-de, W.: Question Classification by Ensemble Learning. IJCSNS International Journal of Computer Science and Network Security 6(3) (2006)
Pinto, D., Branstein, M., Coleman, R., King, M., Li, W., Wei, X., Croft, W.B.: QuASM: A System for Question Answering Using Semi-Structured Data. In: Proceedings of the JCDL 2002 Joint Conference on Digital Libraries (2002)
Rabiner, L.R.: A tutorial on hidden Markov models and selected applications in speech recognition. In: Proceedings of the IEEE. IEEE, Los Alamitos (1989); IEEE Log Number 8825949
Krishnan, V., Das, S., Chakrabarti, S.: Enhanced Answer Type Inference from Questions using Sequential Models. In: Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing, Vancouver, British Columbia, Canada, October 6-8,2005, pp. 315–322 (2005)
Li, W.: Question Classification Using Language Modeling, http://citeseer.ist.psu.edu/576076.html
Li Dan Roth, X.: Learning Question Classifiers International Conference on Computational Linguistics (COLING 2002), Taipei, Taiwan, 24 August (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lakshmana Pandian, S., Geetha, T.V. (2008). Tamil Question Classification Using Morpheme Features. In: Nordström, B., Ranta, A. (eds) Advances in Natural Language Processing. GoTAL 2008. Lecture Notes in Computer Science(), vol 5221. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85287-2_26
Download citation
DOI: https://doi.org/10.1007/978-3-540-85287-2_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-85286-5
Online ISBN: 978-3-540-85287-2
eBook Packages: Computer ScienceComputer Science (R0)