Skip to main content

Using Machine Learning and Text Mining in Question Answering

  • Conference paper
Evaluation of Multilingual and Multi-modal Information Retrieval (CLEF 2006)

Abstract

This paper describes a QA system centered in a full data-driven architecture. It applies machine learning and text mining techniques to identify the most probable answers to factoid and definition questions respectively. Its major quality is that it mainly relies on the use of lexical information and avoids applying any complex language processing resources such as named entity classifiers, parsers and ontologies. Experimental results on the Spanish Question Answering task at CLEF 2006 show that the proposed architecture can be a practical solution for monolingual question answering by reaching a precision as high as 51%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Magnini, B., Vallin, A., Ayache, C., Erbach, G., Peñas, A., de Rijke, M., Rocha, P., Simov, K., Sutcliffe, R.: Overview of the CLEF 2004 Multilingual Question Answerig Track. In: Peters, C., Clough, P.D., Gonzalo, J., Jones, G.J.F., Kluck, M., Magnini, B. (eds.) CLEF 2004. LNCS, vol. 3491, Springer, Heidelberg (2005)

    Google Scholar 

  2. Ferrández, S., López-Moreno, P., Roger, S., Ferrández, A., Peral, J., Alvarado, X., Noguera, E., Llopis, F.: AliQAn and BRILI QA Systems at CLEF 2006. In: Working notes for the Cross Language Evaluation Forum Workshop, September 2006, (CLEF 2006) (2006)

    Google Scholar 

  3. Buscaldi, D., Gomez, J.M., Rosso, P., Sanchis, E.: The UPV at QA@CLEF 2006. In: Working notes for the Cross Language Evaluation Forum Workshop, September 2006, (CLEF 2006) (2006)

    Google Scholar 

  4. de-Pablo-Sánchez, C., González-Ledesma, A., Moreno, A., Martínez-Fernández, J.L., Martínez, P.: MIRACLE at the Spanish CLEF@QA 2006 Track. In: Working notes for the Cross Language Evaluation Forum Workshop, September 2006, (CLEF 2006) (2006)

    Google Scholar 

  5. Ferrés, D., Kanaan, S., González, E., Ageno Al, R.H., Turmo, J.: The TALP-QA System for Spanish at CLEF-2005. In: Peters, C., Gey, F.C., Gonzalo, J., Müller, H., Jones, G.J.F., Kluck, M., Magnini, B., de Rijke, M., Giampiccolo, D. (eds.) CLEF 2005. LNCS, vol. 4022, Springer, Heidelberg (2006)

    Google Scholar 

  6. Montes-y-Gómez, M., Villaseñor-Pineda, L., Pérez-Coutiño, M., Gómez-Soriano, J.M., Sanchis-Arnal, E., Rosso, P.: INAOE-UPV Joint Participation at CLEF 2005: Experiments in Monolingual Question Answering. In: Peters, C., Gey, F.C., Gonzalo, J., Müller, H., Jones, G.J.F., Kluck, M., Magnini, B., de Rijke, M., Giampiccolo, D. (eds.) CLEF 2005. LNCS, vol. 4022, Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  7. Gómez-Soriano, J.M., Montes-y-Gómez, M., Sanchis-Arnal, E., Villaseñor-Pineda, L., Rosso, P.: Language Independent Passage Retrieval for Question Answering. In: Gelbukh, A., de Albornoz, Á., Terashima-Marín, H. (eds.) MICAI 2005. LNCS (LNAI), vol. 3789, Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  8. Denicia-Carral, C., Montes-y-Gómez, M., Villaseñor-Pineda, L., García-Hernández, R.: A Text Mining Approach for Definition Question Answering. In: Proceedings for the Fifth International Conference on Natural Language Processing (FinTal 2006), Turku, Finland (August 2006)

    Google Scholar 

  9. García-Hernández, R., Martínez-Trinidad, F., Carrasco-Ochoa, A.: A New Algorithm for Fast Discovery of Maximal Sequential Patterns in a Document Collection. In: Gelbukh, A. (ed.) CICLing 2006. LNCS, vol. 3878, Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  10. Cassan, A., Figueira, H., Martins, A., Mendes, A., Mendes, P., Pinto, P., Vidal, D.: Priberam’s question answering system in a cross-language environment. In: Working notes for the Cross Language Evaluation Forum Workshop, September 2006, (CLEF 2006) (2006)

    Google Scholar 

  11. Magnini, B., Giampiccolo, D., Forner, P., Ayache, C., Jijkoun, V., Osenova, P., Peñas, A., Rocha, P., Sacaleanu, B., Sutcliffe, R.: Overview of the CLEF 2006 Multilingual Question Answering Track. In: Working notes for the Cross Language Evaluation Forum Workshop, September 2006 (CLEF 2006) (2006)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Carol Peters Paul Clough Fredric C. Gey Jussi Karlgren Bernardo Magnini Douglas W. Oard Maarten de Rijke Maximilian Stempfhuber

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Juárez-González, A., Téllez-Valero, A., Denicia-Carral, C., Montes-y-Gómez, M., Villaseñor-Pineda, L. (2007). Using Machine Learning and Text Mining in Question Answering. In: Peters, C., et al. Evaluation of Multilingual and Multi-modal Information Retrieval. CLEF 2006. Lecture Notes in Computer Science, vol 4730. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74999-8_49

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-74999-8_49

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-74998-1

  • Online ISBN: 978-3-540-74999-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics