Advertisement

An Application of NLP Rules to Spoken Document Segmentation Task

  • Rafael M. Terol
  • Patricio Martínez-Barco
  • Fernando Llopis
  • Trinitario Martínez
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3513)

Abstract

One of the main differences between Spoken Document Retrieval (SDR) systems and Text Retrieval systems is the need of a segmentation process that detects the story boundaries. However, until now, SDR researchers have not paid attention in building correct segments more than considering slidding windows of a fixed size in time. In this paper, new methodology for evaluating segments to SDR task, and the evaluation of three possible strategies are presented over the TREC-9 SDR collection. Moreover, the justification of each strategy is discussed.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    The Ninth Text Retrieval Conference (TREC 9), Gaithersburg, Maryland (2000)Google Scholar
  2. 2.
    Llopis, F., Martínez-Barco, P.: Spoken Document Retrieval experiments with IR-n system. In: Proceedings of the CLEF 2003: Workshop on Cross-Language Information Retrieval and Evaluation, Trondheim, pp. 427–433 (2003)Google Scholar
  3. 3.
    Federico, M., Bertoldi, N., Levow, G.A., Jones, G.J.: CLEF 2004 Cross-Language Spoken Document Retrieval Track, 631–634Google Scholar
  4. 4.
    Johnson, S., Jourlin, P., Jones, K.S., Woodland, P.: Spoken Document Retrieval for TREC-9 at Cambridge University. In: [1], pp. 117–126 Google Scholar
  5. 5.
    Renals, S., Abberley, D.: The Thisl SDR System at TREC-9. In: [1], pp. 627–634 Google Scholar
  6. 6.
    Gauvain, J.L., Lamel, L., Barras, C., Adda, G., de Kercardio, Y.: The LIMSI SDR System for TREC-9. In: [1], pp. 335–360Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2005

Authors and Affiliations

  • Rafael M. Terol
    • 1
  • Patricio Martínez-Barco
    • 1
  • Fernando Llopis
    • 1
  • Trinitario Martínez
    • 1
  1. 1.Departamento de Lenguajes y Sistemas InformáticosUniversidad de AlicanteAlicanteSpain

Personalised recommendations