The Rich Transcription 2005 Spring Meeting Recognition Evaluation

Fiscus, Jonathan G.; Radde, Nicolas; Garofolo, John S.; Le, Audrey; Ajot, Jerome; Laprun, Christophe

doi:10.1007/11677482_32

Jonathan G. Fiscus¹⁸,
Nicolas Radde¹⁸,
John S. Garofolo¹⁸,
Audrey Le¹⁸,
Jerome Ajot¹⁸ &
…
Christophe Laprun^18,19

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3869))

Included in the following conference series:

International Workshop on Machine Learning for Multimodal Interaction

2020 Accesses
13 Citations

Abstract

This paper presents the design and results of the Rich Transcription Spring 2005 (RT-05S) Meeting Recognition Evaluation. This evaluation is the third in a series of community-wide evaluations of language technologies in the meeting domain. For 2005, four evaluation tasks were supported. These included a speech-to-text (STT) transcription task and three diarization tasks: “Who Spoke When”, “Speech Activity Detection”, and “Source Localization.” The latter two were first-time experimental proof-of-concept tasks and were treated as “dry runs”. For the STT task, the lowest word error rate for the multiple distant microphone condition was 30.0% which represented an impressive 33% relative reduction from the best result obtained in the last such evaluation – the Rich Transcription Spring 2004 Meeting Recognition Evaluation. For the diarization “Who Spoke When” task, the lowest diarization error rate was 18.56% which represented a 19% relative reduction from that of RT-04S.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Fiscus, et al.: Results of the Fall 2004 STT and MDE Evaluation. In: RT-04F Evaluation Workshop Proceedings, November 7-10 (2004)
Google Scholar
Garofolo, et al.: The Rich Transcription 2004 Spring Meeting Recognition Evaluation. In: ICASSP 2004 Meeting Recognition Workshop, May 17 (2004)
Google Scholar
Spring 2005 (RT-05S) Rich Transcription Meeting Recognition Evaluation Plan (2005), http://www.nist.gov/speech/tests/rt/rt2005/spring/rt05s-meeting-eval-plan-V1.pdf
Speaker Localization and Tracking – Evaluation Criteria, http://www.nist.gov/speech/tests/rt/t2005/spring/sloc/CHIL-IRST_SpeakerLocEval-V5.0-2005-01-18.pdf
LDC Meeting Recording Transcription, http://www.ldc.upenn.edu/Projects/Transcription/NISTMeet
SCTK toolkit, http://www.nist.gov/speech/tools/index.htm
Janin, A., Ang, J., Bhagat, S., Dhillon, R., Edwards, J., Macias-Guarasa, J., Morgan, N., Peskin, B., Shriberg, E., Stolcke, A., Wooters, C., Wrede, B.: The ICSI Meeting Project: Resources and Research. In: NIST ICASSP 2004 Meeting Recognition Workshop, Montreal (2004)
Google Scholar
Garofolo, J.S., Laprun, C.D., Michel, M., Stanford, V.M., Tabassi, E.: The NIST Meeting Room Pilot Corpus. In: LREC 2004 (2004)
Google Scholar
The ISL Meeting Corpus: The Impact of Meeting Type on Speech Style, Susanne Burger, Victoria MacLaren, Hua Yu, ICSLP 2002 (2002)
Google Scholar
Huang, Z., Harper, M.P.: Speech Activity Detection on Multichannels of Meeting Recordings. In: Proceedings from the RT 2005 Workshop at MLML 2005 (2005)
Google Scholar

Download references

Author information

Authors and Affiliations

National Institute of Standards and Technology, 100 Bureau Drive Stop 8940, Gaithersburg, MD, 20899, USA
Jonathan G. Fiscus, Nicolas Radde, John S. Garofolo, Audrey Le, Jerome Ajot & Christophe Laprun
Systems Plus, Inc., 1370 Piccard Drive, Suite 270, Rockville, MD, 20850, USA
Christophe Laprun

Authors

Jonathan G. Fiscus
View author publications
You can also search for this author in PubMed Google Scholar
Nicolas Radde
View author publications
You can also search for this author in PubMed Google Scholar
John S. Garofolo
View author publications
You can also search for this author in PubMed Google Scholar
Audrey Le
View author publications
You can also search for this author in PubMed Google Scholar
Jerome Ajot
View author publications
You can also search for this author in PubMed Google Scholar
Christophe Laprun
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

University of Edinburgh, Edinburgh, Scotland
Steve Renals
IDIAP Research Institute, Martigny, Switzerland
Samy Bengio

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Fiscus, J.G., Radde, N., Garofolo, J.S., Le, A., Ajot, J., Laprun, C. (2006). The Rich Transcription 2005 Spring Meeting Recognition Evaluation. In: Renals, S., Bengio, S. (eds) Machine Learning for Multimodal Interaction. MLMI 2005. Lecture Notes in Computer Science, vol 3869. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11677482_32

Download citation

DOI: https://doi.org/10.1007/11677482_32
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-32549-9
Online ISBN: 978-3-540-32550-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics