Skip to main content

LIUM ASR System for ETAPE French Evaluation Campaign: Experiments on System Combination Using Open-Source Recognizers

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8082))

Abstract

In this paper, we report the LIUM participation in the ETAPE [1] (Évaluations en Traitement Automatique de la Parole) evaluation campaign, on the rich transcription task for French track. After describing the ETAPE goals and guidelines, we present our ASR system, which ranked first in the ETAPE evaluation campaign. Two ASR systems were used for our participation in ETAPE 2011. In addition to the LIUM ASR system based on CMU Sphinx project, we utilized an additional open-source ASR system based on the RASR toolkit. We evaluate, in this paper, the gain obtained with various acoustics modeling and adaptation techniques for each of the two systems, as well as with various system combination techniques. The combination of two different ASR systems allows a significant WER reduction, from 23.6% for the best single ASR system to 22.6% for the combination.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Gravier, G., Adda, G., Paulsson, N., Carré, M., Giraudel, A., Galibert, O.: The ETAPE corpus for the evaluation of speech-based TV content processing in the French language. In: International Conference on Language Resources, Evaluation and Corpora, LREC (May 2012)

    Google Scholar 

  2. Deléglise, P., Estève, Y., Meignier, S., Merlin, T.: Improvements to the LIUM French ASR system based on CMU Sphinx: what helps to significantly reduce the word error rate? In: Interspeech (2009)

    Google Scholar 

  3. Galliano, S., Geoffrois, E., Mostefa, D., Choukri, K., Bonastre, J.F., Gravier, G.: The ESTER phase II evaluation campaign for the rich transcription of French broadcast news. In: Eurospeech, European Conference on Speech Communication and Technology, Lisbon, Portugal (September 2005)

    Google Scholar 

  4. Meignier, S., Merlin, T.: LIUM SpkDiarization: an open source toolkit for diarization. In: ASRU (2010)

    Google Scholar 

  5. Zhu, Q., Chen, B., Morgan, N., Stolcke, A.: On using MLP features in LVCSR. In: Proc. ICSLP, Jeju, Korea, pp. 921–924 (2004)

    Google Scholar 

  6. Béchet, F.: LIA_PHON: un système complet de phonétisation de textes. Traitement Automatique des Langues – TAL 42, 47–67 (2001)

    Google Scholar 

  7. Stolcke, A.: SRILM – An extensible language modeling toolkit (2002)

    Google Scholar 

  8. Rybach, D., Gollan, C., Heigold, G., Hoffmeister, B., Lööf, J., Schlüter, R., Hermann, N.: The RWTH Aachen university open source speech recognition system. In: Interspeech, pp. 2111–2114 (2009)

    Google Scholar 

  9. Stüker, S., Fügen, C., Burger, S., Wölfel, M.: Cross-system adaptation and combination for continuous speech recognition: The influence of phoneme set and acoustic front-end?(2006)

    Google Scholar 

  10. Giuliani, D., Brugnara, F.: Experiments on cross-system acoustic model adaptation. In: ASRU, pp. 117–122 (2007)

    Google Scholar 

  11. Li, X., Singh, R., Stern, R.M.: Lattice combination for improved speech recogniton. In: Interspeech (2002)

    Google Scholar 

  12. Schwenk, H.: CSLM - A modular Open-Source Continuous Space Language Modeling Toolkit. In: Interspeech (2013)

    Google Scholar 

  13. Fiscus, J.G.: A post-processing system to yield reduced word error rates: Recognizer Output Voting Error Reduction (ROVER). In: IEEE Workshop on Automatic Speech Recognition and Understanding, pp. 347–352 (1997)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Bougares, F., Deléglise, P., Estève, Y., Rouvier, M. (2013). LIUM ASR System for ETAPE French Evaluation Campaign: Experiments on System Combination Using Open-Source Recognizers. In: Habernal, I., Matoušek, V. (eds) Text, Speech, and Dialogue. TSD 2013. Lecture Notes in Computer Science(), vol 8082. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40585-3_41

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-40585-3_41

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-40584-6

  • Online ISBN: 978-3-642-40585-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics