The FBK ASR System for Evalita 2011

Ronny, Ronny; Shakoor, Aamir; Brugnara, Fabio; Gretter, Roberto

doi:10.1007/978-3-642-35828-9_32

Conference paper

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 7689)

Abstract

This report describes the system used at FBK to participate in the large-vocabulary Automatic Speech Recognition tasks of the Evalita 2011 evaluation campaign. The paper provides some details on the techniques included in the transcription system. The official FBK submissions covered only the closed modality, where only data distributed within the campaign could be exploited. This paper also reports results obtained with a system trained on larger corpora, making it possible to appreciate the difference between the two modalities.

Author information

Authors and Affiliations

FBK-irst, via Sommarive 18, Povo, TN, 38123, Italy
Ronny Ronny, Aamir Shakoor, Fabio Brugnara & Roberto Gretter

Editor information

Editors and Affiliations

Fondazione Bruno Kessler, Via Sommarive 18, 38123, Povo, TN, Italy
Bernardo Magnini
University of Naples, Via Cinthia, 80126, Napoli, NA, Italy
Francesco Cutugno
Fondazione Ugo Bordoni, Viale del Policlinico, 161, Roma, Italy
Mauro Falcone
CELCT, Via alla Cascata, 38123, Povo, TN, Italy
Emanuele Pianta

About this paper

Cite this paper

Ronny, R., Shakoor, A., Brugnara, F., Gretter, R. (2013). The FBK ASR System for Evalita 2011. In: Magnini, B., Cutugno, F., Falcone, M., Pianta, E. (eds) Evaluation of Natural Language and Speech Tools for Italian. EVALITA 2012. Lecture Notes in Computer Science, vol 7689. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35828-9_32

DOI: https://doi.org/10.1007/978-3-642-35828-9_32
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35827-2
Online ISBN: 978-3-642-35828-9
eBook Packages: Computer Science, Computer Science (R0)