Error Detection in Broadcast News ASR Using Markov Chains

Pellegrini, Thomas; Trancoso, Isabel

doi:10.1007/978-3-642-20095-3_6

Thomas Pellegrini²⁰ &
Isabel Trancoso²⁰

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6562))

Included in the following conference series:

Language and Technology Conference

1084 Accesses
1 Citations

Abstract

This article addresses error detection in broadcast news automatic transcription, as a post-processing stage. Based on the observation that many errors appear in bursts, we investigated the use of Markov Chains (MC) for their temporal modelling capabilities. Experiments were conducted on a large Amercian English broadcast news corpus from NIST. Common features in error detection were used, all decoder-based. MC classification performance was compared with a discriminative maximum entropy model (Maxent), currently used in our in-house decoder to estimate confidence measures, and also with Gaussian Mixture Models (GMM). The MC classifier obtained the best results, by detecting 16.2% of the errors, with the lowest classification error rate of 16.7%. To be compared with the GMM classifier, MC allowed to lower the number of false detections, by 23.5% relative. The Maxent system achieved the same CER, but detected only 7.2% of the errors.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Gillick, L., Ito, Y., Young, J.: A probabilistic approach to confidence estimation and evaluation. In: Proceedings of ICASSP, Munich, pp. 879–882 (1997)
Google Scholar
Allauzen, A.: Error detection in confusion. In: Proceedings of INTERSPEECH, Antwerp, pp. 1749–1752 (2007)
Google Scholar
Weintraub, M., Beaufays, F., Rivlin, Z., Konig, Y., Stolcke, A.: Neural - Network Based Measures of Confidence for Word Recognition. In: Proceedings of ICASSP, Los Alamitos, pp. 887–890 (1997)
Google Scholar
Xue, J., Zhao, Y.: Random forests-based confidence annotation using novel features from confusion network. In: Proceedings of ICASSP, Toulouse, pp. 1149–1152 (2006)
Google Scholar
Hillard, D., Ostendorf, M.: Compensating for Word Posterior Estimation Bias in Confusion Networks. In: Proceedings of ICASSP, Toulouse, pp. 1153–1156 (2006)
Google Scholar
Schwartz, R., Nguyen, L., Kubala, F., Chou, G., Zavaliagkos, G., Makhoul, J.: On Using Written Language Training Data for Spoken Language Modeling. In: Proceedings of ACL, New Jersey, pp. 94–97 (1994)
Google Scholar
Ratnaparkhi, A.: A Maximum Entropy Model for Part-Of-Speech Tagging. In: Proceedings of EMLNP, Philadelphia, pp. 133–142 (1996)
Google Scholar
Jaynes, E.T.: Information theory and statistical mechanics. Physical review 106(4), 620–630 (1957)
Article MathSciNet MATH Google Scholar
Rabiner, L.R., Juang, B.H.: An Introduction to Hidden Markov Models. IEEE Acoustics Speech and Signal Processing Magazine ASSP-3(1), 4–16 (1986)
Google Scholar
Bishop, C.: Pattern recognition and machine learning. Springer, Heidelberg (2006)
MATH Google Scholar
Meinedo, H., Caseiro, D., Neto, J., Trancoso, I.: AUDIMUS.MEDIA: A broadcast news speech recognition system for the european portuguese language. In: Mamede, N.J., Baptista, J., Trancoso, I., Nunes, M.d.G.V. (eds.) PROPOR 2003. LNCS, vol. 2721, pp. 9–17. Springer, Heidelberg (2003)
Chapter Google Scholar
Abad, A., Neto, J.: Incorporating Acoustical Modelling of Phone Transitions in a Hybrid ANN/HMM Speech Recognizer. In: Proceedings of INTERSPEECH, Brisbane, pp. 2394–2397 (2008)
Google Scholar
Marujo, L., Lopes, J., Mamede, N., Trancoso, I., Pino, J., Eskenazi, M., Baptista, J., Viana, C.: Porting REAP to European Portuguese. In: SLATE 2009 - Speech and Language Technology in Education, Brighton (2009)
Google Scholar

Download references

Author information

Authors and Affiliations

INESC-ID, R. Alves Redol, 9, 1000-029, LISBON, Portugal
Thomas Pellegrini & Isabel Trancoso

Authors

Thomas Pellegrini
View author publications
You can also search for this author in PubMed Google Scholar
Isabel Trancoso
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Mathematics and Computer Science, Adam Mickiewicz University in Poznan, ul. Umultowska 87, 61614, Poznan, Poland
Zygmunt Vetulani

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Pellegrini, T., Trancoso, I. (2011). Error Detection in Broadcast News ASR Using Markov Chains. In: Vetulani, Z. (eds) Human Language Technology. Challenges for Computer Science and Linguistics. LTC 2009. Lecture Notes in Computer Science(), vol 6562. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20095-3_6

Download citation

DOI: https://doi.org/10.1007/978-3-642-20095-3_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-20094-6
Online ISBN: 978-3-642-20095-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics