Pronunciation Error Detection Using Dynamic Time Warping Algorithm

Bugdol, Marcin; Segiet, Zuzanna; Kręcichwost, Michał

doi:10.1007/978-3-319-06596-0_32

Marcin Bugdol⁵,
Zuzanna Segiet⁵ &
Michał Kręcichwost⁵

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 284))

1271 Accesses
3 Citations

Abstract

In the paper a pronunciation error detection method has been presented, wchich is based on word structural features. A lowcomplexity classifier has been proposed, that is not concentrated on a limited base of error patterns, but is flexible enough to find unspecified mispronunciations. Two classification variants using Dynamic Time Warping (DTW) algorithm has been tested on speech corpus containing recordings of 30 people.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Zhao, T., Hoshino, A., Suzuki, M., Minematsu, N., Hirose, K.: Automatic chinese pronunciation error detection using svm trained with structural features. In: SLT. IEEE (2012)
Google Scholar
Liang, M.S., Hung, J.Y., Lyu, R.Y., Chin Chiang, Y.: Pronunciation error detection for computer assisted pronunciation teaching in mandarin. In: 6th International Symposium on Chinese Spoken Language Processing, ISCSLP 2008, pp. 1–4 (2008)
Google Scholar
Russell, M., Series, R.W., Wallace, J.L., Brown, C., Skilling, A.: The star system: an interactive pronunciation tutor for young children, pp. 161–175 (2000)
Google Scholar
Demenko, G., Wagner, A., Cylwik, N.: The use of speech technology in foreign language pronunciation training. Archives of Acoustics 35(5), 309–330 (2010)
Google Scholar
Demenko, G., Cylwik, N., Agnieszka, W.: Applying speech and language technology to foreign language education (2009)
Google Scholar
Demenko, G., Wagner, A., Cylwik, N., Jokisch, O.: An audiovisual feedback system for acquiring l2 pronunciation and l2 prosody. In: SLaTE 2009 (2009)
Google Scholar
Cylwik, N., Wagner, A., Demenko, G.: The EURONOUNCE corpus of non-native polish for ASR-based pronunciation tutoring system. In: SLaTE 2009 (2009)
Google Scholar
Rosenberg, A., Colla, A.: A connected speech recognition system based on spotting diphone-like segments–preliminary results. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1987, vol. 12, pp. 85–88 (1987)
Google Scholar
Saraclar, M., Nock, H.J., Khudanpur, S.: Pronunciation modeling by sharing gaussian densities across phonetic models, pp. 137–160 (2000)
Google Scholar
Saraclar, M., Khudanpur, S.: Pronunciation change in conversational speech and its implications for automatic speech recognition, pp. 375–395 (2004)
Google Scholar
Jokisch, O., Wagner, A., Sabo, R., Jaeckel, R., Cylwik, N., Rusko, M., Ronzhin, A., Hoffman, R.: Multilingual speech data collection for the assessment of pronunciation and prosody training in a language learning system. In: Proc. of Speech and Computer (SPECOM) (2009)
Google Scholar
Xu, S., Jiang, J., Chen, Z., Xu, B.: Automatic pronunciation error detection based on linguistic knowledge and pronunciation space. In: ICASSP, pp. 4841–4844. IEEE (2009)
Google Scholar
Ng, R.W.M., Hirose, K.: Syllable: A self-contained unit to model pronunciation variation. In: ICASSP, pp. 4457–4460. IEEE (2012)
Google Scholar
Liang, M.S., Hong, Z.Y., Lyu, R.Y., Chiang, Y.C.: Data-driven approach to pronunciation error detection for computer assisted language teaching. In: Spector, J.M., Sampson, D.G., Okamoto, T., Kinshuk, C.S.A., Ueno, M., Kashihara, A. (eds.) ICALT, pp. 359–361. IEEE Computer Society (2007)
Google Scholar
Strik, H., Truong, K.P., de Wet, F., Cucchiarini, C.: Comparing classifiers for pronunciation error detection. In: INTERSPEECH, pp. 1837–1840. ISCA (2007)
Google Scholar
Witt, S.M.: Use of speech recognition in computer-assisted language learning (1999)
Google Scholar
Witt, S.M., Young, S.J.: Phone-level pronunciation scoring and assessment for interactive language learning. Speech Communication 30(2-3), 95–108 (2000)
Article Google Scholar
Kanters, S., Cucchiarini, C., Strik, H.: The goodness of pronunciation algorithm: a detailed performance study. In: SLaTE 2009 (2009)
Google Scholar
Wang, Y.B., Shan Lee, L.: Improved approaches of modeling and detecting error patterns with empirical analysis for computer-aided pronunciation training. In: 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 5049–5052 (2012)
Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Biomedical Engineering, Silesian University of Technology, Zabrze, Poland
Marcin Bugdol, Zuzanna Segiet & Michał Kręcichwost

Authors

Marcin Bugdol
View author publications
You can also search for this author in PubMed Google Scholar
Zuzanna Segiet
View author publications
You can also search for this author in PubMed Google Scholar
Michał Kręcichwost
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Marcin Bugdol .

Editor information

Editors and Affiliations

Faculty of Biomedical Engineering, Silesian University of Technology, Gliwice, Poland
Ewa Piętka
Faculty of Biomedical Engineering, Silesian University of Technology, Gliwice, Poland
Jacek Kawa
Faculty of Biomedical Engineering, Silesian University of Technology, Gliwice, Poland
Wojciech Wieclawek

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Bugdol, M., Segiet, Z., Kręcichwost, M. (2014). Pronunciation Error Detection Using Dynamic Time Warping Algorithm. In: Piętka, E., Kawa, J., Wieclawek, W. (eds) Information Technologies in Biomedicine, Volume 4. Advances in Intelligent Systems and Computing, vol 284. Springer, Cham. https://doi.org/10.1007/978-3-319-06596-0_32

Download citation

DOI: https://doi.org/10.1007/978-3-319-06596-0_32
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-06595-3
Online ISBN: 978-3-319-06596-0
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics