Skip to main content

Pronunciation Error Detection Using Dynamic Time Warping Algorithm

  • Conference paper
Information Technologies in Biomedicine, Volume 4

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 284))

Abstract

In the paper a pronunciation error detection method has been presented, wchich is based on word structural features. A lowcomplexity classifier has been proposed, that is not concentrated on a limited base of error patterns, but is flexible enough to find unspecified mispronunciations. Two classification variants using Dynamic Time Warping (DTW) algorithm has been tested on speech corpus containing recordings of 30 people.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Zhao, T., Hoshino, A., Suzuki, M., Minematsu, N., Hirose, K.: Automatic chinese pronunciation error detection using svm trained with structural features. In: SLT. IEEE (2012)

    Google Scholar 

  2. Liang, M.S., Hung, J.Y., Lyu, R.Y., Chin Chiang, Y.: Pronunciation error detection for computer assisted pronunciation teaching in mandarin. In: 6th International Symposium on Chinese Spoken Language Processing, ISCSLP 2008, pp. 1–4 (2008)

    Google Scholar 

  3. Russell, M., Series, R.W., Wallace, J.L., Brown, C., Skilling, A.: The star system: an interactive pronunciation tutor for young children, pp. 161–175 (2000)

    Google Scholar 

  4. Demenko, G., Wagner, A., Cylwik, N.: The use of speech technology in foreign language pronunciation training. Archives of Acoustics 35(5), 309–330 (2010)

    Google Scholar 

  5. Demenko, G., Cylwik, N., Agnieszka, W.: Applying speech and language technology to foreign language education (2009)

    Google Scholar 

  6. Demenko, G., Wagner, A., Cylwik, N., Jokisch, O.: An audiovisual feedback system for acquiring l2 pronunciation and l2 prosody. In: SLaTE 2009 (2009)

    Google Scholar 

  7. Cylwik, N., Wagner, A., Demenko, G.: The EURONOUNCE corpus of non-native polish for ASR-based pronunciation tutoring system. In: SLaTE 2009 (2009)

    Google Scholar 

  8. Rosenberg, A., Colla, A.: A connected speech recognition system based on spotting diphone-like segments–preliminary results. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1987, vol. 12, pp. 85–88 (1987)

    Google Scholar 

  9. Saraclar, M., Nock, H.J., Khudanpur, S.: Pronunciation modeling by sharing gaussian densities across phonetic models, pp. 137–160 (2000)

    Google Scholar 

  10. Saraclar, M., Khudanpur, S.: Pronunciation change in conversational speech and its implications for automatic speech recognition, pp. 375–395 (2004)

    Google Scholar 

  11. Jokisch, O., Wagner, A., Sabo, R., Jaeckel, R., Cylwik, N., Rusko, M., Ronzhin, A., Hoffman, R.: Multilingual speech data collection for the assessment of pronunciation and prosody training in a language learning system. In: Proc. of Speech and Computer (SPECOM) (2009)

    Google Scholar 

  12. Xu, S., Jiang, J., Chen, Z., Xu, B.: Automatic pronunciation error detection based on linguistic knowledge and pronunciation space. In: ICASSP, pp. 4841–4844. IEEE (2009)

    Google Scholar 

  13. Ng, R.W.M., Hirose, K.: Syllable: A self-contained unit to model pronunciation variation. In: ICASSP, pp. 4457–4460. IEEE (2012)

    Google Scholar 

  14. Liang, M.S., Hong, Z.Y., Lyu, R.Y., Chiang, Y.C.: Data-driven approach to pronunciation error detection for computer assisted language teaching. In: Spector, J.M., Sampson, D.G., Okamoto, T., Kinshuk, C.S.A., Ueno, M., Kashihara, A. (eds.) ICALT, pp. 359–361. IEEE Computer Society (2007)

    Google Scholar 

  15. Strik, H., Truong, K.P., de Wet, F., Cucchiarini, C.: Comparing classifiers for pronunciation error detection. In: INTERSPEECH, pp. 1837–1840. ISCA (2007)

    Google Scholar 

  16. Witt, S.M.: Use of speech recognition in computer-assisted language learning (1999)

    Google Scholar 

  17. Witt, S.M., Young, S.J.: Phone-level pronunciation scoring and assessment for interactive language learning. Speech Communication 30(2-3), 95–108 (2000)

    Article  Google Scholar 

  18. Kanters, S., Cucchiarini, C., Strik, H.: The goodness of pronunciation algorithm: a detailed performance study. In: SLaTE 2009 (2009)

    Google Scholar 

  19. Wang, Y.B., Shan Lee, L.: Improved approaches of modeling and detecting error patterns with empirical analysis for computer-aided pronunciation training. In: 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 5049–5052 (2012)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Marcin Bugdol .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Bugdol, M., Segiet, Z., Kręcichwost, M. (2014). Pronunciation Error Detection Using Dynamic Time Warping Algorithm. In: Piętka, E., Kawa, J., Wieclawek, W. (eds) Information Technologies in Biomedicine, Volume 4. Advances in Intelligent Systems and Computing, vol 284. Springer, Cham. https://doi.org/10.1007/978-3-319-06596-0_32

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-06596-0_32

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-06595-3

  • Online ISBN: 978-3-319-06596-0

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics