Detecting Stuttering Events in Transcripts of Children’s Speech

Alharbi, Sadeen; Hasan, Madina; Simons, Anthony J. H.; Brumfitt, Shelagh; Green, Phil

doi:10.1007/978-3-319-68456-7_18

Sadeen Alharbi¹⁶,
Madina Hasan¹⁶,
Anthony J. H. Simons¹⁶,
Shelagh Brumfitt¹⁷ &
…
Phil Green¹⁶

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10583))

Included in the following conference series:

International Conference on Statistical Language and Speech Processing

817 Accesses
9 Citations
1 Altmetric

Abstract

Stuttering is a common problem in childhood that may persist into adulthood if not treated in early stages. Techniques from spoken language understanding may be applied to provide automated diagnosis of stuttering from children speech. The main challenges however lie in the lack of training data and the high dimensionality of this data. This study investigates the applicability of machine learning approaches for detecting stuttering events in transcripts. Two machine learning approaches were applied, namely HELM and CRF. The performance of these two approaches are compared, and the effect of data augmentation is examined in both approaches. Experimental results show that CRF outperforms HELM by 2.2% in the baseline experiments. Data augmentation helps improve systems performance, especially for rarely available events. In addition to the annotated augmented data, this study also adds annotated human transcriptions from real stuttered children’s speech to help expand the research in this field.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Brundage, S.B., Bothe, A.K., Lengeling, A.N., Evans, J.J.: Comparing judgments of stuttering made by students, clinicians, and highly experienced judges. J. Fluen. Dis. 31(4), 271–283 (2006)
Article Google Scholar
Craig, A., Calver, P.: Following up on treated stutterers studies of perceptions of fluency and job status. J. Speech Lang. Hear. Res. 34(2), 279–284 (1991)
Article Google Scholar
Geetha, Y., Pratibha, K., Ashok, R., Ravindra, S.K.: Classification of childhood disfluencies using neural networks. J. Fluen. Dis. 25(2), 99–117 (2000)
Article Google Scholar
Gregory, H.H., Campbell, J.H., Gregory, C.B., Hill, D.G.: Stuttering Therapy: Rationale and Procedures. Allyn & Bacon, Boston (2003)
Google Scholar
Hayhow, R., Cray, A.M., Enderby, P.: Stammering and therapy views of people who stammer. J. Fluen. Dis. 27(1), 1–17 (2002)
Article Google Scholar
Heeman, P.A., Lunsford, R., McMillin, A., Yaruss, J.S.: Using clinician annotations to improve automatic speech recognition of stuttered speech. In: Interspeech 2016, pp. 2651–2655 (2016)
Google Scholar
Heeman, P.A., McMillin, A., Yaruss, J.S.: Computer-assisted disfluency counts for stuttered speech. In: INTERSPEECH, pp. 3013–3016 (2011)
Google Scholar
Howell, P., Davis, S., Bartrip, J.: The university college London archive of stuttered speech (uclass). J. Speech Lang. Hear. Res. 52(2), 556–569 (2009)
Article Google Scholar
Howell, P., Sackin, S.: Automatic recognition of repetitions and prolongations in stuttered speech. In: Proceedings of the first World Congress on fluency disorders, vol. 2, pp. 372–374 (1995)
Google Scholar
Jaitly, N., Hinton, G.E.: Vocal tract length perturbation (VTLP) improves speech recognition. In: Proceeding ICML Workshop on Deep Learning for Audio, Speech and Language (2013)
Google Scholar
Jones, M., Onslow, M., Packman, A., Williams, S., Ormond, T., Schwarz, I., Gebski, V.: Randomised controlled trial of the lidcombe programme of early stuttering intervention. BMJ 331(7518), 659 (2005). http://www.bmj.com/content/331/7518/659
Article Google Scholar
Kudoh, T.: CRF++ (2007). https://sourceforge.net/projects/crfpp/
Liao, H., Pundak, G., Siohan, O., Carroll, M., Coccaro, N., Jiang, Q.M., Sainath, T.N., Senior, A., Beaufays, F., Bacchiani, M.: Large vocabulary automatic speech recognition for children. In: Interspeech (2015)
Google Scholar
Liu, H., Gegov, A., Cocea, M.: Complexity control in rule based models for classification in machine learning context. In: Angelov, P., Gegov, A., Jayne, C., Shen, Q. (eds.) Advances in Computational Intelligence Systems. AISC, vol. 513, pp. 125–143. Springer, Cham (2017). doi:10.1007/978-3-319-46562-3_9
Chapter Google Scholar
Mahesha, P., Vinod, D.S.: Using orthographic transcripts for stuttering dysfluency recognition and severity estimation. In: Jain, L.C., Patnaik, S., Ichalkaranje, N. (eds.) Intelligent Computing, Communication and Devices. AISC, vol. 308, pp. 613–621. Springer, New Delhi (2015). doi:10.1007/978-81-322-2012-1_66
Chapter Google Scholar
Juste, F.S., de Andrade, C.R.F.: Speech disfluency types of fluent and stuttering individuals: age effects. Folia Phoniatr. et Logop. 63(2), 57–64 (2010)
Article Google Scholar
Stolcke, A., Shriberg, E.: Statistical language modeling for speech disfluencies. In: 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP-1996, vol. 1, pp. 405–408. IEEE (1996)
Google Scholar
Stolcke, A., et al.: SRILM-an extensible language modeling toolkit. In: Interspeech, pp. 901–904 (2002)
Google Scholar
Tseng, H., Chang, P., Andrew, G., Jurafsky, D., Manning, C.: A conditional random field word segmenter for sighan bakeoff 2005. In: Proceedings of the Fourth SIGHAN Workshop on Chinese Language Processing, vol. 171. Citeseer (2005)
Google Scholar
Vertanen, K.: Csr lm-1 language model training recipe (2007)
Google Scholar
Yairi, E., Ambrose, N.G.: Early childhood stuttering ipersistency and recovery rates. J. Speech Lang. Hear. Res. 42(5), 1097–1112 (1999)
Article Google Scholar

Download references

Acknowledgments

This research has been supported by the Saudi Ministry of Education, King Saud University.

Author information

Authors and Affiliations

Department of Computer Science, The University of Sheffield, Sheffield, UK
Sadeen Alharbi, Madina Hasan, Anthony J. H. Simons & Phil Green
Department of Human Communication Sciences, The University of Sheffield, Sheffield, UK
Shelagh Brumfitt

Authors

Sadeen Alharbi
View author publications
You can also search for this author in PubMed Google Scholar
Madina Hasan
View author publications
You can also search for this author in PubMed Google Scholar
Anthony J. H. Simons
View author publications
You can also search for this author in PubMed Google Scholar
Shelagh Brumfitt
View author publications
You can also search for this author in PubMed Google Scholar
Phil Green
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sadeen Alharbi .

Editor information

Editors and Affiliations

University of Le Mans, Le Mans, France
Nathalie Camelin
University of Le Mans, Le Mans, France
Yannick Estève
Rovira i Virgili University, Tarragona, Spain
Carlos Martín-Vide

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Alharbi, S., Hasan, M., Simons, A.J.H., Brumfitt, S., Green, P. (2017). Detecting Stuttering Events in Transcripts of Children’s Speech. In: Camelin, N., Estève, Y., Martín-Vide, C. (eds) Statistical Language and Speech Processing. SLSP 2017. Lecture Notes in Computer Science(), vol 10583. Springer, Cham. https://doi.org/10.1007/978-3-319-68456-7_18

Download citation

DOI: https://doi.org/10.1007/978-3-319-68456-7_18
Published: 27 September 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-68455-0
Online ISBN: 978-3-319-68456-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics