Effect of speech segment samples selection in stutter block detection and remediation

Arbajian, Pierre; Hajja, Ayman; Raś, Zbigniew W.; Wieczorkowska, Alicja A.

doi:10.1007/s10844-019-00546-z

Published: 11 February 2019

Volume 53, pages 241–264 (2019)

Journal of Intelligent Information Systems

Pierre Arbajian¹,
Ayman Hajja²,
Zbigniew W. Raś¹˒³ &
Alicja A. Wieczorkowska³

Abstract

Speech can be remediated by identifying the segments that detract from its substance, that is, portions that may be deleted without diminishing speech quality and whose removal in fact improves the speech. Speech remediation is important when the speech is disfluent, as in the case of stuttered speech. We describe two approaches to stuttered-speech remediation, both based on identifying the segments of speech whose removal would enhance understandability in terms of both speech content and speech flow. In the first approach, we identify and extract speech segments that have weak semantic significance due to their low relative intensity; we then train several classifiers on a large set of inherent and derived features, which provides a second-layer filtering stage. The first approach was effective but required a two-step process. To streamline detection and remediation, we introduce an enhancement that eliminates the need for a domain-dependent pre-qualification stage and thereby extends disfluency detection to a broader range of speech anomalies. The new approach offers improved accuracy together with greater simplicity, flexibility, and extensibility.
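
The two-step structure of the first approach can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how relative intensity is measured, which thresholds are used, or which classifiers and features form the second stage. Here relative intensity is assumed to be frame RMS divided by the peak frame RMS, and the function names, `frame_len`, `rel_threshold`, and the `should_remove` predicate (a stand-in for a trained classifier) are all hypothetical.

```python
import math

def frame_rms(samples, frame_len):
    """RMS intensity of each non-overlapping frame of `frame_len` samples."""
    rms = []
    for start in range(0, len(samples), frame_len):
        frame = samples[start:start + frame_len]
        rms.append(math.sqrt(sum(x * x for x in frame) / len(frame)))
    return rms

def low_intensity_segments(samples, frame_len=4, rel_threshold=0.2):
    """Stage 1 (assumed form): candidate (start, end) sample ranges whose
    frame RMS is below rel_threshold * peak frame RMS."""
    rms = frame_rms(samples, frame_len)
    peak = max(rms, default=0.0)
    segments = []
    for i, value in enumerate(rms):
        if peak > 0 and value < rel_threshold * peak:
            start = i * frame_len
            end = min(start + frame_len, len(samples))
            if segments and segments[-1][1] == start:
                # Merge with the previous candidate when frames are adjacent.
                segments[-1] = (segments[-1][0], end)
            else:
                segments.append((start, end))
    return segments

def remove_segments(samples, segments, should_remove=lambda seg: True):
    """Stage 2 (placeholder): drop only the candidates that the hypothetical
    `should_remove` predicate flags, and keep everything else."""
    out, cursor = [], 0
    for start, end in segments:
        if should_remove((start, end)):
            out.extend(samples[cursor:start])
            cursor = end
    out.extend(samples[cursor:])
    return out
```

Calling `low_intensity_segments` on a signal and passing the result to `remove_segments` yields the remediated sample sequence. In a fuller pipeline, `should_remove` would wrap a classifier trained on features of each candidate segment, so that low intensity alone does not decide removal.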

Author information

Authors and Affiliations

Department of Computer Science, University of North Carolina, 9201 University City Blvd., Charlotte, NC, 28223, USA
Pierre Arbajian & Zbigniew W. Raś
Department of Computer Science, College of Charleston, 66 George Street, Charleston, SC, 29424, USA
Ayman Hajja
Polish-Japanese Academy of Information Technology, Koszykowa 86, 02-008, Warsaw, Poland
Zbigniew W. Raś & Alicja A. Wieczorkowska

Corresponding author

Correspondence to Pierre Arbajian.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Arbajian, P., Hajja, A., Raś, Z.W. et al. Effect of speech segment samples selection in stutter block detection and remediation. J Intell Inf Syst 53, 241–264 (2019). https://doi.org/10.1007/s10844-019-00546-z

Received: 16 July 2018
Revised: 16 December 2018
Accepted: 17 January 2019
Published: 11 February 2019
Issue Date: October 2019
DOI: https://doi.org/10.1007/s10844-019-00546-z
