Abstract
The process of counting stuttering events could be carried out more objectively through the automatic detection of stop-gaps, syllable repetitions and vowel prolongations. The alternative would be based on the subjective evaluations of speech fluency and may be dependent on a subjective evaluation method. Meanwhile, the automatic detection of intervocalic intervals, stop-gaps, voice onset time and vowel durations may depend on the speaker and the rules derived for a single speaker might be unreliable when trying to consider them as universal ones. This implies that learning algorithms having strong generalization capabilities could be applied to solve the problem. Nevertheless, such a system requires vectors of parameters, which characterize the distinctive features in a subject's speech patterns. In addition, an appropriate selection of the parameters and feature vectors while learning may augment the performance of an automatic detection system.
The paper reports on automatic recognition of stuttered speech in normal and frequency altered feedback speech. It presents several methods of analyzing stuttered speech and describes attempts to establish those parameters that represent stuttering event. It also reports results of some experiments on automatic detection of speech disorder events that were based on both rough sets and artificial neural networks.
Similar content being viewed by others
References
Archibald, L. and de Nil, L.F. (1999). The Relationship between Stuttering Severity and Kinesthetic Acuity for Jaw Movements in Adults who Stutter. J. Fluency Disorders, 24(1), 25–42.
Czyzewski, A. (1996). Speaker-Independent Recognition of Digits–Experiments with Neural Networks, Fuzzy Logic and Rough Sets. Intelligent Automation and Soft Computing–an International Journal, 2(2), 133–146.
Czyzewski, A., Kostek, B., and Roland-Mieszkowski, M. (1993). Digital Speech Aid Based on the Modification of the Auditory Feedback Loop. Poster paper presented at the Eurospeech'93, 3rd Eurospeech Congress, Berlin, Germany.
Czyzewski, A., Kostek, B., and Roland-Mieszkowski, M. (1994). Digital Speech Aid to decrease stuttering–Clinical Results and Patient's Reaction. In FirstWorld Congress of Fluency Disorders, Chap. 11 (pp. 526–529). Munich, Germany.
Czyzewski, A. and Skorka, P. (1996). Modification of the Auditory Loop-Effects on Hearing and Speech Production. 100th Audio Engineering Society Convention, preprint No. 4148 (C-1), Copenhagen, Denmark.
Howell, P. (1995). The Acoustic Properties of Stuttered Speech. In C.W. Starkweather and H.F.M. Peters (Eds.), Proceedings of the FirstWorld Congress on Fluency Disorders, vol. II (pp. 48–50). Nijmegen, The Netherlands: University Press Nijmegen.
Howell, P., Au-Yeung, J., and Pilgrim, L. (1999). Utterance Rate and Linguistic Properties as Determinants of Lexical Dysfluencies in Children who Stutter. J. Acoust. Soc. Amer., 105(1), 481–490.
Howell, P., Au-Yeung, J., Sackin, S., Glenn, K., and Rustin, L. (1997). Detection of Supralexical Dysfluencies in a Text Read by Child Stutterers. Journal of Fluency Disorders, 22, 299–307.
Howell, P., EI-Yaniv, N., and Powell, D.J. (1987). Factors Affecting Fluency in Stutterers when Speaking Under Altered Auditory Feedback. In H. Peters and W. Hulstijn (Eds.), Speech Motor Dynamics in Stuttering (pp. 361–369). New York: Springer Press.
Howell, P., Hamilton, A., and Kyriacopoulos, A. (1986). Automatic Detection of Repetitions and Prolongations in Stuttered Speech. Speech Input/Output: Techniques and Applications (pp. 252–256). IEE Publications.
Howell, P., Sackin, S., and Glenn, K. (1997a). Development of A Two-Stage Procedure for the Automatic Recognition of Dysfluencies in the Speech of Children Who Stutter: I. Psychometric Procedures Appropriate for Selection of Training Material for Lexical Dysfluency Classifiers. Journal of Speech, Language and Hearing Research, 40, 1073–1084.
Howell, P., Sackin, S., and Glenn, K. (1997b). Development of A Two-Stage Procedure for the Automatic Recognition of Dysfluencies in the Speech of Children Who Stutter: II. ANN Recognition of Repetitions and Prolongations with Supplied Word Segment Markers. Journal of Speech, Language and Hearing Research, 40, 1085–1096.
Howell, P., Staveley, A., Sackin, S., and Rustin, L. (1995). Comparison of Speech Motor Development in Stutterers and Fluent Speakers between 7 and 12 Years Old. J. Fluency Disorders, 20(1), 243–255.
Howell, P., Staveley, A., Sackin, S., and Rustin, L. (1998). Methods of Interval Selection, Presence of Noise and Their Effects on Detectability of Repetitions and Prolongations. J. Acoust. Soc. Amer., 104(6), 3558–3567.
Howell, P. and Vause, L. (1986). Acoustic Analysis and Perception of Vowels in Stuttered Speech. Journal of the Acoustical Society of America, 79, 1571–1579.
Howell, P. and Williams, M. (1988). The Contribution of the Excitatory Source to the Perception of Neutral Vowels in Stuttered Speech. Journal of the Acoustical Society of America, 84, 80–89.
Howell, P. and Williams, M. (1992). Acoustic Analysis and Perception of Vowels in Children's and Teenagers' Stuttered Speech. Journal of the Acoustical Society of America, 91, 1697–1706.
Howell, P. and Wingfield, T. (1990). Perceptual and Acoustic Evidence for Reduced Fluency Speech in theVicinity of Stuttering Episodes. Lang. Speech, 33(1), 31–46.
Kaczmarek, A. and Skorka, P. (1997). Investigation of the Pitch of the Vocal Tone for Stuttered Speech. 102nd Audio Engineering Society Convention, preprint No. 4486 (L7), Munich, Germany.
Kalinowski, J., Armson, J., and Stuart, A. (1995). Effect of Normal and Fast Articulatory Rates on Stuttering. J. Fluency Disorders, 20(3), 293–302.
Kostek, B. (1999). Soft Computing in Acoustics, Applications of Neural Networks, Fuzzy Logic and Rough Sets to Musical Acoustics, Studies in Fuzziness and Soft Computing. Physica Verlag, Heilderberg, New York.
Kuniszyk-Jozkowiak, A. (1995). The Statistical Analysis of Speech Envelopes in Stutterers and Non-Stutterers. J. Fluency Disorders, 20(1), 11–23.
Kuniszyk-Jozkowiak, W. (1996). A Comparison of Speech Envelopes of Stutterers and Nonstutterers. J. Acoust. Soc. Amer., 100(2), 1105–1110.
Lee, B. (1950). Effects of Delayed Speech Feedback. J. Acoust. Soc. Amer., 22, 824–826.
Michaelis, D., Frohlich, M., and Strube, H.W. (1998). Selection and Combination of Acoustic Features for the Description of Pathologic Voices. J. Acoust. Soc. Amer., 103(3), 1628–1639.
Pawlak, Z. (1982). Rough sets. J. Computer and Information Science, 11(5), 341–356.
Polkowski, L. and Skowron, A. (Eds.) (1998). Rough Sets in Knowledge Discovery 1: Methodology and Applications. Heidelberg and New York: Physica-Verlag.
Robb, M. and Blomgren, M. (1997). Analysis of F2 Transitions in the Speech of Stutterers and Non-stutterers. J. Fluency Disorders, 22(1), 1–16.
Roland-Mieszkowski, M., Czyzewski, A., and Kostek, B. (1995). DSA (Digital Speech Aid)–A New Device to Decrease or Eliminate Stuttering. In First World Congress of Fluency Disorders. Chap. 11 (pp. 535–539). Munich, Germany.
Slowinski, R. (Ed.) (1992). Intelligent Decision Support. Handbook on Applications and Advances of the Rough Sets Theory. Dordrecht/Boston/London: Kluwer Academic Publisher.
Zebrowski, P.M. (1991). Duration of the Speech Disfluencies of Beginning Stutterers. J. Speech-Hear-Res., 34(3).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Czyzewski, A., Kaczmarek, A. & Kostek, B. Intelligent Processing of Stuttered Speech. Journal of Intelligent Information Systems 21, 143–171 (2003). https://doi.org/10.1023/A:1024710532716
Issue Date:
DOI: https://doi.org/10.1023/A:1024710532716