3D Research

, 9:55 | Cite as

Evaluation of Supervised Learning Algorithms Based on Speech Features as Predictors to the Diagnosis of Mild to Moderate Intellectual Disability

  • Gaurav AggarwalEmail author
  • Latika Singh
3DR Express


Due to age-bound onset of symptoms used for diagnosis of mild to moderate intellectual disability, early diagnosis of these problems has long been a difficult issue. The diagnosis includes tests pertaining to intellectual functioning and adaptive behaviours including communication skills etc. In this paper, it is proposed to use speech features as an early indicator of the disorder which can be used to train machine learning algorithms for differentiating between speech of normally developing children and children with intellectual disability. In this paper, speech abnormalities are quantified using acoustic parameters including Linear Predictive Cepstral Coefficients, Mel Frequency Cepstral Coefficients and spectral features in speech samples of 48 participants (24 with intellectual disability and 24 age-matched controls). A training dataset was created by extracting these features which was used for learning by various classifiers. The experiments show promising results where Support Vector Machine gives an accuracy of 98%. Consequently, a well-trained classification algorithm can be used as an aid in early detection of mild to moderate intellectual disability.


Intellectual disability Speech LPCC MFCC Classification Typically developed 


  1. 1.
    Abbeduto, L., & Nuccio, J. B. (1991). Relation between receptive language and cognitive maturity in persons with mental retardation. American Journal of Mental Retardation, 96, 143–149.Google Scholar
  2. 2.
    Abbeduto, L., Furman, L., & Davies, B. (1989). Relation between the receptive language and mental age of persons with mental retardation. American Journal of Mental Retardation, 93, 535–543.Google Scholar
  3. 3.
    Aggarwal, G., & Singh, L. (2018). Classification of intellectual disability using LPC, LPCC, and WLPCC parameterization techniques. International Journal of Computers and Applications. Scholar
  4. 4.
    Ai, O. C., Hariharan, M., Yaacob, S., & Chee, L. S. (2012). Classification of speech dysfluencies with MFCC and LPCC features. Expert Systems with Applications, 39(2), 2157–2165.CrossRefGoogle Scholar
  5. 5.
    American Psychiatric Association. (2013). Diagnostic and statistical manual of mental disorders (DSM-5 ® ). Arlington: American Psychiatric Publishing.CrossRefGoogle Scholar
  6. 6.
    Antoniol, G., Rollo, V. F., & Venturi, G. (2005, May). Linear predictive coding and cepstrum coefficients for mining time variant information from software repositories. In ACM SIGSOFT software engineering notes (Vol. 30, No. 4, pp. 1–5). ACM.Google Scholar
  7. 7.
    Batshaw, M. L. (2002). Children with disabilities (5th ed.). Baltimore: Brookes.Google Scholar
  8. 8.
    Belva, B. C., Matson, J. L., Sipes, M., & Bamburg, J. W. (2012). An examination of specific communication deficits in adults with profound intellectual disabilities. Research in Developmental Disabilities, 33, 525–529.CrossRefGoogle Scholar
  9. 9.
    Berry, P. B. (1972). Comprehension of possessive and present continuous sentences by nonretarded, mildly retarded, and severely retarded children. American Journal of Mental Deficiency, 76, 540–544.Google Scholar
  10. 10.
    Biagetti, G., Crippa, P., Curzi, A., Orcioni, S., & Turchetti, C. (2015, January). Speaker identification with short sequences of speech frames. In Proceedings of the international conference on pattern recognition applications and methods (Vol. 2, pp. 178–185). SciTePress, Setúbal.Google Scholar
  11. 11.
    Biagetti, G., Crippa, P., Falaschetti, L., Orcioni, S., & Turchetti, C. (2017). An investigation on the accuracy of truncated DKLT representation for speaker identification with short sequences of speech frames. IEEE transactions on cybernetics, 47(12), 4235–4249.CrossRefGoogle Scholar
  12. 12.
    Bourlard, H., & Wellekens, C. J. (1989). Speech pattern discrimination and multilayer perceptrons. Computer Speech & Language, 3(1), 1–19.CrossRefGoogle Scholar
  13. 13.
    Cabanas, R. (1954). Some findings in speech and voice therapy among mentally deficient children. Folia Phoniatrica et Logopaedica, 6(1), 34–37.CrossRefGoogle Scholar
  14. 14.
    Cheslock, M. A., Barton-Hulsey, A., Romski, M., & Sevcik, R. A. (2008). Using a speech-generating device to enhance communicative abilities for an adult with moderate intellectual disability. Journal of Intellectual and Developmental Disability, 46, 376–386.CrossRefGoogle Scholar
  15. 15.
    Clegg, J., Hollis, C., Mawhood, L., & Rutter, M. (2005). Developmental language disorders—A follow-up in later adult life. Cognitive, language and psychosocial outcomes. Journal of Child Psychology and Psychiatry, 46(2), 128–149.CrossRefGoogle Scholar
  16. 16.
    Davis, S. B., & Mermelstein, P. (1990). Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. In Readings in speech recognition (pp. 65–74).CrossRefGoogle Scholar
  17. 17.
    Griffiths, R. (1970). The ability of young children. A study in mental measurement. London: University of London Press.Google Scholar
  18. 18.
    Harel, S., Greenstein, Y., Kramer, U., Yifat, R., Samuel, E., Nevo, Y., et al. (1996). Clinical characteristics of children referred to a child development center for evaluation of speech, language, and communication disorders. Pediatric Neurology, 15(4), 305–311.CrossRefGoogle Scholar
  19. 19.
    Hariharan, M., Chee, L. S., Ai, O. C., & Yaacob, S. (2012). Classification of speech dysfluencies using LPC based parameterization techniques. Journal of Medical Systems, 36(3), 1821–1830.CrossRefGoogle Scholar
  20. 20.
    Harrison, P., & Oakland, T. (2003). Adaptive behavior assessment system (ABAS-II). San Antonio, TX: The Psychological Corporation.Google Scholar
  21. 21.
    Hau, C. C. (Ed.). (2015). Handbook of pattern recognition and computer vision. Singapore: World Scientific.Google Scholar
  22. 22.
    Jothilakshmi, S., Ramalingam, V., & Palanivel, S. (2009). Unsupervised speaker segmentation with residual phase and MFCC features. Expert Systems with Applications, 36(6), 9799–9804.CrossRefGoogle Scholar
  23. 23.
    Kail, R. (1992). General slowing of information-processing by persons with mental retardation. American Journal of Mental Retardation, 97, 333–341.Google Scholar
  24. 24.
    Koul, R., & Clapsaddle, K. C. (2006). Effects of repeated listening experiences on the perception of synthetic speech by individuals with mild-to-moderate intellectual disabilities. Augmentative & Alternative Communication, 22, 112–122.CrossRefGoogle Scholar
  25. 25.
    Lehiste, I., & Lass, N. J. (1976). Suprasegmental features of speech. Contemporary Issues in Experimental Phonetics, 225, 239.Google Scholar
  26. 26.
    Lesser, R., & Hassip, S. (1986). Knowledge and opinions of speech therapy in teachers, doctors and nurses. Child: Care, Health and Development, 12(4), 235–249.Google Scholar
  27. 27.
    Lynch, M. P., Oller, D. K., Eilers, R. E., & Basinger, D. (1990). Vocal development of infants with Down syndrome. In University of Wisconsin 11th annual symposium for research on child language disorders, Madison, WI.Google Scholar
  28. 28.
    Maber-Aleksandrowicz, S., Avent, C., & Hassiotis, A. (2016). A systematic review of animal-assisted therapy on psychosocial outcomes in people with intellectual disability. Research in Developmental Disabilities, 49, 322–338.CrossRefGoogle Scholar
  29. 29.
    McLean, L. K., Brady, N. C., McLean, J. E., & Behrens, G. A. (1999). Communication forms and functions of children and adults with severe mental retardation in community and institutional settings. Journal of Speech, Language, and Hearing Research, 42, 231–240.CrossRefGoogle Scholar
  30. 30.
    Memisevic, H., & Hadzic, S. (2013). Speech and language disorders in children with intellectual disability in Bosnia and Herzegovina. Disability, CBR & Inclusive Development, 24(2), 92–99.CrossRefGoogle Scholar
  31. 31.
    Merrill, E. C., & Jackson, T. S. (1992). Degree of associative relatedness and sentence processing by adolescents with and without mental retardation. American Journal of Mental Retardation, 97, 173–185.Google Scholar
  32. 32.
    Merrill, E. C., & Jackson, T. S. (1992). Sentence processing by adolescents with and without mental retardation. American Journal of Mental Retardation, 97, 342–350.Google Scholar
  33. 33.
    Merrill, E. C., & Mar, H. H. (1987). Differences between mentally retarded and nonretarded persons’ efficiency of auditory sentence processing. American Journal of Mental Deficiency, 91, 406–414.Google Scholar
  34. 34.
    Mervis, C. B., & Becerra, A. M. (2007). Language and communicative development in Williams syndrome. Mental Retardation and Developmental Disabilities Research Reviews, 13(1), 3–15.CrossRefGoogle Scholar
  35. 35.
    Mervis, C. B., & Cicchetti, D. (1990). Early conceptual development of children with Down syndrome. In Children with Down syndrome: A developmental perspective (pp. 252–301).Google Scholar
  36. 36.
    Rabiner, L. R., & Juang, B. H. (1993). Fundamentals of speech recognition (Vol. 14). Englewood Cliffs: PTR Prentice Hall.Google Scholar
  37. 37.
    Räsänen, O., & Pohjalainen, J. (2013). Random subset feature selection in automatic recognition of developmental disorders, affective states, and level of conflict from speech. In INTERSPEECH (pp. 210–214).Google Scholar
  38. 38.
    Rumelhart, D. E., Hinton, G. E., & Williams, R. J. (1986). Learning representations by back-propagating errors. Nature, 323(6088), 533.zbMATHCrossRefGoogle Scholar
  39. 39.
    Schalock, R. L., Borthwick-Duffy, S. A., Bradley, V. J., Buntinx, W. H., Coulter, D. L., Craig, E. M., et al. (2010). Intellectual disability: Definition, classification, and systems of supports. Washington, DC: American Association on Intellectual and Developmental Disabilities.Google Scholar
  40. 40.
    Tager-Flusberg, H. E. L. E. N., & Sullivan, K. (1998). Children with mental retardation. In Handbook of mental retardation and development (p. 208–239).Google Scholar
  41. 41.
    Vapnik, V. N. (1999). An overview of statistical learning theory. IEEE Transactions on Neural Networks, 10(5), 988–999.CrossRefGoogle Scholar
  42. 42.
    Wechsler, D. (2012). WPPSI-IV: Wechsler Preschool and Primary Scale of Intelligence. London: Pearson, Psychological Corporation.Google Scholar
  43. 43.
    Xiao, X., Chng, E. S., & Li, H. (2008). Normalization of the speech modulation spectra for robust speech recognition. IEEE Transactions on Audio, Speech and Language Processing, 16(8), 1662–1674.CrossRefGoogle Scholar
  44. 44.
    Zhong, J., Hu, W., Soong, F., & Meng, H. (2017). DNN i-vector speaker verification with short, text-constrained test utterances. In Proceedings of Interspeech 2017 (pp. 1507–1511).Google Scholar

Copyright information

© 3D Research Center, Kwangwoon University and Springer-Verlag GmbH Germany, part of Springer Nature 2018

Authors and Affiliations

  1. 1.The NorthCap UniversityGurgaonIndia
  2. 2.Manipal University JaipurJaipurIndia

Personalised recommendations