Abstract
In this paper, we explore the automatic recognition of psychoneurological states (Autism Spectrum Disorders, Down Syndrome, and Typical Development) in children aged 7–10 years from their speech in Russian. We describe the results of fully automatic recognition on our proprietary speech dataset. As the classifier we used a support vector machine (SVM), trained on the ComParE feature set from the Computational Paralinguistics Challenges. On our dataset, automated recognition of the psychoneurological state of children aged 7–10 years from their speech showed high performance. The results are of theoretical and practical value: they will expand knowledge about the uniqueness of the human voice, about the possibilities of diagnosing psychoneurological states from voice and speech features, and about the creation of alternative communication systems.
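To make the classification setup concrete, the following is a minimal sketch of a three-class linear SVM (one-vs-rest, trained by stochastic gradient descent on the hinge loss). It is not the authors' implementation: the feature vectors are synthetic stand-ins for per-utterance acoustic features, and every function name, the feature dimension, and the hyperparameters are assumptions chosen for illustration only.

```python
import random

CLASSES = ["ASD", "DS", "TD"]  # the three states named in the abstract


def dot(w, x):
    return sum(wi * xi for wi, xi in zip(w, x))


def make_synthetic_features(n_per_class, dim, rng):
    """Hypothetical stand-in for acoustic feature vectors (not real data)."""
    data = []
    for label in range(len(CLASSES)):
        for _ in range(n_per_class):
            vec = [rng.gauss(0.0, 1.0) for _ in range(dim)]
            vec[label] += 6.0  # shift one axis per class so classes separate
            data.append((vec, label))
    return data


def train_binary_svm(data, target, dim, rng, epochs=30, lam=0.01, lr=0.05):
    """Linear SVM for `target` vs. rest: SGD on the L2-regularized hinge loss."""
    w, b = [0.0] * dim, 0.0
    samples = list(data)
    for _ in range(epochs):
        rng.shuffle(samples)
        for x, label in samples:
            y = 1.0 if label == target else -1.0
            margin = y * (dot(w, x) + b)
            # Gradient step on the regularizer (weight decay).
            w = [wi * (1.0 - lr * lam) for wi in w]
            # Gradient step on the hinge term, active only inside the margin.
            if margin < 1.0:
                w = [wi + lr * y * xi for wi, xi in zip(w, x)]
                b += lr * y
    return w, b


def train_one_vs_rest(data, dim, rng):
    return [train_binary_svm(data, t, dim, rng) for t in range(len(CLASSES))]


def predict(models, x):
    """Return the index of the class whose binary SVM gives the highest score."""
    scores = [dot(w, x) + b for w, b in models]
    return max(range(len(scores)), key=lambda i: scores[i])


def accuracy(models, data):
    hits = sum(1 for x, label in data if predict(models, x) == label)
    return hits / len(data)


if __name__ == "__main__":
    rng = random.Random(0)
    train = make_synthetic_features(40, 5, rng)
    test = make_synthetic_features(20, 5, rng)
    models = train_one_vs_rest(train, 5, rng)
    print("held-out accuracy on synthetic data:", accuracy(models, test))
```

In practice the synthetic vectors would be replaced by features extracted from the recordings, and a library SVM with tuned hyperparameters would normally be preferred over this hand-written trainer.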
Acknowledgments
The study is financially supported by the Russian Science Foundation (project № 18-18-00063) and the Russian Foundation for Basic Research (project 19-57-45008–IND_a).
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Matveev, Y., Matveev, A., Frolova, O., Lyakso, E. (2021). Automatic Recognition of the Psychoneurological State of Children: Autism Spectrum Disorders, Down Syndrome, Typical Development. In: Karpov, A., Potapova, R. (eds) Speech and Computer. SPECOM 2021. Lecture Notes in Computer Science, vol 12997. Springer, Cham. https://doi.org/10.1007/978-3-030-87802-3_38
DOI: https://doi.org/10.1007/978-3-030-87802-3_38
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-87801-6
Online ISBN: 978-3-030-87802-3
eBook Packages: Computer Science, Computer Science (R0)