Abstract
For humans, speech is one of the vital communication channel used for interchanging information, knowledge, and thoughts. Identifying the age of a person based on his/her speech is an essential part of speech therapy and many telecommunication applications. Many speech-related disorders can be diagnosed and cured using age identification at early ages. Depending on the age group, particular speech therapy can be given to a child. In this research, typical speech sentences were used to identify the age of 200 Indian children from the age group of 4–8 years. Linear predictive cepstral coefficients (LPCC) (formant frequencies) was applied to extract 128 acoustic features using sustained phonation, reading and imitation tasks. Artificial neural network (ANN) and support vector machine (SVM) were used to build two classification models. Comparisons were made on classification accuracy. Classification results were substantially higher between the age group of 4 and 8 years. This work will further be extended to gender classification with more robust features and algorithms.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Eguchi, S., Hirsh, I.J.: Development of speech sounds in children. Acta Oto-Laryngol. Suppl. 257, 1–51 (1968)
Bennett, S.: Vowel formant frequency characteristics of preadolescent males and females. J. Acoust. Soc. Am. 69, 231–238 (1981)
Busby, P.A., Plant, G.L.: Formant frequency values of vowels produced by preadolescent boys and grils. J. Acoust. Soc. Am. 97(4), 2603–2606 (1995)
Lee, S., Potamianos, A., Narayanan, S.: Acoustics of children’s speech: developmental changes of temporal and spectral parameters. J. Acoust. Soc. Am. 105, 1455–1468 (1999)
Xue, W.R., Hao, G.J.: Changes in the human vocal tract due to aging and acoustic correlates of speech production: a pilot study. J. Speech Lang. Hear. Res. 46, 689–701 (2003)
Harrington, J., Palethorpe, S., Watson, C.I.: Age-related changes in fundamental frequency and formants: a longitudinal study of four speakers. In: INTERSPEECH-2007, pp. 2753–2756 (2007)
Reubold, U., Harrimgton, J., Kleber, F.: Vocal aging effect on F0 and the first formant: a longitudinal analysis in adult speakers. Speech Commun. 52, 638–651 (2010)
Huang, X., Acero, A., Hon, H.: Spoken Language Processing: A Guide to Theory, Algorithm, and System Development. Prentice Hall PTR, Upper Saddle River, NJ, USA (2001)
Rabiner, L., Juang, B.: Fundamentals of Speech Recognition. Prentice Hall (1993)
Deng, L., Xiao, L.: Machine learning paradigms for speech recognition: an overview. IEEE Trans. Audio Speech Lang. Process. 21(5) (2013)
Jiang, H., Bai, J., Zhang, S., Xu, B.: SVM-based audio scene classification. In: Natural Language Processing and Knowledge Engineering, IEEE NLP-KE’05, pp. 131–136 (2005)
Guo, G., Li, S.Z.: Content-based audio classification and retrieval by support vector machines. IEEE Trans. Neural Netw. 14(1), 209–215 (2003)
Master, T.: Practical Neural Network Recipes. Wiley, New York (1993)
White, P.: Formant Frequency Analysis of Children’s Spoken and Sung Vowels Using Sweeping Fundamental Frequency Production (1999)
Hillenbrand, J., Getty, L.A., Clark, M.J., Wheeler, K.: Acoustic characteristics of American English Vowel. J. Acoust. Soc. Am. 97(5), 3099–3111 (1995)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Aggarwal, G., Singh, L. (2019). Age Classification with LPCC Features Using SVM and ANN. In: Fong, S., Akashe, S., Mahalle, P. (eds) Information and Communication Technology for Competitive Strategies. Lecture Notes in Networks and Systems, vol 40. Springer, Singapore. https://doi.org/10.1007/978-981-13-0586-3_40
Download citation
DOI: https://doi.org/10.1007/978-981-13-0586-3_40
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-0585-6
Online ISBN: 978-981-13-0586-3
eBook Packages: EngineeringEngineering (R0)