Age Classification with LPCC Features Using SVM and ANN

Aggarwal, Gaurav; Singh, Latika

doi:10.1007/978-981-13-0586-3_40

Gaurav Aggarwal⁵ &
Latika Singh⁶

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 40))

1076 Accesses
1 Citations

Abstract

For humans, speech is one of the vital communication channel used for interchanging information, knowledge, and thoughts. Identifying the age of a person based on his/her speech is an essential part of speech therapy and many telecommunication applications. Many speech-related disorders can be diagnosed and cured using age identification at early ages. Depending on the age group, particular speech therapy can be given to a child. In this research, typical speech sentences were used to identify the age of 200 Indian children from the age group of 4–8 years. Linear predictive cepstral coefficients (LPCC) (formant frequencies) was applied to extract 128 acoustic features using sustained phonation, reading and imitation tasks. Artificial neural network (ANN) and support vector machine (SVM) were used to build two classification models. Comparisons were made on classification accuracy. Classification results were substantially higher between the age group of 4 and 8 years. This work will further be extended to gender classification with more robust features and algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Eguchi, S., Hirsh, I.J.: Development of speech sounds in children. Acta Oto-Laryngol. Suppl. 257, 1–51 (1968)
Google Scholar
Bennett, S.: Vowel formant frequency characteristics of preadolescent males and females. J. Acoust. Soc. Am. 69, 231–238 (1981)
Article Google Scholar
Busby, P.A., Plant, G.L.: Formant frequency values of vowels produced by preadolescent boys and grils. J. Acoust. Soc. Am. 97(4), 2603–2606 (1995)
Article Google Scholar
Lee, S., Potamianos, A., Narayanan, S.: Acoustics of children’s speech: developmental changes of temporal and spectral parameters. J. Acoust. Soc. Am. 105, 1455–1468 (1999)
Article Google Scholar
Xue, W.R., Hao, G.J.: Changes in the human vocal tract due to aging and acoustic correlates of speech production: a pilot study. J. Speech Lang. Hear. Res. 46, 689–701 (2003)
Article Google Scholar
Harrington, J., Palethorpe, S., Watson, C.I.: Age-related changes in fundamental frequency and formants: a longitudinal study of four speakers. In: INTERSPEECH-2007, pp. 2753–2756 (2007)
Google Scholar
Reubold, U., Harrimgton, J., Kleber, F.: Vocal aging effect on F0 and the first formant: a longitudinal analysis in adult speakers. Speech Commun. 52, 638–651 (2010)
Article Google Scholar
Huang, X., Acero, A., Hon, H.: Spoken Language Processing: A Guide to Theory, Algorithm, and System Development. Prentice Hall PTR, Upper Saddle River, NJ, USA (2001)
Google Scholar
Rabiner, L., Juang, B.: Fundamentals of Speech Recognition. Prentice Hall (1993)
Google Scholar
Deng, L., Xiao, L.: Machine learning paradigms for speech recognition: an overview. IEEE Trans. Audio Speech Lang. Process. 21(5) (2013)
Google Scholar
Jiang, H., Bai, J., Zhang, S., Xu, B.: SVM-based audio scene classification. In: Natural Language Processing and Knowledge Engineering, IEEE NLP-KE’05, pp. 131–136 (2005)
Google Scholar
Guo, G., Li, S.Z.: Content-based audio classification and retrieval by support vector machines. IEEE Trans. Neural Netw. 14(1), 209–215 (2003)
Article MathSciNet Google Scholar
Master, T.: Practical Neural Network Recipes. Wiley, New York (1993)
Google Scholar
White, P.: Formant Frequency Analysis of Children’s Spoken and Sung Vowels Using Sweeping Fundamental Frequency Production (1999)
Google Scholar
Hillenbrand, J., Getty, L.A., Clark, M.J., Wheeler, K.: Acoustic characteristics of American English Vowel. J. Acoust. Soc. Am. 97(5), 3099–3111 (1995)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Manipal University Jaipur, VPO Dehmikalan, Ajmer Highway, Jaipur, India
Gaurav Aggarwal
The NorthCap University, Sector-23A, Gurugram, India
Latika Singh

Authors

Gaurav Aggarwal
View author publications
You can also search for this author in PubMed Google Scholar
Latika Singh
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Gaurav Aggarwal .

Editor information

Editors and Affiliations

Department of Computer and Information Science, University of Macau, Macau, China
Simon Fong
Department of Electronics and Communication Engineering, ITM University, Gwalior, India
Shyam Akashe
Smt. Kashibai Navale College of Engineering, Pune, India
Parikshit N. Mahalle

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Aggarwal, G., Singh, L. (2019). Age Classification with LPCC Features Using SVM and ANN. In: Fong, S., Akashe, S., Mahalle, P. (eds) Information and Communication Technology for Competitive Strategies. Lecture Notes in Networks and Systems, vol 40. Springer, Singapore. https://doi.org/10.1007/978-981-13-0586-3_40

Download citation

DOI: https://doi.org/10.1007/978-981-13-0586-3_40
Published: 31 August 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-0585-6
Online ISBN: 978-981-13-0586-3
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics