Language Identification Using Spectral Features

Rao, K. Sreenivasa; Reddy, V. Ramu; Maity, Sudhamay

doi:10.1007/978-3-319-17163-0_3

K. Sreenivasa Rao⁵,
V. Ramu Reddy⁶ &
Sudhamay Maity⁷

Part of the book series: SpringerBriefs in Electrical and Computer Engineering ((BRIEFSSPEECHTECH))

662 Accesses

Abstract

This chapter introduces multilingual Indian language speech corpus consisting of 27 regional Indian languages for analyzing the language identification (LID) performance. Speaker-dependent and independent language models are also discussed in view of LID. Spectral features extracted from conventional block processing, pitch synchronous analysis, and glottal closure regions are examined for discriminating the languages.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Rao KS, Maity S, Reddy VR (2013) Pitch synchronous and glottal closure based speech analysis for language recognition. Int J Speech Technol (Springer) 16(4):413–430
Google Scholar
Maity S, Vuppala AK, Rao KS, Nandi D (2012) IITKGP-MLILSC speech database for language identification. In: National conference on communication, Feb 2012
Google Scholar
Muthusamy YK, Cole RA, Oshika BT (1992) The OGI multi-language telephone speech corpus. In: Proceedings of international conference spoken language processing, pp 895–898, Oct 1992
Google Scholar
Lander T, Cole R, Oshika B, Noel M (1995) The OGI 22 language telephone speech corpus. In: Proceedings of EUROSPEECH-1995, pp 817–820
Google Scholar
Zheng F, Zhang G, Song Z (2001) Comparison of different implementations of MFCC. J Comput Sci Technol 16(6):582–589
Google Scholar
Reynolds D (2009) Enclopedia of biometrics. Springer, New York, pp 659–663
Google Scholar
Murty K, Yegnanarayana B (2008) Epoch extraction from speech signals. IEEEASLP 16:1602–1613
Google Scholar
Sreenivasa Rao K, Prasanna SRM, Yegnanarayana B (2007) Determination of instants of significant excitation in speech using Hilbert envelope and group delay function. In: IEEE signal processing letters, vol 14, no 10, pp 762–765, Oct 2007
Google Scholar
Varga A, Steeneken HJ (1993) Assessment for automatic speech recognition: II. NOISEX-92: a database and an experiment to study the effect of additive noise on speech recognition systems. Speech Commun 12:247–251
Article Google Scholar

Download references

Author information

Authors and Affiliations

Indian Institute of Technology Kharagpur, Kharagpur, West Bengal, India
K. Sreenivasa Rao
Innovation Lab Kolkata, Kolkata, West Bengal, India
V. Ramu Reddy
Indian Institute of Technology Kharagpur, Kharagpur, West Bengal, India
Sudhamay Maity

Authors

K. Sreenivasa Rao
View author publications
You can also search for this author in PubMed Google Scholar
V. Ramu Reddy
View author publications
You can also search for this author in PubMed Google Scholar
Sudhamay Maity
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to K. Sreenivasa Rao .

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Rao, K.S., Reddy, V.R., Maity, S. (2015). Language Identification Using Spectral Features. In: Language Identification Using Spectral and Prosodic Features. SpringerBriefs in Electrical and Computer Engineering(). Springer, Cham. https://doi.org/10.1007/978-3-319-17163-0_3

Download citation

DOI: https://doi.org/10.1007/978-3-319-17163-0_3
Published: 01 April 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-17162-3
Online ISBN: 978-3-319-17163-0
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Language Identification Using Spectral Features