Sentence-Based Dialect Identification System Using Extreme Gradient Boosting Algorithm

Chittaragi, Nagaratna B.; Koolagudi, Shashidhar G.

doi:10.1007/978-981-13-9683-0_14

Sentence-Based Dialect Identification System Using Extreme Gradient Boosting Algorithm

Nagaratna B. Chittaragi^20,21 &
Shashidhar G. Koolagudi²¹

Conference paper
First Online: 01 December 2019

280 Accesses
2 Citations

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 766))

Abstract

In this paper, a dialect identification system (DIS) is proposed by exploring the dialect specific prosodic features and cepstral coefficients from sentence-level utterances. Commonly, people belonging to a specific region follow a unique speaking style among them known as dialects. Sentence speech units are chosen for dialect identification since it is observed that a unique intonation and energy patterns are followed in sentences. Sentences are derived from a standard Intonational Variations in English (IViE) speech dataset. In this paper, pitch and energy contour are used to derive intonation and energy features respectively by using Legendre polynomial fit function along with five statistical features. Further, Mel frequency cepstral coefficients (MFCCs) are added to capture dialect specific spectral information. Extreme Gradient Boosting (XGB) ensemble method is employed for evaluation of the system under individual and combinations of features. Obtained results have indicated the influences of both prosodic and spectral features in recognition of dialects, also combined feature vectors have shown a better DIS performance of about 89.6%.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Chambers. J.K., Trudgill, P.: Dialectology, 2 edn. Cambridge University Press (1998)
Google Scholar
Rouas, J.L.: Automatic prosodic variations modeling for language and dialect discrimination. IEEE Trans. Audio Speech Lang. Process. 15(6), 1904–1911 (2007)
Article Google Scholar
Mehrabani, M., Hansen, J.H.L.: Automatic analysis of dialect/language sets. Int. J. Speech Technol. 18(3), 277–286 (2015)
Article Google Scholar
Huang, R., Hansen, J.H.L., Angkititrakul, P.: Dialect/accent classification using unrestricted audio. IEEE Trans. Audio Speech Lang. Process. 15(2), 453–464 (2007)
Article Google Scholar
Chen, N.F., Shen, W., Campbell, J.P.: A linguistically-informative approach to dialect recognition using dialect-discriminating context-dependent phonetic models. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 5014–5017 (2010)
Google Scholar
Biadsy, F.: Automatic dialect and accent recognition and its application to speech recognition. PhD thesis (2011). Columbia University
Google Scholar
Zissman, M.A., Gleason, T.P., Rekart D.M., Losiewicz, B.L.: Automatic dialect identification of extemporaneous conversational, Latin American Spanish speech. In: ICASSP, pp. 777–780 (1996)
Google Scholar
Xu, F., Wang, M., Li, M.: Sentence-level dialects identification in the Greater China region. Int. J. Nat. Lang. Comput. (IJNLC) 5(6), 9–20 (2016)
Article Google Scholar
Chittaragi, N.B., Prakash, A., Koolagudi, S.G.: Dialect identification using spectral and prosodic features on single and ensemble classifiers. In: Arabian Journal for Science and Engineering (2017, November)
Google Scholar
Grabe, E., Post, B.: Intonational variation in the British Isles. In: Speech Prosody, International Conference (2002)
Google Scholar
Giannakopoulos, T.: Study and Application of Acoustic Information for the Detection of Harmful Content, and Fusion with Visual Information. University of Athens, Greece, Department of Informatics and Telecommunications (2009)
Google Scholar
Chittaragi, N.B., Koolagudi, S.G.: Acoustic features based word level dialect classification using SVM and ensemble methods. In: 2017 Tenth International Conference on Contemporary Computing (IC3), pp. 1–6 (2017)
Google Scholar
Hermansky, H., Morgan, N.: Rasta processing of speech. IEEE Trans. Speech Audio Process. 2(4), 578–589 (1994)
Article Google Scholar
Chen, T., Guestrin, C.: Xgboost: a scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 785–794 (2016)
Google Scholar
Harris, M.J., Gries, S.T., Miglio, V.G.: Prosody and its application to forensic linguistics. LESLI: Linguist. Evid. Secur. Law Intell. 2(2), 11–29 (2014)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Information Science and Engineering, Siddaganga Institute of Technology, Tumkur, India
Nagaratna B. Chittaragi
Department of Computer Science and Engineering, National Institute of Technology Karnataka, Surathkal, 575025, Karnataka, India
Nagaratna B. Chittaragi & Shashidhar G. Koolagudi

Authors

Nagaratna B. Chittaragi
View author publications
You can also search for this author in PubMed Google Scholar
Shashidhar G. Koolagudi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Nagaratna B. Chittaragi .

Editor information

Editors and Affiliations

Engineering Faculty, Aksaray University, Sağlık, Aksaray, Turkey
Atilla Elçi
Department of Computer Science and Engineering, National Institute of Technology Rourkela, Rourkela, Odisha, India
Pankaj Kumar Sa
National Institute of Technology, Goa, India
Chirag N. Modi
CICESE, Ensenada, Baja California, Mexico
Gustavo Olague
Department of Computer Science and Engineering, National Institute of Technology Rourkela, Rourkela, Odisha, India
Manmath N. Sahoo
Department of Computer Science and Engineering, National Institute of Technology Rourkela, Rourkela, Odisha, India
Sambit Bakshi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chittaragi, N.B., Koolagudi, S.G. (2020). Sentence-Based Dialect Identification System Using Extreme Gradient Boosting Algorithm. In: Elçi, A., Sa, P., Modi, C., Olague, G., Sahoo, M., Bakshi, S. (eds) Smart Computing Paradigms: New Progresses and Challenges. Advances in Intelligent Systems and Computing, vol 766. Springer, Singapore. https://doi.org/10.1007/978-981-13-9683-0_14

Download citation

DOI: https://doi.org/10.1007/978-981-13-9683-0_14
Published: 01 December 2019
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-9682-3
Online ISBN: 978-981-13-9683-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics