Abstract
In this paper, a dialect identification system (DIS) is proposed by exploring the dialect specific prosodic features and cepstral coefficients from sentence-level utterances. Commonly, people belonging to a specific region follow a unique speaking style among them known as dialects. Sentence speech units are chosen for dialect identification since it is observed that a unique intonation and energy patterns are followed in sentences. Sentences are derived from a standard Intonational Variations in English (IViE) speech dataset. In this paper, pitch and energy contour are used to derive intonation and energy features respectively by using Legendre polynomial fit function along with five statistical features. Further, Mel frequency cepstral coefficients (MFCCs) are added to capture dialect specific spectral information. Extreme Gradient Boosting (XGB) ensemble method is employed for evaluation of the system under individual and combinations of features. Obtained results have indicated the influences of both prosodic and spectral features in recognition of dialects, also combined feature vectors have shown a better DIS performance of about 89.6%.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Chambers. J.K., Trudgill, P.: Dialectology, 2 edn. Cambridge University Press (1998)
Rouas, J.L.: Automatic prosodic variations modeling for language and dialect discrimination. IEEE Trans. Audio Speech Lang. Process. 15(6), 1904–1911 (2007)
Mehrabani, M., Hansen, J.H.L.: Automatic analysis of dialect/language sets. Int. J. Speech Technol. 18(3), 277–286 (2015)
Huang, R., Hansen, J.H.L., Angkititrakul, P.: Dialect/accent classification using unrestricted audio. IEEE Trans. Audio Speech Lang. Process. 15(2), 453–464 (2007)
Chen, N.F., Shen, W., Campbell, J.P.: A linguistically-informative approach to dialect recognition using dialect-discriminating context-dependent phonetic models. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 5014–5017 (2010)
Biadsy, F.: Automatic dialect and accent recognition and its application to speech recognition. PhD thesis (2011). Columbia University
Zissman, M.A., Gleason, T.P., Rekart D.M., Losiewicz, B.L.: Automatic dialect identification of extemporaneous conversational, Latin American Spanish speech. In: ICASSP, pp. 777–780 (1996)
Xu, F., Wang, M., Li, M.: Sentence-level dialects identification in the Greater China region. Int. J. Nat. Lang. Comput. (IJNLC) 5(6), 9–20 (2016)
Chittaragi, N.B., Prakash, A., Koolagudi, S.G.: Dialect identification using spectral and prosodic features on single and ensemble classifiers. In: Arabian Journal for Science and Engineering (2017, November)
Grabe, E., Post, B.: Intonational variation in the British Isles. In: Speech Prosody, International Conference (2002)
Giannakopoulos, T.: Study and Application of Acoustic Information for the Detection of Harmful Content, and Fusion with Visual Information. University of Athens, Greece, Department of Informatics and Telecommunications (2009)
Chittaragi, N.B., Koolagudi, S.G.: Acoustic features based word level dialect classification using SVM and ensemble methods. In: 2017 Tenth International Conference on Contemporary Computing (IC3), pp. 1–6 (2017)
Hermansky, H., Morgan, N.: Rasta processing of speech. IEEE Trans. Speech Audio Process. 2(4), 578–589 (1994)
Chen, T., Guestrin, C.: Xgboost: a scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 785–794 (2016)
Harris, M.J., Gries, S.T., Miglio, V.G.: Prosody and its application to forensic linguistics. LESLI: Linguist. Evid. Secur. Law Intell. 2(2), 11–29 (2014)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Chittaragi, N.B., Koolagudi, S.G. (2020). Sentence-Based Dialect Identification System Using Extreme Gradient Boosting Algorithm. In: Elçi, A., Sa, P., Modi, C., Olague, G., Sahoo, M., Bakshi, S. (eds) Smart Computing Paradigms: New Progresses and Challenges. Advances in Intelligent Systems and Computing, vol 766. Springer, Singapore. https://doi.org/10.1007/978-981-13-9683-0_14
Download citation
DOI: https://doi.org/10.1007/978-981-13-9683-0_14
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-9682-3
Online ISBN: 978-981-13-9683-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)