Skip to main content

Sentence-Based Dialect Identification System Using Extreme Gradient Boosting Algorithm

  • Conference paper
  • First Online:

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 766))

Abstract

In this paper, a dialect identification system (DIS) is proposed by exploring the dialect specific prosodic features and cepstral coefficients from sentence-level utterances. Commonly, people belonging to a specific region follow a unique speaking style among them known as dialects. Sentence speech units are chosen for dialect identification since it is observed that a unique intonation and energy patterns are followed in sentences. Sentences are derived from a standard Intonational Variations in English (IViE) speech dataset. In this paper, pitch and energy contour are used to derive intonation and energy features respectively by using Legendre polynomial fit function along with five statistical features. Further, Mel frequency cepstral coefficients (MFCCs) are added to capture dialect specific spectral information. Extreme Gradient Boosting (XGB) ensemble method is employed for evaluation of the system under individual and combinations of features. Obtained results have indicated the influences of both prosodic and spectral features in recognition of dialects, also combined feature vectors have shown a better DIS performance of about 89.6%.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Chambers. J.K., Trudgill, P.: Dialectology, 2 edn. Cambridge University Press (1998)

    Google Scholar 

  2. Rouas, J.L.: Automatic prosodic variations modeling for language and dialect discrimination. IEEE Trans. Audio Speech Lang. Process. 15(6), 1904–1911 (2007)

    Article  Google Scholar 

  3. Mehrabani, M., Hansen, J.H.L.: Automatic analysis of dialect/language sets. Int. J. Speech Technol. 18(3), 277–286 (2015)

    Article  Google Scholar 

  4. Huang, R., Hansen, J.H.L., Angkititrakul, P.: Dialect/accent classification using unrestricted audio. IEEE Trans. Audio Speech Lang. Process. 15(2), 453–464 (2007)

    Article  Google Scholar 

  5. Chen, N.F., Shen, W., Campbell, J.P.: A linguistically-informative approach to dialect recognition using dialect-discriminating context-dependent phonetic models. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 5014–5017 (2010)

    Google Scholar 

  6. Biadsy, F.: Automatic dialect and accent recognition and its application to speech recognition. PhD thesis (2011). Columbia University

    Google Scholar 

  7. Zissman, M.A., Gleason, T.P., Rekart D.M., Losiewicz, B.L.: Automatic dialect identification of extemporaneous conversational, Latin American Spanish speech. In: ICASSP, pp. 777–780 (1996)

    Google Scholar 

  8. Xu, F., Wang, M., Li, M.: Sentence-level dialects identification in the Greater China region. Int. J. Nat. Lang. Comput. (IJNLC) 5(6), 9–20 (2016)

    Article  Google Scholar 

  9. Chittaragi, N.B., Prakash, A., Koolagudi, S.G.: Dialect identification using spectral and prosodic features on single and ensemble classifiers. In: Arabian Journal for Science and Engineering (2017, November)

    Google Scholar 

  10. Grabe, E., Post, B.: Intonational variation in the British Isles. In: Speech Prosody, International Conference (2002)

    Google Scholar 

  11. Giannakopoulos, T.: Study and Application of Acoustic Information for the Detection of Harmful Content, and Fusion with Visual Information. University of Athens, Greece, Department of Informatics and Telecommunications (2009)

    Google Scholar 

  12. Chittaragi, N.B., Koolagudi, S.G.: Acoustic features based word level dialect classification using SVM and ensemble methods. In: 2017 Tenth International Conference on Contemporary Computing (IC3), pp. 1–6 (2017)

    Google Scholar 

  13. Hermansky, H., Morgan, N.: Rasta processing of speech. IEEE Trans. Speech Audio Process. 2(4), 578–589 (1994)

    Article  Google Scholar 

  14. Chen, T., Guestrin, C.: Xgboost: a scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 785–794 (2016)

    Google Scholar 

  15. Harris, M.J., Gries, S.T., Miglio, V.G.: Prosody and its application to forensic linguistics. LESLI: Linguist. Evid. Secur. Law Intell. 2(2), 11–29 (2014)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nagaratna B. Chittaragi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Chittaragi, N.B., Koolagudi, S.G. (2020). Sentence-Based Dialect Identification System Using Extreme Gradient Boosting Algorithm. In: Elçi, A., Sa, P., Modi, C., Olague, G., Sahoo, M., Bakshi, S. (eds) Smart Computing Paradigms: New Progresses and Challenges. Advances in Intelligent Systems and Computing, vol 766. Springer, Singapore. https://doi.org/10.1007/978-981-13-9683-0_14

Download citation

Publish with us

Policies and ethics