Phonemes: An Explanatory Study Applied to Identify a Speaker

Kinkiri, Saritha; Barakat, Basel; Keates, Simeon

doi:10.1007/978-981-15-6318-8_6

Saritha Kinkiri¹¹,
Basel Barakat¹² &
Simeon Keates¹²

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1241))

Included in the following conference series:

International Conference on Machine Learning, Image Processing, Network Security and Data Sciences

1097 Accesses

Abstract

Speaker Identification (SI) is a process of identifying a speaker automatically via a machine using the speaker’s voice. In SI, one speaker’s voice is compared with n- number of speakers’ templates within the reference database to find the best match among the potential speakers. Speakers are capable of changing their voice, though, such as their accent, which makes is more challenging to identify who is talking. In this paper, we extracted phonemes from a speaker’s voice recording and investigated the associated frequencies and amplitudes to be assist in identifying the person who is speaking. This paper demonstrates the importance of phonemes in both speech and voice recognition systems. The results demonstrate that we can use phonemes to help the machine identify a particular speaker, however, phonemes get better accuracy in speech recognition than speaker identification.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Bazyar, M., Sudirman, R.: A new speaker change detection method in a speaker identification system for two-speakers segmentation. In: 2014 IEEE Symposium on Computer Applications and Industrial Electronics (ISCAIE), Penang, pp. 141–145 (2014)
Google Scholar
Chowdhury, F.R., Selouani, S., O’Shaughnessy, D.: Distributed automatic text-independent speaker identification using GMM-UBM speaker models. In: 2009 Canadian Conference on Electrical and Computer Engineering, St. John’s, NL, pp. 372–375 (2009)
Google Scholar
Nagaraja, B.G., Jayanna, H.S.: Efficient window for monolingual and cross lingual speaker identification using MFCC. In: 2013 International Conference on Advanced Computing and Communication Systems, Coimbatore, pp. 1–4 (2013)
Google Scholar
Al-Hattami, A.: A phonetic and phonological study of the consonants of English and Arabic. Lang. India 10, 242–365 (2010)
Google Scholar
Bacha, S., Ghozi, R., Jaidane, M., Gouider-Khoujia, N.: Arabic adaption of phonology and memory test using entropy-based analysis of word complexity. In: 2012 11th International Conference on Information Science, Signal Processing and their Applications, (ISSPA), Montreal, QC, pp. 672–677 (2012)
Google Scholar
Ngo, G.H., Nguyen, M., Chen, N.F.: Phonology-augmented statistical framework for machine transliteration using limited linguistic resources. IEEE/ACM Trans. Audio Speech Lang. Process. 27(1), 192–211 (2019)
Article Google Scholar
Shih, S.S., Inkelas, S.: Auto segmental aims in surface-optimizing phonology. Linguist. J. 50(1), 137–196 (2018)
Google Scholar
Uma Maheswari, N., Kabilan, A.P., Venkatesh, R.: Speaker independent speech recognition system based on phoneme identification. In: 2008 International Conference on Computing, Communication and Networking, St. Thomas, VI, pp. 1–6 (2008)
Google Scholar
Rashid, R.A., Mahalin, N.H., Sarijari, M.A., Abdul Aziz, A.A.: Security system using biometric technology: design and implementation of voice recognition system (VRS). In: 2008 International Conference on Computer and Communication Engineering, Kuala Lumpur, pp. 898–902 (2008)
Google Scholar
Akhila, K.S., Kumaraswamy, R.: Comparative analysis of Kannada phoneme recognition using different classifies. In: 2015 International Conference on Trends in Automation, Communications and Computing Technology (I-TACT 2015), Bangalore, pp. 1–6 (2015)
Google Scholar
Panda, S.P.: Automated speech recognition system in advancement of human-computer interaction. In: 2017 International Conference on Computing Methodologies and Communication (ICCMC), Erode, pp. 302–306 (2017)
Google Scholar
Xue, M., Zhu, C.: A study and application on machine learning of artificial intelligence. In: 2009 International Joint Conference on Artificial Intelligence, pp. 272–274 (2009)
Google Scholar
Zhao, C., Wang, H., Hyon, S., Wei, J., Dang, J.: Efficient feature extraction of speaker identification using phoneme mean F-ratio for Chinese. In: 2012 8th International Symposium on Chinese Spoken Language Processing, pp. 345–348 (2012)
Google Scholar
Lavan, N., Burton, A.M., Scott, S.K., McGettigan, C.: Flexible voices: identity perception from variable vocal signals. Psychon. Bull. Rev. J. 26(1), 90–102 (2019)
Article Google Scholar
Kinkiri, S., Keates, S.: Identification of a speaker from familiar and unfamiliar voices. In: 2019 5th International Conference on Robotics and Artificial, pp. 94–97 (2019)
Google Scholar

Download references

Author information

Authors and Affiliations

University of Greenwich, Chatham, ME4 4TB, UK
Saritha Kinkiri
Edinburgh Napier University, Edinburgh, EH11 4DY, UK
Basel Barakat & Simeon Keates

Authors

Saritha Kinkiri
View author publications
You can also search for this author in PubMed Google Scholar
Basel Barakat
View author publications
You can also search for this author in PubMed Google Scholar
Simeon Keates
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Saritha Kinkiri .

Editor information

Editors and Affiliations

National Institute of Technology Silchar, Silchar, India
Arup Bhattacharjee
National Institute Of Technology Silchar, Silchar, India
Samir Kr. Borgohain
National Institute of Technology Silchar, Silchar, India
Badal Soni
National Institute of Technology Kurukshetra, Kurukshetra, India
Gyanendra Verma
University of Eastern Finland, Kuopio, Finland
Xiao-Zhi Gao

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kinkiri, S., Barakat, B., Keates, S. (2020). Phonemes: An Explanatory Study Applied to Identify a Speaker. In: Bhattacharjee, A., Borgohain, S., Soni, B., Verma, G., Gao, XZ. (eds) Machine Learning, Image Processing, Network Security and Data Sciences. MIND 2020. Communications in Computer and Information Science, vol 1241. Springer, Singapore. https://doi.org/10.1007/978-981-15-6318-8_6

Download citation

DOI: https://doi.org/10.1007/978-981-15-6318-8_6
Published: 15 June 2020
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-6317-1
Online ISBN: 978-981-15-6318-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics