Distributed Speech and Speaker Identification System for Personalized Domotic Control

  • Conference paper
Mobile Networks for Biometric Data Analysis

Part of the book series: Lecture Notes in Electrical Engineering (LNEE, volume 392)

Abstract

This paper presents a combined speech recognition and speaker identification system that can be used efficiently for personalized domotic control. The proposed system works as a distributed framework and is designed to identify a speaker in a home environment in order to give that user access to customized options. Human speech signals carry both language-dependent and speaker-dependent information. Using this information, the system provides personalized control in home environments, and the approach can also be applied in more generic scenarios, such as customizing car settings. The system was optimized so that it can be used immediately with only the addition of small, cheap audio front-ends that capture the commands spoken by the user. A remote server performs both speech recognition and user identification and combines this information to provide user-specific settings, which are sent back to the desired actuator in the home.
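
The combination step described in the abstract, where the server merges the recognized command with the identified speaker to select user-specific settings, could look roughly like the following minimal sketch. It is illustrative only and is not the authors' implementation: the names `ServerResult`, `USER_SETTINGS`, and `resolve_setting`, as well as the example users, commands, and actuator fields, are all hypothetical, and the speech recognizer and speaker identifier are assumed to have already produced their outputs.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Tuple

# Hypothetical preference table: (speaker, command) -> actuator setting.
# In a real deployment this would live on the remote server.
USER_SETTINGS: Dict[Tuple[str, str], dict] = {
    ("alice", "lights on"): {"actuator": "living_room_light", "brightness": 80},
    ("bob", "lights on"): {"actuator": "living_room_light", "brightness": 40},
}


@dataclass
class ServerResult:
    """Outputs assumed to come from the two server-side recognizers."""
    command: str  # text produced by the speech recognizer
    speaker: str  # label produced by the speaker identifier


def resolve_setting(
    result: ServerResult,
    settings: Dict[Tuple[str, str], dict] = USER_SETTINGS,
) -> Optional[dict]:
    """Combine speaker identity and command into a user-specific setting.

    Returns None when no personalized setting is known for this pair,
    so the caller can fall back to a default or ignore the request.
    """
    return settings.get((result.speaker, result.command))


if __name__ == "__main__":
    # The same spoken command yields different settings per speaker.
    print(resolve_setting(ServerResult(command="lights on", speaker="alice")))
    print(resolve_setting(ServerResult(command="lights on", speaker="bob")))
```

Keying the lookup on the (speaker, command) pair keeps the two recognizers independent of each other: either one can be replaced without changing how the setting sent back to the actuator is chosen.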

Author information

Corresponding author

Correspondence to Giorgio Biagetti.

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Biagetti, G., Crippa, P., Falaschetti, L., Orcioni, S., Turchetti, C. (2016). Distributed Speech and Speaker Identification System for Personalized Domotic Control. In: Conti, M., Martínez Madrid, N., Seepold, R., Orcioni, S. (eds) Mobile Networks for Biometric Data Analysis. Lecture Notes in Electrical Engineering, vol 392. Springer, Cham. https://doi.org/10.1007/978-3-319-39700-9_13

  • DOI: https://doi.org/10.1007/978-3-319-39700-9_13

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-39698-9

  • Online ISBN: 978-3-319-39700-9

  • eBook Packages: Engineering, Engineering (R0)
