Abstract
During the production of a speech, the instant of significant excitation’s are called epochs. In speech processing, epochs plays a significant role and used in many applications. Accurate detection of epochs from the speech is a challenging task due to time varying nature of the vocal-tract system and excitation source. To detect the epochs from the speech signal several algorithms are already proposed. Zero Frequency Filter (ZFF) approach is one among the different techniques which gives better performance. This method is based on the impulse nature of the excitation source and not affected by the vocal-tract system characteristics. The original filter design of ZFF realized as Infinite Impulse Response (IIR) filter followed by two detrenders. Due to the unstable nature of IIR filter, later the ZFF is realized as the Zero-Band Filter (ZBF). In this paper, we have designed the hardware architectures for IIR and ZBF realization of ZFF. The hardware architectures of ZFF are verified by implementing it on FPGA (ZedBoard Zynq Evaluation and Development Kit xc7z020clg4841) using Xilinx system generator-2016.2.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Murty, K.S.R., Yegnanarayana, B.: Epoch extraction from speech signals. IEEE Trans. Audio, Speech, Lang. Process. 16(8), 1602–1613 (2008)
Atal, B.S., Hanauer, S.L.: Speech analysis and synthesis by linear prediction of the speech wave. J. Acoust. Soc. Am. 50(2B), 637–655 (1971)
Ananthapadmanabha, T., Yegnanarayana, B.: Epoch extraction from linear prediction residual for identification of closed glottis interval. IEEE Trans. Acoust. Speech Signal Process. 27(4), 309–319 (1979)
Smits, R., Yegnanarayana, B.: Determination of instants of significant excitation in speech using group delay function. IEEE Trans. Speech Audio Process. 3(5), 325–333 (1995)
Strube, H.W.: Determination of the instant of glottal closure from the speech wave. J. Acoust. Soc. Am. 56(5), 1625–1629 (1974)
Kounoudes, A., Naylor, P.A., Brookes, M.: The DYPSA algorithm for estimation of glottal closure instants in voiced speech. In: 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1, p. I-349. IEEE (2002)
Deepak, K.T., Prasanna, S.R.M.: Epoch extraction using zero band filtering from speech signal. Circuits, Syst. Signal Process. 34(7), 2309–2333 (2014). https://doi.org/10.1007%2Fs00034-014-9957-4
Prasanna, S.R.M., Govind, D., Rao, K.S., Yenanarayana, B.: Fast prosody modification using instants of significant excitation. In: Proceedings of Speech Prosody (2010)
Deepak, K., Sarma, B.D., Prasanna, S.M.: Foreground speech segmentation using zero frequency filtered signal. In: Thirteenth Annual Conference of the International Speech Communication Association (2012)
Pradhan, G., Prasanna, S.M.: Speaker verification by vowel and nonvowel like segmentation. IEEE Trans. Audio, Speech, Lang. Process. 21(4), 854–867 (2013)
Dhananjaya, N., Yegnanarayana, B.: Voiced/nonvoiced detection based on robustness of voiced epochs. IEEE Signal Process. Lett. 17(3), 273276 (2010)
Seshadri, G., Yegnanarayana, B.: Performance of an event-based instantaneous fundamental frequency estimator for distant speech signals. IEEE Trans. Audio Speech Lang. Process. 19(7), 1853–1864 (2011)
Monmasson, E., Cirstea, M.: FPGA design methodology for industrial control systems—a review. IEEE Trans. Ind. Electron. 54(4), 1824–1842 (2007)
Jimenez-Fernandez, A., Linares-Barranco, A., Paz-Vicente, R., Lujan-Martenez, C.D., Jimenez, G., Civit, A.: AER and dynamic systems co-simulation over Simulink with Xilinx System Generator. In: Proceedings of the 15th IEEE International Conference on Electronics, Circuits and Systems, ICECS 2008, pp. 1281–1284 (2008)
Rabah, H., Amira, A., Mohanty, B.K., Almaadeed, S., Meher, P.K.: FPGA implementation of orthogonal matching pursuit for compressive sensing reconstruction. IEEE Trans. Very Large Scale Integr. (VLSI) Syst. 23(10), 2209–2220 (2015)
Kasap, S., Redif, S.: Novel field-programmable gate array architecture for computing the eigenvalue decomposition of para-hermitian polynomial matrices. IEEE Trans. Very Large Scale Integr. (VLSI) Syst. 22(3), 522–536 (2014)
Prince, A.A., Ganesh, S., Verma, P.K., George, P., Raju, D.: Efficient implementation of empirical mode decomposition in FPGA using Xilinx system generator. In: IECON Proceedings (Industrial Electronics Conference), pp. 895–900 (2016)
Athar, S., Ieee, M., Siddiqi, M.A., Masud, S., Member, S.: Teaching and research in FPGA based digital signal processing using Xilinx system generator, pp. 2765–2768 (2012)
Selvamuthukumaran, R., Gupta, R.: Rapid prototyping of power electronics converters for photovoltaic system application using Xilinx system generator. Power Electron. IET 7(9), 2269–2278 (2014)
Parmar, C.A., Ramanadham, B., Darji, A.D.: FPGA implementation of hardware efficient adaptive filter robust to impulsive noise. IET Comput. Digit. Tech. 11(3), 107–116 (2017). https://doi.org/10.1049/iet-cdt.2016.0067
Pinto, S.J., Panda, G., Peesapati, R.: An implementation of hybrid control strategy for distributed generation system interface using xilinx system generator. IEEE Trans. Ind. Inform. 13(5), 2735–2745 (2017)
Vayada,M.G., Patel, H.R., Muduli, B.R.: Hardware software co-design simulation modeling for image security concept using Matlab-Simulink with Xilinx system generator. In: Proceedings of 2017 3rd IEEE International Conference on Sensing, Signal Processing and Security, ICSSS 2017, pp. 134–137 (2017)
Bahoura, M., Ezzaidi, H.: FPGA-implementation of a sequential adaptive noise canceller using Xilinx system generator. Proc. Int. Conf. Microelectr. ICM 4, 213–216 (2009)
Ownby, M., Mahmoud, W.H.: Dr. Wagdy H. Mahmoud, pp. 404–408
Bahoura, M., Ezzaidi, H.: FPGA-implementation of discrete wavelet transform with application to signal denoising. Circ. Syst. Signal Process. 31(3), 987–1015 (2012)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Srinivas, N., Srinivas, K., Pradhan, G., Kumar, P.K. (2018). FPGA Implementation for Real-Time Epoch Extraction in Speech Signal. In: Singh, M., Gupta, P., Tyagi, V., Flusser, J., Ă–ren, T. (eds) Advances in Computing and Data Sciences. ICACDS 2018. Communications in Computer and Information Science, vol 905. Springer, Singapore. https://doi.org/10.1007/978-981-13-1810-8_39
Download citation
DOI: https://doi.org/10.1007/978-981-13-1810-8_39
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1809-2
Online ISBN: 978-981-13-1810-8
eBook Packages: Computer ScienceComputer Science (R0)