Skip to main content

Speech Enhancement and Recognition of Compressed Speech Signal in Noisy Reverberant Conditions

  • Conference paper

Part of the book series: Advances in Intelligent and Soft Computing ((AINSC,volume 132))

Abstract

Speech compression, enhancement and recognition in noisy, reverberant conditions is a challenging task. In this paper a new approach to this problem, which is developed in the framework of probabilistic random modeling. speech coding techniques are commonly used in low bit rate analysis and synthesis . Coding algorithms seek to minimize the bit rate in the digital representation of a signal without an objectionable loss of signal quality in the process. Speech enhancement aims to improve speech quality by using various algorithms This paper deals with multistage vector quantization technique used for coding of narrow band speech signals. The parameter used for coding of speech signals are the line spectral frequencies, so as to ensure filter stability after quantization. A new approach incorporates the information about statistical random nature of uncompressed speech signal using LBG algorithm .The code books used for quantization are generated by using Linde, Buzo and Gray(LBG) algorithm. Speech model is characterized by LPC coefficients and parameterized by the coefficients of the reverberation filtersThe results of the multistage vector quantizer are compared with unconstrained vector quantization Technique. The performance of quantization is measured in terms of spectral distortion measured in dB, Computational complexity measured in KFlops and Memory Requirements measured in Floats. From the results it can be proved that multistage vector quantization is having better spectral distortion performance, less computational complexity and memory requirements when compared to unconstrained vector quantization. The proposed approach yields significantly estimating the parameters from the data , better performance in both signal to noise ratio and subjective filter methods.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   259.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Suman, M., Khan, H., Madhavi Latha, M., Aruna Kumari, D.: Dimensions of performance in compressive speech signals and its enhancement. International Journal of Engineering Sciences Research (IJSER) 2(2), 87–93 (2011)

    Google Scholar 

  2. Suman, M., Satya Sai Ram, M., Khan, H.: Compression of Speech signals: LBG Algorithm. In: Third International Conference on SEEC 2010, Cochin, Kerala (2010)

    Google Scholar 

  3. Madhavi Latha, M., Satya Sai Ram, M., Siddaiah, P.: Multi Switched Split Vector Quantizer. International Journal of Computer,Information and Systems Science and Engineering 2(1)

    Google Scholar 

  4. Madhavi Latha, M., Satya Sai Ram, M., Siddaiah, P.: Multi Switched Split Vector Quantization. In: Proceedings of World Academy of Science, Engineering and Technology, vol. 27 (February 2008) ISSN 1307-6884

    Google Scholar 

  5. Harma, A.: Linear predictive coding with modified filter structures. IEEE Trans. Speech Audio Process. 9, 769–777 (2001)

    Article  Google Scholar 

  6. Stephen, S., Paliwal, K.K.: Efficient product code vector quantization using switched split vector quantiser. Digital Signal Processing Journal 17, 138–171 (2007)

    Article  Google Scholar 

  7. Paliwal, K.K., Atal, B.S.: Efficient vector quantization of LPCParameters at 24 bits/frame. IEEE Trans. Speech Audio Process., 3–14 (1993)

    Google Scholar 

  8. Soong, F., Juang, B.: Line spectrum pair (LSP) and speech data compression. In: IEEE International Conference on ICASSP, vol. 9, pp. 37–40 (1984)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Suman, M., Khan, H., Latha, M.M., Aruna Kumari, D. (2012). Speech Enhancement and Recognition of Compressed Speech Signal in Noisy Reverberant Conditions. In: Satapathy, S.C., Avadhani, P.S., Abraham, A. (eds) Proceedings of the International Conference on Information Systems Design and Intelligent Applications 2012 (INDIA 2012) held in Visakhapatnam, India, January 2012. Advances in Intelligent and Soft Computing, vol 132. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-27443-5_43

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-27443-5_43

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-27442-8

  • Online ISBN: 978-3-642-27443-5

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics