Skip to main content

No Residual Transmission: Joint Spectral-Residual Quantization

  • Chapter
  • First Online:
Ultra Low Bit-Rate Speech Coding

Part of the book series: SpringerBriefs in Electrical and Computer Engineering ((BRIEFSSPEECHTECH))

  • 753 Accesses

Abstract

In this chapter, we present a unit-selection based segment quantization scheme which leads to the interesting possibility of not having to transmit any side-information about the residual at all. We propose such a ‘no residual transmission’ scheme in both the segmental unit-selection framework of Lee and Cox described in Chap. 3 and the optimal 1-pass DP based unit-selection framework proposed by us and described in Chap. 4. We arrive at this ‘no residue transmission’ scheme from the important observations that unit-selection based segment quantization systems typically employ large unit-databases as in concatenative speech synthesis and that, by virtue of the largeness of the continuous codebook, it becomes possible to quantize an input segment by an unit in the unit database in such a way that the speech corresponding to the unit, after applying ‘only’ duration modification, is a close reconstruction of the input speech (of that input segment). We propose a ‘joint spectral-residual quantization’ by defining various ‘composite measures’ that combine both the spectral match and residual match, between input speech frames and unit frames, to select units that quantize an input speech segment/frame in toto. We present the rate-distortion performance trends of such a joint spectral-residual quantization in both these unit-selection frameworks for various composite measures, and show the efficacy of the proposed approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. K.S. Lee, R.V. Cox, A segmental speech coder based on a concatenative TTS. Speech Commun. 38, 89–100 (2002)

    Article  MATH  Google Scholar 

  2. K.K. Paliwal, W.B. Kleijn, Quantization of LPC parameters, in Speech Coding and Synthesis, ed. by W.B. Kleijn, K.K. Paliwal (Elsevier, Amsterdam, 1995), pp. 433–466. Chapter 12

    Google Scholar 

  3. V. Ramasubramanian, D. Harish, An optimal unit-selection algorithm for ultra low bit-rate speech coding, in Proceedings of ICASSP ’07, Hawaii, Apr 2007, pp. IC-541–IC-544

    Google Scholar 

  4. V. Ramasubramanian, D. Harish, An unified unit-selection framework for ultra low bit-rate speech coding, in Proceedings of Interspeech ’06, Pittsburgh, Sept 2006, pp. 217–220

    Google Scholar 

  5. V. Ramasubramanian, D. Harish, Ultra low bit-rate speech coding based on unit-selection with joint spectral-residual quantization: No transmission of any residual information, in Proceedings of Interspeech ’09, Brighton, Sept 2009, pp. 2615–2618

    Google Scholar 

  6. V. Ramasubramanian, Ultra low bit-rate speech coding: An overview and recent results, IEEE SPCOM, Indian Institute of Science, Bangalore, 2012

    Google Scholar 

  7. S. Roucos, A.M. Wilgus, The waveform segment vocoder: a new approach for very-low-rate speech coding, in ICASSP ’85, 1985, pp. 236–239

    Google Scholar 

  8. J. Schroeter, M.M. Sondhi, Speech coding based on physiological models of speech production, in Advances in Speech Signal Processing, ed. by S. Furui, M.M. Sondhi (Dekker, New York, 1992), pp. 231–268

    Google Scholar 

  9. Y. Shiraki, M. Honda, LPC speech coding based on variable-length segment quantization”. IEEE Trans. Acoust. Speech Signal Process. 36(9), 1437–1444 (1988)

    Article  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2015 The Author

About this chapter

Cite this chapter

Ramasubramanian, V., Doddala, H. (2015). No Residual Transmission: Joint Spectral-Residual Quantization. In: Ultra Low Bit-Rate Speech Coding. SpringerBriefs in Electrical and Computer Engineering(). Springer, New York, NY. https://doi.org/10.1007/978-1-4939-1341-1_6

Download citation

  • DOI: https://doi.org/10.1007/978-1-4939-1341-1_6

  • Published:

  • Publisher Name: Springer, New York, NY

  • Print ISBN: 978-1-4939-1340-4

  • Online ISBN: 978-1-4939-1341-1

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics