Unified and Optimal Unit-Selection Framework

  • Chapter
Ultra Low Bit-Rate Speech Coding

Part of the book series: SpringerBriefs in Electrical and Computer Engineering (BRIEFSSPEECHTECH)

Abstract

This chapter is devoted to the class of unit-selection algorithms that we proposed earlier, which form an optimal and unified generalization of the single-frame and sub-optimal segmental unit-selection algorithms of Lee and Cox treated in the previous chapter. After a detailed treatment of this unified formulation, we present the modified one-pass DP algorithm, which solves the formulation optimally, and characterize its performance through rate-distortion curves obtained with large unit databases. We first show how it generalizes the single-frame algorithm to longer fixed-length units, with rate-distortion performance improving progressively as the unit size increases. We then compare its rate-distortion performance with that of the sub-optimal segmental algorithm of Lee and Cox and demonstrate its clear advantages. Finally, we ask what advantage, if any, is gained by moving from a small clustered segment codebook, as in the classic variable-length segment quantization algorithm of Shiraki and Honda, to the large unit databases of the unit-selection framework. Rate-distortion results show that the unit-selection algorithms substantially outperform conventional vector quantization, matrix quantization, and variable-length segment quantization with clustered codebooks.
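To give a feel for what a dynamic-programming unit selection looks like, here is a minimal Python sketch. It is not the chapter's modified one-pass DP algorithm, whose details are not given in the abstract. It is a generic Viterbi-style search over single-frame units, and it rests on assumptions made purely for illustration: squared Euclidean distortion, a database stored as one continuous frame sequence, and a hypothetical `switch_penalty` charged whenever two consecutive selected units are not adjacent in the database. The function name `select_units` and all parameter names are likewise hypothetical.

```python
def select_units(target, database, switch_penalty=1.0):
    """Illustrative Viterbi-style single-frame unit selection.

    target:   list of T input feature frames (each a sequence of floats)
    database: list of N database frames, kept in recording order
    Returns T database indices minimising total squared-error distortion
    plus switch_penalty for every transition that is not n-1 -> n.
    (Assumed cost model for illustration, not the chapter's formulation.)
    """
    T, N = len(target), len(database)

    def dist(x, y):
        # Squared Euclidean distance between two frames
        return sum((a - b) ** 2 for a, b in zip(x, y))

    # cost[n]: best accumulated cost of a path ending at database frame n
    cost = [dist(target[0], u) for u in database]
    back = []  # back[t-1][n]: predecessor of frame n at input time t
    for t in range(1, T):
        # Cheapest predecessor overall, reachable from anywhere at a penalty
        best_prev = min(range(N), key=cost.__getitem__)
        jump = cost[best_prev] + switch_penalty
        new_cost, pointers = [], []
        for n in range(N):
            # Contiguous continuation n-1 -> n carries no penalty
            contiguous = cost[n - 1] if n > 0 else float("inf")
            if contiguous <= jump:
                prev, c = n - 1, contiguous
            else:
                prev, c = best_prev, jump
            new_cost.append(c + dist(target[t], database[n]))
            pointers.append(prev)
        cost = new_cost
        back.append(pointers)

    # Trace the best path backwards from the cheapest final state
    path = [min(range(N), key=cost.__getitem__)]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    return path[::-1]
```

In this toy cost model, raising `switch_penalty` favours longer contiguous runs of database frames at the expense of distortion, which is one simple way to trade rate against distortion; the chapter's own formulation may differ.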

References

  1. D. Harish, V. Ramasubramanian, Comparison of segment quantizers: VQ, MQ, VLSQ and Unit-selection algorithms for ultra low bit-rate speech coding, in Proceedings of ICASSP ’08, Las Vegas, Mar 2008, pp. 4773–4776

  2. K.S. Lee, R.V. Cox, A very low bit rate speech coder based on a recognition/synthesis paradigm. IEEE Trans. Speech Audio Process. 9(5), 482–491 (2001)

  3. K.S. Lee, R.V. Cox, A segmental speech coder based on a concatenative TTS. Speech Commun. 38, 89–100 (2002)

  4. H. Ney, The use of one-stage dynamic programming algorithm for connected word recognition. IEEE Trans. Acoust. Speech Signal Process. 32(2), 263–271 (1984)

  5. V. Ramasubramanian, D. Harish, An optimal unit-selection algorithm for ultra low bit-rate speech coding, in Proceedings of ICASSP ’07, Hawaii, Apr 2007, pp. IC-541–IC-544

  6. V. Ramasubramanian, D. Harish, An unified unit-selection framework for ultra low bit-rate speech coding, in Proceedings of Interspeech ’06, Pittsburgh, Sept 2006, pp. 217–220

  7. Y. Shiraki, M. Honda, LPC speech coding based on variable-length segment quantization. IEEE Trans. Acoust. Speech Signal Process. 36(9), 1437–1444 (1988)

  8. C. Tsao, R.M. Gray, Matrix quantizer design for LPC speech using the generalized Lloyd algorithm. IEEE Trans. Acoust. Speech Signal Process. 33(3), 537–545 (1985)

  9. D.Y. Wong et al., An 800 b/s vector quantization LPC vocoder. IEEE Trans. Acoust. Speech Signal Process. 30(6), 770–780 (1982)

Copyright information

© 2015 The Author

About this chapter

Cite this chapter

Ramasubramanian, V., Doddala, H. (2015). Unified and Optimal Unit-Selection Framework. In: Ultra Low Bit-Rate Speech Coding. SpringerBriefs in Electrical and Computer Engineering. Springer, New York, NY. https://doi.org/10.1007/978-1-4939-1341-1_4

  • DOI: https://doi.org/10.1007/978-1-4939-1341-1_4

  • Publisher Name: Springer, New York, NY

  • Print ISBN: 978-1-4939-1340-4

  • Online ISBN: 978-1-4939-1341-1

  • eBook Packages: Engineering, Engineering (R0)