Skip to main content

Algorithms for Low Bit-Rate Coding with Adaptation to Statistical Characteristics of Speech Signal

  • Conference paper
  • First Online:
Speech and Computer (SPECOM 2015)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9319))

Included in the following conference series:

Abstract

The article establishes the general trends of speech coding algorithms based on linear prediction. The task of adaptation of speech codec to the statistical characteristics of the coding parameters is set and accomplished. The main procedures of their forming are examined. The results of experimental studies of the developed adaptive low bit-rate coding algorithms are presented. The benefits of the quality of remade speech in comparison with algorithms on FS1015, FS1017 and FS1016 standards and Full-rate GSM are displayed.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. GOST 26532-85. Data transmission system signal conversion modulus for unswitched voice-frequency channels. Types and basic parameters. Moscow: GOST USSR (1985) (In Russia)

    Google Scholar 

  2. Trancoso, I.M.: An overview of different trends on CELP coding. In: Ayuso, A.J.R., Soler, J.M.L. (eds.) Speech Recognition and Coding: New Advances and Trends. NATO ASI Series, vol. 147, pp. 351–367. Springer, Heidelberg (1995)

    Chapter  Google Scholar 

  3. Supplee, L.M., Cohn, R.P., Collura, J.S., McCree, A.V.: MELP: the new Federal Standard at 2400 bps. In: IEEE ICASSP-97 Conference, Munich, Germany, pp. 1591–1594 (1997)

    Google Scholar 

  4. Basov, O.O., Nosov, M.V., Shalaginov, V.A.: Pitch-jitter analysis of the speech signal. SPIIRAS Proc. 1(32), 27–44 (2014). (In Russian)

    Article  Google Scholar 

  5. Basov, O.O., Saitov, I.A.: Basic channels of interpersonal communication and their projection on the infocommunications systems. SPIIRAS Proc. 7(30), 122–140 (2013). (In Russian)

    Google Scholar 

  6. Wai, C.C.: Speech Coding Algorithms: Foundation and evolution of standardized coders. Wiley, Hoboken (2003)

    Google Scholar 

  7. Stachurski, J., McCree, A., Viswanathan, V., Heikkinen, A., Ramo, A., Himanen, S., Blocher, P.: Hybrid MELP/CELP coding at bit rates from 6.4 to 2.4 kb/s. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. II-153–II-156 (2003)

    Google Scholar 

  8. Basov, O.O.: A conceptual model of multicriterion adaptation of the linear predictive voice coding procedure. Telecommun. Radio Eng. 68(10), 923–931 (2009)

    Article  Google Scholar 

  9. Max, J.: Quantizing for minimum distortion. IRE Trans. Inform. Theory 6, 7–12 (1963)

    Article  MathSciNet  Google Scholar 

  10. Smith, B.: Instantaneous companding of quantized signals. Bell Syst. Tech. J. 36, 653–709 (1957)

    Article  Google Scholar 

  11. Palival, K.K., Atal, B.S.: Efficient vector quantization of LPC parameters at 24 bits/frame. IEEE Trans. Acoustics Speech Signal Process. 1(1), 3–14 (1993)

    Google Scholar 

  12. Saveliev, A.I., Vatamaniuk, I.V., Ronzhin, A.L.: Architecture of data exchange with minimal client-server interaction at multipoint video conferencing. In: Balandin, S., Andreev, S., Koucheryavy, Y. (eds.) NEW2AN/ruSMART 2014. LNCS, vol. 8638, pp. 164–174. Springer, Heidelberg (2014)

    Google Scholar 

  13. Potapova, R., Sobakin, A., Maslov, A.: On the possibility of the skype channel speaker identification (on the basis of acoustic parameters). In: Ronzhin, A., Potapova, R., Delic, V. (eds.) SPECOM 2014. LNCS, vol. 8773, pp. 329–336. Springer, Heidelberg (2014)

    Google Scholar 

  14. Saveliev, A.I., Prischepa, M.: Architecture of lossless data exchange in pear-to-pear web application of videoconference. Proc. Tomsk State Univ. Control Syst. Radioelectronics 2(32), 238–245 (2014)

    Google Scholar 

  15. Ronzhin, A.L., Karpov, A.A.: A software system for the audiovisual monitoring of an intelligent meeting room in support of scientific and education activities. Pattern Recogn. Image Anal. 25(2), 237–254 (2015)

    Article  Google Scholar 

  16. Ronzhin, A., Vatamaniuk, I., Ronzhin, A., Železný, M.: Algorithms for acceleration of image processing at automatic registration of meeting participants. In: Ronzhin, A., Potapova, R., Delic, V. (eds.) SPECOM 2014. LNCS(LNAI), vol. 8773, pp. 89–96. Springer, Heidelberg (2014)

    Google Scholar 

  17. Ronzhin, A.L., Ronzhin, A.L., Budkov, V.Y.: Methodology of facility automation based on audiovisual analysis and space-time structuring of situation in meeting room. In: Stephanidis, C. (ed.) HCII 2013, Part II. CCIS, vol. 374, pp. 524–528. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

Download references

Acknowledgments

This work is partially supported by the Russian Foundation for Basic Research (grants № 15-07-06-774-a, 13-08-0741-a); the scholarship of the President of the Russian Federation (project no. SP-3872.2015.5).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Andrey Ronzhin .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Saveliev, A., Basov, O., Ronzhin, A., Ronzhin, A. (2015). Algorithms for Low Bit-Rate Coding with Adaptation to Statistical Characteristics of Speech Signal. In: Ronzhin, A., Potapova, R., Fakotakis, N. (eds) Speech and Computer. SPECOM 2015. Lecture Notes in Computer Science(), vol 9319. Springer, Cham. https://doi.org/10.1007/978-3-319-23132-7_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-23132-7_8

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-23131-0

  • Online ISBN: 978-3-319-23132-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics