Abstract
The article establishes the general trends of speech coding algorithms based on linear prediction. The task of adaptation of speech codec to the statistical characteristics of the coding parameters is set and accomplished. The main procedures of their forming are examined. The results of experimental studies of the developed adaptive low bit-rate coding algorithms are presented. The benefits of the quality of remade speech in comparison with algorithms on FS1015, FS1017 and FS1016 standards and Full-rate GSM are displayed.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
GOST 26532-85. Data transmission system signal conversion modulus for unswitched voice-frequency channels. Types and basic parameters. Moscow: GOST USSR (1985) (In Russia)
Trancoso, I.M.: An overview of different trends on CELP coding. In: Ayuso, A.J.R., Soler, J.M.L. (eds.) Speech Recognition and Coding: New Advances and Trends. NATO ASI Series, vol. 147, pp. 351–367. Springer, Heidelberg (1995)
Supplee, L.M., Cohn, R.P., Collura, J.S., McCree, A.V.: MELP: the new Federal Standard at 2400 bps. In: IEEE ICASSP-97 Conference, Munich, Germany, pp. 1591–1594 (1997)
Basov, O.O., Nosov, M.V., Shalaginov, V.A.: Pitch-jitter analysis of the speech signal. SPIIRAS Proc. 1(32), 27–44 (2014). (In Russian)
Basov, O.O., Saitov, I.A.: Basic channels of interpersonal communication and their projection on the infocommunications systems. SPIIRAS Proc. 7(30), 122–140 (2013). (In Russian)
Wai, C.C.: Speech Coding Algorithms: Foundation and evolution of standardized coders. Wiley, Hoboken (2003)
Stachurski, J., McCree, A., Viswanathan, V., Heikkinen, A., Ramo, A., Himanen, S., Blocher, P.: Hybrid MELP/CELP coding at bit rates from 6.4 to 2.4 kb/s. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. II-153–II-156 (2003)
Basov, O.O.: A conceptual model of multicriterion adaptation of the linear predictive voice coding procedure. Telecommun. Radio Eng. 68(10), 923–931 (2009)
Max, J.: Quantizing for minimum distortion. IRE Trans. Inform. Theory 6, 7–12 (1963)
Smith, B.: Instantaneous companding of quantized signals. Bell Syst. Tech. J. 36, 653–709 (1957)
Palival, K.K., Atal, B.S.: Efficient vector quantization of LPC parameters at 24 bits/frame. IEEE Trans. Acoustics Speech Signal Process. 1(1), 3–14 (1993)
Saveliev, A.I., Vatamaniuk, I.V., Ronzhin, A.L.: Architecture of data exchange with minimal client-server interaction at multipoint video conferencing. In: Balandin, S., Andreev, S., Koucheryavy, Y. (eds.) NEW2AN/ruSMART 2014. LNCS, vol. 8638, pp. 164–174. Springer, Heidelberg (2014)
Potapova, R., Sobakin, A., Maslov, A.: On the possibility of the skype channel speaker identification (on the basis of acoustic parameters). In: Ronzhin, A., Potapova, R., Delic, V. (eds.) SPECOM 2014. LNCS, vol. 8773, pp. 329–336. Springer, Heidelberg (2014)
Saveliev, A.I., Prischepa, M.: Architecture of lossless data exchange in pear-to-pear web application of videoconference. Proc. Tomsk State Univ. Control Syst. Radioelectronics 2(32), 238–245 (2014)
Ronzhin, A.L., Karpov, A.A.: A software system for the audiovisual monitoring of an intelligent meeting room in support of scientific and education activities. Pattern Recogn. Image Anal. 25(2), 237–254 (2015)
Ronzhin, A., Vatamaniuk, I., Ronzhin, A., Železný, M.: Algorithms for acceleration of image processing at automatic registration of meeting participants. In: Ronzhin, A., Potapova, R., Delic, V. (eds.) SPECOM 2014. LNCS(LNAI), vol. 8773, pp. 89–96. Springer, Heidelberg (2014)
Ronzhin, A.L., Ronzhin, A.L., Budkov, V.Y.: Methodology of facility automation based on audiovisual analysis and space-time structuring of situation in meeting room. In: Stephanidis, C. (ed.) HCII 2013, Part II. CCIS, vol. 374, pp. 524–528. Springer, Heidelberg (2013)
Acknowledgments
This work is partially supported by the Russian Foundation for Basic Research (grants № 15-07-06-774-a, 13-08-0741-a); the scholarship of the President of the Russian Federation (project no. SP-3872.2015.5).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Saveliev, A., Basov, O., Ronzhin, A., Ronzhin, A. (2015). Algorithms for Low Bit-Rate Coding with Adaptation to Statistical Characteristics of Speech Signal. In: Ronzhin, A., Potapova, R., Fakotakis, N. (eds) Speech and Computer. SPECOM 2015. Lecture Notes in Computer Science(), vol 9319. Springer, Cham. https://doi.org/10.1007/978-3-319-23132-7_8
Download citation
DOI: https://doi.org/10.1007/978-3-319-23132-7_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-23131-0
Online ISBN: 978-3-319-23132-7
eBook Packages: Computer ScienceComputer Science (R0)