Speech Coding Employing Intelligent Signal Processing Techniques

Czyzewski, Andrzej

doi:10.1007/978-3-540-71663-1_1

Andrzej Czyzewski¹

Part of the book series: Lecture Notes in Computer Science ((TRS,volume 4400))

521 Accesses

Abstract

The concepts and experiments presented are focused on modifications of an existing parametric speech coding algorithm (CELP) introduced in order to improve subjective speech quality in telephone connections. The perceptual coding to bit rate limiting was added and algorithms qualifying speech components to the categories of ”voiced”, ”unvoiced”, ”transients” using rough sets were studied. The speech signal quality achieved with the proposed hybrid codec was compared to the quality offered by some standard speech codecs.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Pawlak, Z.: A Treatise on Rough Sets. In: Peters, J.F., Skowron, A. (eds.) Transactions on Rough Sets IV, pp. 1–17. Springer, Berlin (2005)
Chapter Google Scholar
Kulesza, M., Szwoch, G., Czyzewski, A.: Improving signal quality in speech codec using a hybrid perceptual-parametric algorithm. In: Multimedia and Network Information Systems’ (MISSI), Wroclaw, Poland, 21-22 Sept., 2006, pp. 181–192 (2006)
Google Scholar
Ritz, C.H.: Lossless wideband speech coding. In: 10^th International Conference on Speech Science and Technology, Sydney, Australia (Dec. 2004)
Google Scholar
Czyzewski, A.: Applications of Neural Networks and Perceptual Masking to Audio Restoration. Journal of New Music Research 22(5), 339–349 (2001)
Article Google Scholar
Verma, T.S., Levine, S.N., Meng, T.H.: Transient Modeling Synthesis: a flexible analysis/synthesis tool for transient signals. In: International Computer Music Conference, Greece (1997)
Google Scholar
Chu, W.C.: Speech Coding Algorithms. Foundation and Evolution of Standardized Coders. John Wiley & Sons, Hoboken (2003)
MATH Google Scholar
Goldberg, R., Riek, L.: A Practical Handbook of Speech Coders. CRC Press, Boca Raton (2000)
MATH Google Scholar
Kliewer, J., Mertins, A.: Audio subband coding with improved representation of transient signal segments. In: Proc 9th European Signal Processing Conference (EUSICPO-98), Rhodes, Greece, September 1998, pp. 1245–1248 (1998)
Google Scholar
Babu, V.S., et al.: Transient Detection for Transform Domain Coders. In: AES 116^th Convention, Berlin (2004)
Google Scholar
ISO/IEC 14496-3:2001 Information technology - Generic coding of moving pictures and associated audio information: Part 3: Advanced Audio Coding (AAC) (2001)
Google Scholar
OGG Vorbis Specification: http://xiph.org/vorbis/
Painter, T., Spanias, A.: Perceptual Coding of Digital Audio. Proceedings of IEEE 88, 451–513 (2000)
Article Google Scholar
Opticom, Opera your digital ear. User manual, version 3.5 (2002)
Google Scholar
Czyzewski, A., et al.: Intelligent Algorithms for Movie Sound Tracks Restoration. In: Peters, J.F., Skowron, A. (eds.) Transactions on Rough Sets V (2006)
Google Scholar
ITU-T Recommendation P.862, Perceptual evaluation of speech quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs (2003)
Google Scholar
Kulesza, M., Szwoch, G., Czyzewski, A.: High quality speech coding using combined parametric and perceptual modules. In: 13th World Enformatika Conference Proc., Budapest, Hungary, 26–28 May, 2006, pp. 244–249 (2006)
Google Scholar
Czyzewski, A., Królikowski, R.: Neuro-Rough Control of Masking Thresholds for Audio Signal Enhancement. Journal of Neurocomputing 36, 5–27 (2001)
Article MATH Google Scholar
Annadana, R., Ferreira, A., Sinha, D.: A new low bit rate speech coding scheme for mixed content. In: 120^th AES Convention, Paris, France (May 2006)
Google Scholar
Ahmadi, S., Jelinek, M.: n the architecture, operation, and applications of VMRWB: The new cdma2000 wideband speech coding standard. IEEE Communication Magazine 44(5), 74–81 (2006)
Article Google Scholar
Chazan, D., et al.: High quality sinusoidal modeling of wideband speech for the purposes of speech synthesis and modification. In: IEEE International Conference on Acoustic, Speech, and Signal Processing - ICASSP, Toulouse, May 2006, IEEE, Los Alamitos (2006)
Google Scholar
Fuemmeler, J., Hardie, R., Gardner, W.: Techniques for the regeneration of wideband speech form narrow band speech. EURASIP Journal on Applied Signal Processing 2001(4), 266–274 (2001)
Article Google Scholar
Levine, S., Smith III., J.: Improvements to the Switched Parametric & Transform Audio Coder. In: Proc. 1999 IEEE Workshop on Application of Signal Processing to Audio and Acoustics, New York, Oct. 1999, IEEE Computer Society Press, Los Alamitos (1999)
Google Scholar
Najafzadeh-Azghandi, H., Kabal, P.: Perceptual coding of narrowband audio signals at 8 kbit/s. In: Proc. IEEE Workshop Speech Coding, Pocono Manor, IEEE Computer Society Press, Los Alamitos (1997)
Google Scholar
Ojala, P., et al.: The adaptive multirate wideband speech codec: system characteristics, quality advances, and deployment strategies. IEEE Communication Magazine 44(5), 59–65 (2006)
Article Google Scholar
Kulesza, M., et al.: High Quality Speech Codec Employing Sines+Noise+Transients Model. In: 53^rd Open Seminar on Acoustics, Zakopane, Poland, 11–15 Sept. (2006)
Google Scholar
Yang, M.: Low bit rate speech coding. IEEE Potentials 23(4), 32–36 (2004)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Multimedia Systems Department, Gdansk University of Technology, ul. Narutowicza 11/12, 80-952 Gdansk, Poland
Andrzej Czyzewski

Authors

Andrzej Czyzewski
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

James F. Peters Andrzej Skowron Victor W. Marek Ewa Orłowska Roman Słowiński Wojciech Ziarko

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Czyzewski, A. (2007). Speech Coding Employing Intelligent Signal Processing Techniques. In: Peters, J.F., Skowron, A., Marek, V.W., Orłowska, E., Słowiński, R., Ziarko, W. (eds) Transactions on Rough Sets VII. Lecture Notes in Computer Science, vol 4400. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71663-1_1

Download citation

DOI: https://doi.org/10.1007/978-3-540-71663-1_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-71662-4
Online ISBN: 978-3-540-71663-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics