Basic Framework of Vocoders for Speech Processing

  • R. Chinna RaoEmail author
  • D. Elizabath Rani
  • S. Srinivasa Rao
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 898)


The main objective of this paper is to develop a basic framework of a vocoder for speech processing. The basic framework of a vocoder consists of voice preprocessing, encoder, decoder, and post-processing blocks and performing multithreading among these blocks. A GUI can be prepared which will take the input from the user and set the environment as required. The voice subsystem consists of transmission path (TX chain) and receiver path (RX chain) having various signal processing blocks. These blocks are executed in sequential order to process the voice signal. This basic framework allows user to completely analyze the effect of preprocessing blocks, post-processing blocks, and vocoders on the speech signal at different points. A thread is a process which takes samples as input, calls an appropriate block for processing, and gives processed samples as output.


Framework Multithreading Encoder Decoder 


  1. 1.
    Mishra, S., Singh, S.: A survey paper on different data compression techniques. Indian J. Res. Pap. 6(5) (2016)Google Scholar
  2. 2.
    Holmes, J.N.: The JSRU channel vocoder. IEEE Proc. 127, 53–60 (1980)CrossRefGoogle Scholar
  3. 3.
    Lukasaik, J., McElory, C., Chang, E.: Compression transparent low-level description of audio signals. In: IEEE International Conference Multimedia and Expo, 6 July 2005Google Scholar
  4. 4.
    Akyol, E., Rose, K.: A necessary and sufficient condition for transform optimality in source coding. In: IEEE International Symposium on Information Theory Proceedings (2011)Google Scholar
  5. 5.
    Nanjundaswamy, T., Rose, K.: Cascaded long term prediction for coding polyphonic audio signals. In: IEEE w/s on Applications of Signal Processing to Audio and Acoustics, New Paltz, NY, Oct 2011Google Scholar
  6. 6.
    Raut, R., Kullat, K.: SDR design with advanced algorithms for cognitive radio. IJACS 1(4), 134–141 (2011)Google Scholar
  7. 7.
    Srinivasan, P., Jameason, L.H.: High quality audio compression using an adaptive wavelet packet decomposition and psychoacoustic modelling. IEEE Trans. Signal Process. XX(Y) (1999)Google Scholar
  8. 8.
    Makhoul, J.: Linear prediction: a tutorial review. IEEE Proc. 63(4), 561–580CrossRefGoogle Scholar
  9. 9.
    Giacobello, D., Christensen, M.G., Murthi, M.N., Jenson, S.H., Moonen, M.: Speech coding based on sparse linear prediction. IEEE Trans. Acoust. Speech Signal Process. 15(5) (2010)Google Scholar
  10. 10.
    Joseph, S.M., Shah, F., Babu Anto P.: comparing speech compression using waveform coding and parametric coding. Int. J. Electron. Eng. 3(1), 35–38 (2011)Google Scholar
  11. 11.
    Giacobello, D., Christensen, M.G., Murthi, M.N., Jenson, S.H., Moonen, M.: Retrieving sparse patterns using a compressed sensing framework: applications to speech coding based on sparse linear prediction. IEEE Signal Proc. Lett. 17(1)CrossRefGoogle Scholar
  12. 12.
    Wichman, S.: Comparison of Speech Coding Algorithms ADPCM vs CELP. Department of Electrical Engineering. Texas University (1999)Google Scholar
  13. 13.
    Gersho, A.: Advances in speech and audio compression. IEEE Proc. 82(6) (1994)CrossRefGoogle Scholar
  14. 14.
    Kemper, G., Iano, Y.: An audio compression method based on wavelets subband coding. Lat. Am. Trans. IEEE (Revista IEEE Am.) 09(05), 610–621 (2011)CrossRefGoogle Scholar
  15. 15.
    Strahl, Stefan, Hansen, Heiko, Mertins, Alfred: A dynamic fine-grain scalable compression scheme with application to progressive audio coding. IEEE Trans. Audio Speech Lang. Process. 19(1), 14–23 (2011)CrossRefGoogle Scholar
  16. 16.
    Shirani, S.: Multimedia Communications Sub-band Coding. McMaster UniversityGoogle Scholar
  17. 17.
    Zang, Y., Zeytinoglu, M.: Improved lifting scheme for block subband coding. In: The proceedings of IEEE SP. International Conference on Electronic Devices, Systems and Applications (ICEDSA1999), pp 487–490 (1999)Google Scholar
  18. 18.
    Zhang, Y., Zeytinoglu, M.: block subband coding and time-varying orthogonal filter banks. In: The Proceedings of 1999 IEEE PacRim’99 Conference on Communications, Computers and Signal ProcessingGoogle Scholar
  19. 19.
    Vetterli, M., Herley, C.: Wavelets and filter banks: theory and design. IEEE Trans. Signal Proc. 40(9), 2207–2232 (2009)CrossRefGoogle Scholar

Copyright information

© Springer Nature Singapore Pte Ltd. 2019

Authors and Affiliations

  • R. Chinna Rao
    • 1
    Email author
  • D. Elizabath Rani
    • 2
  • S. Srinivasa Rao
    • 1
  1. 1.MallaReddy College of Engineering and TechnologyHyderabadIndia
  2. 2.Gandhi Institute of Technology and ManagementVisakhapatnamIndia

Personalised recommendations