Analysis/Synthesis Speech Model Based on the Pitch-Tracking Periodic-Aperiodic Decomposition

  • Piotr Zubrycki
  • Alexander A. Petrovsky
Conference paper


This paper presents a speech analysis/synthesis model based on periodic-aperiodic decomposition. In presented approach, decomposition is performed in whole speech band without making identification of voiced/unvoiced regions. Other important feature is pitch-tracking ability of decomposition algorithm. For this purpose a new pitch-tracking transformation called Time-Varying Discrete Fourier Transform (TVDFT) is employed. Periodic component is modelled as a sum of pitch harmonics with amplitudes and phases estimated with TVDFT. Aperiodic component is defined as a difference between original speech signal and synthesised periodic component. TVDFT needs accurate fundamental pitch estimation. This paper also presents a robust pitch estimation.. Experimental results showing advantages of suggested model are also given.


speech decomposition Time-Varying DFT 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

7 References

  1. [1]
    Kondoz A.M., “Digital speech: coding for low bit rate communication systems”, John Wiley & Sons, Inc., New York, 1996.Google Scholar
  2. [2]
    Spanias A.S., „Speech coding: a tutorial review“, Proc. IEEE, Vol. 82, No. 10, pp. 1541–1582, 1994.CrossRefGoogle Scholar
  3. [3]
    Almeida L.B., Tribolet J.M., “Harmonic Coding: A Low Bit-Rate, Good Quality, Speech Coding Technique”, Proc. IEEE Int. Conf. on Accoust., Speech and Signal Processing, pp. 1664–1667, 1982.Google Scholar
  4. [4]
    McAulay R.J., Quatieri T.F., „Sinusoidal Coding“ in “Speech Coding and Synthesis” (W. Klein and K. Palival, eds.), Elsevier Science Publishers, Amsterdam, 1995.Google Scholar
  5. [5]
    George E.B., Smith M.J.T., “Speech Analysis/Synthesis and Modification Using an Analysis-by-Synthesis/Overlap-Add Sinusoidal Model”, IEEE Trans, on Speech and Audio Processing, Vol 5, No. 5, pp. 389–406, 1997.CrossRefGoogle Scholar
  6. [6]
    Stylianou Y., „Applying the Harmonic Plus Noise Mode in Concatenative Speech Synthesis“ IEEE Trans, on Speech and Audio Processing, Vol. 9, No 1., pp. 21–29, 2001.CrossRefGoogle Scholar
  7. [7]
    Griffin D.W., Lim J.S., „Multiband Excitation Vocoder“, IEEE Trans, on Acoust., Speech and Signal Processing, Vol. ASSP-36, pp. 1223–1235, 1988.CrossRefGoogle Scholar
  8. [8]
    B. Yegnanarayana, C. d'Alessandro, V. Darsions, “An Iterative Algorithm for Decomposiiton of Speech Signals into Periodic and Aperiodic Components”, IEEE Trans. On Speech and Audio Coding, Vol. 6, No. 1, pp. 1–11, 1998.CrossRefGoogle Scholar
  9. [9]
    Jackson P.J.B., Shadle C.H., “Pitch-Scaled Estimation of Simultaneous Voiced and Turbulence-Noise Components in Speech”, IEEE Trans. On Speech and Audio Processing, Vol. 9, No. 7, pp. 713–726, 2001CrossRefGoogle Scholar
  10. [10]
    Sercov V., Petrovsky A., „An Improved Speech Model with Allowance for Time-Varying Pitch Harmonic Amplitudes and Frequencies in Low Bit-Rate MBE Coders”, Proc. of the 6ht European Conf. on Speech Communication and Technology EUROSPEECH'99, pp. 1479–1482 Budapest, Hungary, 1999.Google Scholar
  11. [11]
    Petrovsky A., Sercov V., “Low Bit-Rate AbS Spectral Coding Based on the Harmonic Analysis of Speech Agreed Upon with Time-Varying Pitch Frequency and Psychoacoustical Optimization”, Proc. of Nordic Signal Processing Symposium NORSIG2000, pp. 45–48, 2000.Google Scholar
  12. [12]
    Petrovsky A., Zubrycki P., Sawicki A., Tonal and noise components separation based on a pitch synchronous DFT analyzer as a speech coding method // Proc. of European Conference on Circuit Theory and Devices ECCTD2003, Vol. III, pp. 169–172, 2003.Google Scholar
  13. [13]
    Eric W. M. Yu, Cheung-Fat Chan, A harmonic+noise coder with improved transient speech performance // Proc. of European Signal Processing Conference EUSIPCO'99, Special Session “Speech Coding”, 1999.Google Scholar
  14. [14]
    Sondhi M.M., New Methods of Pitch Extraction, IEEE Trans, on Audio and Electroacoustics, Vol. AU-16, No. 2, pp. 262–266, 1968.CrossRefGoogle Scholar

Copyright information

© Springer Science+Business Media, Inc. 2005

Authors and Affiliations

  • Piotr Zubrycki
    • 1
  • Alexander A. Petrovsky
    • 1
  1. 1.Bialystok Technical UniversityPoland

Personalised recommendations