Speech Modeling Using the Complex Cepstrum

Vondra, Martin; Vích, Robert

doi:10.1007/978-3-642-18184-9_27

Martin Vondra²¹ &
Robert Vích²¹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6456))

1171 Accesses
3 Citations

Abstract

Conventional cepstral speech modeling is based on the minimum phase parametric speech production model with infinite impulse response. In that approach only the logarithmic magnitude frequency response of the corresponding speech frame is approximated. In this contribution the principle of the cepstral speech modeling using the complex cepstrum is described. The obtained mixed-phase vocal tract model with finite impulse response contains also the information about the phase properties of the modeled speech frame. This model approximates the speech signal with higher accuracy than the model based on the real cepstrum, the numerical complexity and the memory requirements are at least twice greater.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

eBook: USD 16.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Zen, H., Tokuda, K., Black, A.W.: Statistical Parametric Speech Synthesis. Speech Communication 51, 1039–1064 (2009)
Article Google Scholar
Vích, R.: Cepstral Speech Model, Padé Approximation, Excitation and Gain Matching in Cepstral Speech Synthesis. In: Jan, J. (ed.) BIOSIGNAL 2000, pp. 77–82. VUTIUM, Brno (2000)
Google Scholar
Drugman, T., Moinet, A., Dutoit, T., Wilfart, G.: Using a Pitch-Synchronous Residual Codebook for Hybrid HMM/Frame Selection Speech Synthesis. In: IEEE ICASSP, Taipei, Taiwan, pp. 3793–3796 (2009)
Google Scholar
Quatieri, T.F.: Discrete-Time Speech Signal Processing, pp. 253–308. Prentice-Hall, Englewood Cliffs (2002)
Google Scholar
Drugman, T., Bozkurt, B.T., Dutoit, T.: Complex Cepstrum-based Decomposition of Speech for Glottal Source Estimation. In: Interspeech 2009, Brighton, U.K, pp. 116–119 (2009)
Google Scholar
Oppenheim, A.V., Schafer, R.W.: Discrete-Time Signal Processing, pp. 768–825. Prentice-Hall, Englewood Cliffs (1989)
MATH Google Scholar
Vích, R.: Z-transform Theory and Application, pp. 207–216. D. Reidel Publ. Comp., Dordrecht (1987)
MATH Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Photonics and Electronics, Academy of Sciences of the Czech Republic, Chaberska 57, CZ, 18251, Prague 8, Czech Republic
Martin Vondra & Robert Vích

Authors

Martin Vondra
View author publications
You can also search for this author in PubMed Google Scholar
Robert Vích
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute for Advanced Scientific Studies, Second University of Naples, and IIASS, Via Pellegrino 19, 84019, Vietri sul Mare (SA), Italy
Anna Esposito
Istituto Nazionale di Geofisica e Vulcanologia, Osservatorio Vesuviano, Via Diocleziano 328, 80124, Napoli, Italy
Antonietta M. Esposito
Dipartemento di Ingegneria dell’ Informazione, Seconda Università di Napoli, Via Roma 29, 81031, Aversa (CE), Italy
Raffaele Martone
Department of Humanities and Social Sciences, Anatolia College/ACT, Kennedy Street, 55510, Pylaia, Greece
Vincent C. Müller
Departmnet of Physics "E.R. Caoamoeööp", University of Salerno and IIASS, International Institute for Advanced Scientific Studies, 84081, Baronissi (SA), Italy
Gaetano Scarpetta

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Vondra, M., Vích, R. (2011). Speech Modeling Using the Complex Cepstrum. In: Esposito, A., Esposito, A.M., Martone, R., Müller, V.C., Scarpetta, G. (eds) Toward Autonomous, Adaptive, and Context-Aware Multimodal Interfaces. Theoretical and Practical Issues. Lecture Notes in Computer Science, vol 6456. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-18184-9_27

Download citation

DOI: https://doi.org/10.1007/978-3-642-18184-9_27
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-18183-2
Online ISBN: 978-3-642-18184-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics