Abstract
We describe a method for timbre-preserving pitch shifting of music signals that estimates the model order used in linear predictive coding (LPC). LPC estimates a transfer function by approximating it with an autoregressive (AR) model, which requires choosing an appropriate number of poles. For general audio signals, the appropriate number of poles varies with time because the number and kind of sound sources, such as musical instruments, change dynamically. To estimate the order, we use the inequality of arithmetic and geometric means (AM-GM inequality) and the fractional bandwidth at each pole of the model. The estimated order is then applied to LPC for timbre-preserving pitch shifting. A listening test scored by mean opinion score (MOS) shows that our approach improves the sound quality of pitch-shifted signals.
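To make the terms in the abstract concrete, the following is a minimal sketch of fixed-order LPC by the autocorrelation method (Levinson-Durbin recursion), followed by a measurement of the center frequency and bandwidth of a pole. It is not the order-estimation procedure of the paper, whose AM-GM criterion is not detailed in the abstract. The function names, the 8 kHz sampling rate, and the synthetic two-pole test signal are all assumptions made for illustration, and "fractional bandwidth" is taken here to mean bandwidth divided by center frequency.

```python
import cmath
import math


def autocorr(x, max_lag):
    """Autocorrelation r[0..max_lag] of a finite sequence (hypothetical helper)."""
    n = len(x)
    return [sum(x[i] * x[i + k] for i in range(n - k)) for k in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Solve the Yule-Walker equations for A(z) = 1 + a[1] z^-1 + ... + a[p] z^-p.

    Returns (a, err), where err is the residual (prediction error) energy.
    """
    a = [1.0] + [0.0] * order
    err = r[0]
    for m in range(1, order + 1):
        acc = r[m] + sum(a[k] * r[m - k] for k in range(1, m))
        k_m = -acc / err                      # reflection coefficient
        prev = a[:]
        for k in range(1, m):
            a[k] = prev[k] + k_m * prev[m - k]
        a[m] = k_m
        err *= 1.0 - k_m * k_m
    return a, err


def pole_frequency_and_bandwidth(pole, fs):
    """Center frequency and bandwidth (both in Hz) of one complex pole."""
    freq = abs(cmath.phase(pole)) * fs / (2.0 * math.pi)
    bandwidth = -math.log(abs(pole)) * fs / math.pi
    return freq, bandwidth


# Assumed test signal: impulse response of a two-pole resonator at 1 kHz.
fs = 8000.0
radius, theta = 0.95, 2.0 * math.pi * 1000.0 / fs
true_a1, true_a2 = -2.0 * radius * math.cos(theta), radius * radius

h = [0.0] * 2000
for n in range(len(h)):
    h[n] = 1.0 if n == 0 else 0.0
    if n >= 1:
        h[n] -= true_a1 * h[n - 1]
    if n >= 2:
        h[n] -= true_a2 * h[n - 2]

a, err = levinson_durbin(autocorr(h, 2), 2)

# For order 2 the poles follow from the quadratic z^2 + a[1] z + a[2] = 0.
pole = (-a[1] + cmath.sqrt(a[1] * a[1] - 4.0 * a[2])) / 2.0
freq, bandwidth = pole_frequency_and_bandwidth(pole, fs)
fractional_bandwidth = bandwidth / freq
```

In this sketch the order is fixed at 2 because the test signal is known to have two poles. For real music the appropriate order is unknown and changes over time, which is the problem the paper addresses.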
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Koshikawa, N., Murakami, T., Tanaka, T. (2008). Pitch Shifting of Music Based on Adaptive Order Estimation of Linear Predictor. In: Huang, YM.R., et al. Advances in Multimedia Information Processing - PCM 2008. PCM 2008. Lecture Notes in Computer Science, vol 5353. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-89796-5_5
DOI: https://doi.org/10.1007/978-3-540-89796-5_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-89795-8
Online ISBN: 978-3-540-89796-5
eBook Packages: Computer Science, Computer Science (R0)