Pitch Estimation

  • Mads G. Christensen


A key property of many sounds, and in particular those that we think of as music, is the pitch. In the context of music, the American Standard Association defines the term pitch as “that attribute of auditory sensation in terms of which sounds may be ordered on a musical scale.” As such, it is strictly speaking a perceptual phenomenon. It is, however, caused by physical stimuli that exhibit a certain behavior. Signals that cause the sensation of pitch are, broadly speaking, the signals that are well-described by a set of harmonically related sinusoids, meaning that their frequencies are approximately integral multiples of a fundamental frequency. Signals that have frequencies that are integral multiples of a fundamental frequency are what is commonly referred to as periodic. We have already discussed such periodic signals extensively in connection with comb filters, Fourier series, and phasors. Pitch is, of course, of interest in and of itself in music, as it is the essence of music, but it can also be used for many things in audio processing, such as auto-tuning, harmonizers, intelligent pitch shifting, and much more. Thus, no audio processing book would be complete without a chapter about pitch estimation!


  1. 31.
    M.G. Christensen, A. Jakobsson, Multi-Pitch Estimation. Synthesis Lectures on Speech & Audio Processing, vol. 5 (Morgan & Claypool Publishers, San Rafael, 2009)Google Scholar
  2. 32.
    J. Moorer, The optimum comb method of pitch period analysis of continuous digitized speech. IEEE Trans. Acoust. Speech Signal Process. 22(5), 330–338 (1974)CrossRefGoogle Scholar
  3. 33.
    R.L. Miller, E.S. Weibel, Measurement of the fundamental period of speech using a delay line. Presented at the 51st meeting of the Acoustical Society of America, 1956Google Scholar
  4. 34.
    J.S. Gill, Automatic extraction of the excitation function of speech with particular reference to the use of correlation methods. Presented at the 3rd I.C.A., Stuttgart, Germany, 1959Google Scholar
  5. 35.
    M. Noll, Pitch determination of human speech by harmonic product spectrum, the harmonic sum, and a maximum likelihood estimate, in Proceedings of the Symposium on Computer Processing Communications (1969), pp. 779–797Google Scholar

Copyright information

© Springer Nature Switzerland AG 2019

Authors and Affiliations

  • Mads G. Christensen
    • 1
  1. 1.Aalborg UniversityAalborgDenmark

Personalised recommendations