
Mel-Frequency Cepstral and Linear Predictive Coefficients

  • Jérôme Sueur
Part of the Use R! book series (USE R)

Abstract

Mel-frequency cepstral coefficients (MFCCs) and linear predictive coefficients (LPCs) are features used to describe sound in terms of time, frequency, and amplitude. These techniques, which are used mainly in speech analysis, are reviewed step by step to support both understanding and practice.

Audio files: hello.wav


Copyright information

© Springer International Publishing AG, part of Springer Nature 2018

Authors and Affiliations

  • Jérôme Sueur (1)
  1. Muséum National d’Histoire naturelle, Paris, France
