Review of F0 Estimation in the Context of Indian Classical Music Expression Detection

  • Amit RegeEmail author
  • Ravi Sindal
Conference paper
Part of the Lecture Notes in Networks and Systems book series (LNNS, volume 100)


The work addresses the need of fast and accurate F0 detection method for faithful transcription of Indian classical music. Three prominent F0 detection methods, viz. discrete Fourier transform (DFT), constant Q transform (CQT), and YIN algorithm are described and compared on the basis of accuracy and frame size against simulated signals of standard MIDI note frequencies. The same analysis is repeated on recorded data containing vocal recitals of eight notes from an octave in the equal tempered musical scale. That YIN method is most accurate and applicable for small frame size and is concluded.


F0 Expression Ornamentation 



The authors acknowledge the support provided by IET-DAVV for availing necessary infrastructure to carry out the research. Moreover, the authors of [4] are also acknowledged for providing beautiful toolbox.


  1. 1.
    Rabiner LR (1977) On the use of autocorrelation analysis for pitch detection. IEEE Trans Acoust Speech Sig Process 25(1)CrossRefGoogle Scholar
  2. 2.
    Un CK, Yang SC (1977) A pitch extraction algorithm based on LPC inverse filtering and AMDF. IEEE Trans Acoust Speech Sig Process 25(6)Google Scholar
  3. 3.
    Gerhard D (2003) Pitch extraction and fundamental frequency: history and current techniques. Department of Computer Science, University of Regina, Regina, Saskatchewan, CanadaGoogle Scholar
  4. 4.
    Schoerkhuber C, Klapuri A (2010) Constant-Q transform toolbox for music processing. In: 7th sound and music computing conference, Barcelona, SpainGoogle Scholar
  5. 5.
    Ikemiya Y, Itoyama K, Okuno HG (2014) Transcribing vocal expression from polyphosnic music. In: ICASSP, Florence, ItalyGoogle Scholar
  6. 6.
    Sung D, Lee K (2014) Transcribing frequency modulated musical expressions from polyphonic music using HMM constrained shift invariant PLCA. In: Proceedings of tenth IEEE international conference on intelligent information hiding and multimedia signal processing (IIH-MSP)Google Scholar
  7. 7.
    Barbancho I, de la Bandera C, Barbancho AM, Tardon LJ (2009) Transcription and expressiveness detection system for violin music. In: IEEE International conference on acoustics, speech and signal processing (ICASSP), Taipei, Taiwan, pp 189–192Google Scholar
  8. 8.
    Klapuri A, Davy M (2006) Signal processing methods for music transcription. Springer, New YorkCrossRefGoogle Scholar
  9. 9.
    Polrolniczak E, Kramarczyk M (2015) Computer assessment of tremolo feature in context of evaluation of singing quality. Signal processing: algorithm, architecture, arrangements and applications, Sept 2015 Poznan, PolandGoogle Scholar
  10. 10.
    de Cheveigne A, Kawahara H (2002) YIN, a fundamental frequency estimator for speech and music. J Acoust Soc Am 111(4):1917–1930CrossRefGoogle Scholar

Copyright information

© Springer Nature Singapore Pte Ltd. 2020

Authors and Affiliations

  1. 1.Medicaps UniversityIndoreIndia
  2. 2.IET Devi Ahilya UniversityIndoreIndia

Personalised recommendations