Comparison and Automatic Detection

  • Jérôme Sueur
Part of the Use R! book series (USE R)


Cross-correlation of amplitude envelopes, frequency spectra, and spectrograms are evoked together with the computation of the frequency coherence as solutions to compare two sounds. The dynamic time warping technique, that seeks for the best alignments of time series of unequal length, is also covered. A recipe is provided to run a supervised binary automatic identification over a series of recordings, that to seek automatically for a sound of interest in a large audio dataset.

Audio files:M-XV_20101125_150000.wavAllobates_femoralis_2015-11-10_161500_GFT.wavAllobates_femoralis.wav


  1. Davis SB, Mermelstein P (1980) Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Trans Acoust Speech Signal Process 28:357–366CrossRefGoogle Scholar
  2. Fawcett T (2006) An introduction to ROC analysis. Pattern Recogn Lett 27:861–874CrossRefGoogle Scholar
  3. Giorgino T (2009) Computing and visualizing dynamic time warping alignments in R: the dtw package. J Stat Softw 31:1–24CrossRefGoogle Scholar
  4. Katz J, Hafner SD, Donovan T (2016a) Assessment of error rates in acoustic monitoring with the R package monitor. Bioacoustics 25:177–196CrossRefGoogle Scholar
  5. Katz J, Hafner SD, Donovan T (2016b) Tools for automated acoustic monitoring within the R package monitor. Bioacoustics 25:191–210Google Scholar
  6. Sakoe H, Chiba S (1978) Dynamic programming algorithm optimization for spoken word recognition. IEEE Trans Acoust Speech Signal Process 26:43–49CrossRefGoogle Scholar
  7. Towsey M, Planitz B, Nantes A, Wimmer J, Roe P (2012) A toolbox for animal call recognition. Bioacoustics 21:107–125CrossRefGoogle Scholar

Copyright information

© Springer International Publishing AG, part of Springer Nature 2018

Authors and Affiliations

  • Jérôme Sueur
    • 1
  1. 1.Muséum National d’Histoire naturelleParisFrance

Personalised recommendations