Evaluation of Nonlinear Tempo Modification Methods Based on Sinusoidal Modeling

  • Kosuke Nakamura
  • Yuya Chiba
  • Takashi Nose
  • Akinori Ito
Conference paper
Part of the Smart Innovation, Systems and Technologies book series (SIST, volume 82)


Modifying the tempo of a musical signal is one of the basic operations in music signal processing, and many methods have been proposed. Nishino et al. proposed a nonlinear tempo modification method based on a sinusoidal model, but the method was not sufficiently evaluated. In this paper, we evaluate tempo modification methods that combine a sinusoidal model with nonlinear stretching and compression of the signal. Specifically, we compare the effectiveness of using the residue signal and of several methods for determining the stretchable parts. The experimental results confirm the effectiveness of nonlinear tempo modification. The effect of the residue signal and of the method for determining the stretchable parts, however, depended on the input signal.
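The idea of nonlinear stretching can be illustrated with a small sketch. It is not the method evaluated in the paper: it only assumes that the signal has already been divided into segments labelled stretchable or non-stretchable (how those labels are obtained is exactly what the paper compares), and that non-stretchable segments keep their original duration while stretchable segments share the whole tempo change equally. The function name `nonlinear_stretch_factors` and the segment representation are hypothetical.

```python
def nonlinear_stretch_factors(segments, ratio):
    """Compute a per-segment time-stretch factor (illustrative sketch only).

    segments: list of (duration_in_seconds, stretchable) pairs.
    ratio: target total duration divided by original total duration
           (ratio > 1 slows the music down, ratio < 1 speeds it up).
    Non-stretchable segments get factor 1.0; stretchable segments share
    one common factor chosen so that the overall ratio is met.
    """
    total = sum(d for d, _ in segments)
    fixed = sum(d for d, stretchable in segments if not stretchable)
    flexible = total - fixed
    if flexible <= 0:
        raise ValueError("at least one stretchable segment is required")

    # Duration the stretchable segments must occupy after modification.
    target_flexible = ratio * total - fixed
    if target_flexible <= 0:
        raise ValueError("ratio is too small to keep fixed segments unchanged")

    alpha = target_flexible / flexible
    return [alpha if stretchable else 1.0 for _, stretchable in segments]


# Hypothetical example: two short fixed segments and two longer stretchable ones.
segments = [(0.05, False), (0.45, True), (0.05, False), (0.45, True)]
factors = nonlinear_stretch_factors(segments, ratio=1.5)
new_total = sum(d * f for (d, _), f in zip(segments, factors))
```

With a linear method every segment would be stretched by the same factor 1.5; here the fixed segments are left untouched and the stretchable segments are stretched slightly more to compensate, so the total duration still grows by the requested ratio.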


Keywords: Music signal processing, Tempo modification, Sinusoidal model


  1. Davies, M.E.P., Hamel, P., Yoshii, K., Goto, M.: AutoMashUpper: automatic creation of multi-song music mashups. IEEE/ACM Trans. Audio Speech Lang. Proc. 22(12), 1726–1737 (2014)
  2. Bruegge, B., Teschner, C., Lachenmaier, P., Fenzl, E., Schmidt, D., Bierbaum, S.: Pinocchio: conducting a virtual symphony orchestra. In: Proceedings of the International Conference on Advances in Computer Entertainment Technology, pp. 294–295 (2007)
  3. Fabiani, M., Friberg, A.: Rule-based expressive modifications of tempo in polyphonic audio recordings. In: International Symposium on Computer Music Modeling and Retrieval, pp. 288–302 (2007)
  4. Berthaut, F., Desainte-Catherine, M., Hachet, M.: Drile: an immersive environment for hierarchical live-looping. In: Proceedings of the New Interface for Musical Expression, p. 192 (2010)
  5. Dolson, M.: The phase vocoder: a tutorial. Comput. Music J. 10(4), 14–27 (1986)
  6. Moulines, E., Charpentier, F.: Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones. Speech Commun. 9(5–6), 453–467 (1990)
  7. Verhelst, W., Roelands, M.: An overlap-add technique based on waveform similarity (WSOLA) for high quality time-scale modification of speech. In: Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, pp. 554–557 (1993)
  8. Malah, D.: Time-domain algorithms for harmonic bandwidth reduction and time scaling of speech signals. IEEE Trans. Acoust. Speech Signal Process. 27(2), 121–133 (1979)
  9. Igarashi, Y., Ito, M., Ito, A.: Evaluation of sinusoidal modeling for polyphonic music signal. In: Proceedings of the International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP), pp. 464–467 (2013)
  10. Ito, A., Igarashi, Y., Ito, M., Nose, T.: Tempo modification of music signal using sinusoidal model and LPC-based residue model. In: Proceedings of the International Congress on Sound and Vibration (2014)
  11. Nishino, T., Nose, T., Ito, A.: Tempo modification of mixed music signal by nonlinear time scaling and sinusoidal modeling. In: Proceedings of the International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP), pp. 146–149 (2015)
  12. McAulay, R.J., Quatieri, T.F.: Speech analysis/synthesis based on a sinusoidal representation. IEEE Trans. Acoust. Speech Signal Process. 34(4), 744–754 (1986)
  13. Ito, M., Yano, M.: Sinusoidal modeling for nonstationary voiced speech based on a local vector transform. J. Acoust. Soc. Am. 121, 1717 (2007)
  14. Ding, Y., Qian, X.: Processing of musical tones using a combined quadratic polynomial phase sinusoid and residual (QUASER) signal model. J. Audio Eng. Soc. 45(7/8), 571–584 (1997)
  15. Alonso, M., David, B., Richard, G.: Tempo and beat estimation of music signals. In: Proceedings of the ISMIR, pp. 158–164 (2004)
  16. Nishino, T., Nose, T., Ito, A.: Deciding expandable sections for nonlinear changing play back speed of music signals. In: Proceedings of the ASJ Spring Meeting, 2-10-13 (2016). (in Japanese)
  17. International Telecommunication Union: Method for objective measurements of perceived audio quality. ITU-R BS.1387-1 (2001)
  18. Peaqb-fast. Accessed 1 Mar 2017

Copyright information

© Springer International Publishing AG 2018

Authors and Affiliations

  • Kosuke Nakamura (1, 2), corresponding author
  • Yuya Chiba (1, 2)
  • Takashi Nose (1, 2)
  • Akinori Ito (1, 2)

  1. Faculty of Engineering, Tohoku University, Sendai, Japan
  2. Graduate School of Engineering, Tohoku University, Sendai, Japan
