Skip to main content
Log in

Note onset detection based on sparse decomposition

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Music onset detection is significant and essential for obtaining the high-level music features such as rhythm, beat, music paragraph and structure. The traditional methods for onset detection which employ Short Time Fourier Transform (STFT)-based or Wavelet Transform (WT)-based features to characterize music signal generally lack adaptiveness for representing the stationary and non-stationary part of the music signal. This will lead to the degraded performance for music note onset detection. To solve this problem, a new algorithm for note onset detection based on sparse decomposition is proposed. Firstly, the musical signals are sparsely decomposed with Matching Pursuit (MP), and then the hybrid detection algorithm which combines namely the Degree of Explanation (DE) and the Change of Partials (CP) is applied to the sparse representation of the music signal. Finally, a modified peak-picking algorithm is employed to generate onset vectors. The experiments on the dataset with 2050 onsets show that our results are superior to those of MIREX 2013. For the polyphonic music which is the most widely used form in our real life, the proposed algorithm has better performance than the other algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  1. Bello JP, Daudet L, Abdallah S, Duxbury C, Davies M, Sandler MB (2005) A tutorial on onset detection in music signals. IEEE Trans Speech Audio Process 13(5):1035–1047

    Article  Google Scholar 

  2. Bello JP, Sandler M (2003) Phase-based note onset detection for music signals. Proceedings of 28thIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP ‘03), April 6–10, 2003

  3. Benetos E, Simon D (2011) Polyphonic music transcription using note onset and offset detection. Proceedings of 36th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP ‘11), May 22–27, 2011, Prague, Czech Republic

  4. Benetos E, Stylianou Y (2010) Auditory spectrum-based pitched instrument onset detection. IEEE Trans Audio Speech Lang Process 18(8):1968–1977

    Article  Google Scholar 

  5. Bingkun B, Guangyu Z, Jialie S, Shuicheng Y (2013) Robust image analysis with sparse representation on quantized visual features. IEEE Trans Image Process 22(3):860–871

    Article  MathSciNet  Google Scholar 

  6. Daudet L (2001) Transients modeling by pruned wavelet trees. Proceedings of Computer Music Conf (ICMC‘01), at Havana, Cuba

  7. Daudet L (2006) Sparse and structured decompositions of signals with the molecular matching pursuit. IEEE Trans Audio Speech Lang Process 14(5):1808–1816

    Article  Google Scholar 

  8. Davy M, Godsill S (2002) Detection of abrupt spectral changes using support vector machines. Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP-02), at Orlando, FL

  9. Duxbury C, Sandler M, Davies M A hybrid approach to musical note onset detection. Proceedings of Digital Audio Effects Conf. (DAFX, ‘02), at Hamburg, Germany

  10. Fusan Z (2005) Basic music theory tutorial (revised). Shandong University Press

  11. Goto M (2001) An audio-based real-time beat tracking system for music with or without drum-sounds. J New Music Res 30(2):159–171

    Article  Google Scholar 

  12. Gribonval R, Bacry E (2003) Harmonic decomposition of audio signals with matching pursuit. IEEE Trans Signal Process 51(1):101–111

    Article  MathSciNet  Google Scholar 

  13. Holzapfel A, Stylianou Y, Gedik AC, Bozkurt B (2010) Three dimensions of pitched instrument onset detection. IEEE Trans Audio Speech Lang Process 18(6):1517–1527

    Article  Google Scholar 

  14. Hoon H, Dooyong S, Kyogu L (2013) Note onset detection based on harmonic cepstrum regularity. Proceedings of 2013 I.E. International Conference on Multimedia and Expo (ICME2013), July 15–19, 2013, San Jose, CA

  15. Hui LT, Yongwei Z, Chaisorn L, Rahardja S (2010) Audio onset detection using energy-based and pitch-based processing. Proceedings of 2010 I.E. International Symposium on Circuits and Systems (ISCAS2010), May 30, 2010

  16. Jaggi S, Karl WC, Mallat S, Willsky AS (1998) High-resolution pursuit for feature extraction. Appl Comput Harmon Anal 5(4):428–449

    Article  MATH  Google Scholar 

  17. Jia-Yin S, Hai-Feng L, Lei L (2009) ANN based music chord perception. J Tsinghua Univ Sci Tech 49(s1):1369–1374, 1379

    Google Scholar 

  18. Krstulovic S, Gribonval R (2006) Mptk: matching pursuit made tractable. Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, (ICASSP 2006), May 14–19, 2006

  19. Mallat SG, Zhifeng Z (1993) Matching pursuits with time-frequency dictionaries. IEEE Trans Signal Process 41(12):3397–3415

    Article  MATH  Google Scholar 

  20. Masri P (1996) Computer modeling of sound for transformation and synthesis of musical signal. Univ. of Bristol, Bristol

    Google Scholar 

  21. Pertusa A (2010) Computationally efficient methods for polyphonic music transcription. PhD Thesis. Universidad de Alicante

  22. Ravelli E, Richard G, Daudet L (2008) Union of MDCT Bases for audio coding. IEEE Trans Audio Speech Lang Process 16(8):1361–1372

    Article  Google Scholar 

  23. Rodet X, Jaillet F (2001) Detection and modeling of fast attack transients. Proceedings of the Computer Music, at Havana, Cuba

  24. Wenming G, Xi S, Ruifan L (2011) A new algorithm for note onset detection. Comput Eng Appl 47(24):9–11

    Google Scholar 

  25. Xiangbin S, Ming L, Jiehong W, Lin Z (2010) An improved onset detection algorithm. Proceedings of the International Conference on Internet Technology and Applications, Aug. 20–22, 2010

  26. Yanan F, Qiang L, Xin G (2011) Entropy of teager energy in wavelet-domain algorithm applied in note onset detection. Proceedings of Seventh International Conference on Natural Computation (ICNC2011), July 26–28, 2011

Download references

Acknowledgments

This work is supported by the National Nature Science Foundation of China under Grant No. 60902065, No. 61401227, and by Beijing Natural Science Foundation(No.4152053).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xi Shao.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Shao, X., Gui, W. & Xu, C. Note onset detection based on sparse decomposition. Multimed Tools Appl 75, 2613–2631 (2016). https://doi.org/10.1007/s11042-015-2656-8

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-015-2656-8

Keywords

Navigation