Abstract
We have developed a music retrieval system that receives a humming query and finds similar audio intervals (segments) in a musical audio database. This system enables a user to retrieve a segment of a desired musical audio signal just by singing its melody. In this paper, we propose a method to summarize the music database through similarity analysis to thereby reduce the retrieval time. The distance of chroma vectors is used as a similarity measure. The key technique for summarization includes, mainly, a statistical smoothing method and a method of discriminant analysis. Practical experiments were conducted using 115 musical audio selections in the RWC popular music database. We report the summarization ratio as about 45%.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Bartsch, M. and Wakefield, G.H. (2001) To Catch a Chorus: Using Chroma-Based Representations For Audio Thumbnailing. in Proceedings, of the Workshop on Applications of Signal Processing to Audio and Acoustics, IEEE.
Cooper, M. and Foote, J. (2002) Automatic music summarization via similarity analysis, Proc. ISMIR, 81–85.
Dannenberg, R.B. and Hu, N. (2002) Pattern Discovery Techniques for Music Audio, Proc. ISMIR 2002, 63–70.
Goto, M. (2001) A Predominant-F0 Estimation Method for CD Recordings: MAP Estimation using EM Algorithm for Adaptive Tone Models, Proc. ICASSP 2001, V-3365–3368.
Goto, M., Hashiguchi, H., Nishimura, T. and Oka, R. (2002) RWC Music Database: Popular, Classical, and Jazz Music Databases, Proc. ISMIR 2002, 287–288.
Goto, M. (2002) A Real-time Music Scene Description System: A Chorus-Section Detecting Method, 2002-MUS-47-6, 2002 (100), 27–34 (in Japanese).
Goto, M. (2003) A Chorus-Section Detecting Method for Musical Audio Signals, Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2003), pp. V-437–440.
Goto, M. (2003) SmartMusicKIOSK: Music Listening Station with Chorus-Search Function, Proceedings of 16th Annual ACM symposium on User Interface Software and Technology (UIST 2003), pp. 31–40.
Hashiguchi, H., Nishimura, T., Takita, T., Zhang, J.X. and Oka, R. (2001) Music Signal Spotting Retrieval by a Humming Query, Proceedings of Fifth World Multi-Conference on Systemics, Cybernetics and Informatics, VII 280–284.
Nishimura, T., Hashiguchi, H., Takita, J., Zhang, J.X. and Oka, R. (2001) Music Signal Spotting Retrieval by a Humming Query Using Start Frame Feature Dependent Continuous Dynamic Programming, Proc. ISMIR 2001, 211–218.
Ohtsu. N. (1979) A threshold selection method from gray-level histograms, IEEE Trans. SMC, SMC-9(1), 62–66.
Peeters, G., Burthe, A.L. and Rodet, X. (2002) Toward automatic music audio summary generation from signal analysis, Proc. ISMIR 2002, 94–100.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer
About this paper
Cite this paper
Hashiguchi, H. (2008). Visualizing Similarity among Estimated Melody Sequences from Musical Audio. In: Tsubaki, H., Yamada, S., Nishina, K. (eds) The Grammar of Technology Development. Springer, Tokyo. https://doi.org/10.1007/978-4-431-75232-5_15
Download citation
DOI: https://doi.org/10.1007/978-4-431-75232-5_15
Received:
Accepted:
Publisher Name: Springer, Tokyo
Print ISBN: 978-4-431-75231-8
Online ISBN: 978-4-431-75232-5
eBook Packages: EngineeringEngineering (R0)