Abstract
In this paper, we propose the method of main vocal melody extraction based on harmonic structure analysis technique from polyphonic music signal. It is the most important part of contents based music retrieval method which has mainly three parts. The first part is pitch estimation from humming signal, the second one is the melody extraction from polyphonic music signal and the last one is the matching engine which measure the distance between two vectors. The accuracy of melody extraction affects the overall system performance rather than any other parts. Human vocal track makes the harmonics like most musical instruments. This is one of the most important things that we have considered to utilize. So, we might extract the main vocal melody from the complicated mixed signal with musical instruments. We utilize harmonic structure analysis and track pitch sequence during three frames include current frame. The proposed method contains three major blocks named preprocessing, multi-pitch extraction with peak picking, fundamental frequency detection and the last part with pitch tracking, predominant melody detection. We have started this project with aiming for supporting commercial service for music portal provider, KARAOKE system and mobile devices.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Orio N (2006) Music information retrieval: a turorial and review. Found Trends Inf Retr 1:1–90
Downie JS (2008) The music information retrieval evaluation exchange (2005–2007): a window into music information retrieval research. Acoust Sci Tech 29:4
Poliner G, Ellis DP, Ehamann AF, Gomez E, Streich S, Ong B (2007) Melody transcription from music audio: approaches and evaluation. IEEE Trans Audio Speech Lang Process 15(4):1066–1074
Eggink J, Broown GJ (2004) Extracting melody lines from complex audio, ISMIR
Klapuri AP (2003) Multiple fundamental frequency estimation by summing harmonic amplitude. IEEE Trans Speech Audio Process 8:6
Goto M (2004) A real-time music scene description system: predominant-F0 estimation for detecting melody and bass lines in real-world audio signals. Speech Commun 43(4):311–329
TIA-EIA-IS-127, Enhanced Variable Rate CODEC
Audio melody extraction results. http://www.music-ir.org/mirex/2009/index.php/
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer Science+Business Media B.V.
About this paper
Cite this paper
Song, CJ., Lee, SP., Seo, KH., Park, H. (2011). The Method of Main Vocal Melody Extraction Based on Harmonic Structure Analysis from Popular Song. In: Park, J., Arabnia, H., Chang, HB., Shon, T. (eds) IT Convergence and Services. Lecture Notes in Electrical Engineering, vol 107. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-2598-0_37
Download citation
DOI: https://doi.org/10.1007/978-94-007-2598-0_37
Published:
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-007-2597-3
Online ISBN: 978-94-007-2598-0
eBook Packages: EngineeringEngineering (R0)