The Method of Main Vocal Melody Extraction Based on Harmonic Structure Analysis from Popular Song

Song, Chai-Jong; Lee, Seok-Pil; Seo, Kyung-Hack; Park, Hochong

doi:10.1007/978-94-007-2598-0_37

Chai-Jong Song⁵,
Seok-Pil Lee⁵,
Kyung-Hack Seo⁵ &
…
Hochong Park⁶

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 107))

1036 Accesses

Abstract

In this paper, we propose the method of main vocal melody extraction based on harmonic structure analysis technique from polyphonic music signal. It is the most important part of contents based music retrieval method which has mainly three parts. The first part is pitch estimation from humming signal, the second one is the melody extraction from polyphonic music signal and the last one is the matching engine which measure the distance between two vectors. The accuracy of melody extraction affects the overall system performance rather than any other parts. Human vocal track makes the harmonics like most musical instruments. This is one of the most important things that we have considered to utilize. So, we might extract the main vocal melody from the complicated mixed signal with musical instruments. We utilize harmonic structure analysis and track pitch sequence during three frames include current frame. The proposed method contains three major blocks named preprocessing, multi-pitch extraction with peak picking, fundamental frequency detection and the last part with pitch tracking, predominant melody detection. We have started this project with aiming for supporting commercial service for music portal provider, KARAOKE system and mobile devices.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Orio N (2006) Music information retrieval: a turorial and review. Found Trends Inf Retr 1:1–90
Article MATH Google Scholar
Downie JS (2008) The music information retrieval evaluation exchange (2005–2007): a window into music information retrieval research. Acoust Sci Tech 29:4
Google Scholar
Poliner G, Ellis DP, Ehamann AF, Gomez E, Streich S, Ong B (2007) Melody transcription from music audio: approaches and evaluation. IEEE Trans Audio Speech Lang Process 15(4):1066–1074
Article Google Scholar
Eggink J, Broown GJ (2004) Extracting melody lines from complex audio, ISMIR
Google Scholar
Klapuri AP (2003) Multiple fundamental frequency estimation by summing harmonic amplitude. IEEE Trans Speech Audio Process 8:6
Google Scholar
Goto M (2004) A real-time music scene description system: predominant-F0 estimation for detecting melody and bass lines in real-world audio signals. Speech Commun 43(4):311–329
Article Google Scholar
TIA-EIA-IS-127, Enhanced Variable Rate CODEC
Google Scholar
Audio melody extraction results. http://www.music-ir.org/mirex/2009/index.php/

Download references

Author information

Authors and Affiliations

Digital Media Research Center, KETI, #1599, Sangam-dong, Mapo-gu, Seoul, South Korea
Chai-Jong Song, Seok-Pil Lee & Kyung-Hack Seo
Department of Electronics Engineering, Kwangwoon University, Seoul, Republic of Korea
Hochong Park

Authors

Chai-Jong Song
View author publications
You can also search for this author in PubMed Google Scholar
Seok-Pil Lee
View author publications
You can also search for this author in PubMed Google Scholar
Kyung-Hack Seo
View author publications
You can also search for this author in PubMed Google Scholar
Hochong Park
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Chai-Jong Song .

Editor information

Editors and Affiliations

SeoulTech, Computer Science and Engineering, Seoul University of Science & Technology, Gongreung 2-dong 172, Seoul, 139-743, Korea, Republic of (South Korea)
James J. Park
, Computer Science, University of Georgia, GSRC 415, Athens, 30602-7404, Georgia, USA
Hamid Arabnia
, Business Administration, Daejin University, Hogukro 1007, Pocheon-Si, 487-711, Kyonggi-do, Korea, Republic of (South Korea)
Hang-Bae Chang
, Division of Information and Computer Eng, Ajou University, San 5, Suwon, Gyeonggido, 443-749, Korea, Republic of (South Korea)
Taeshik Shon

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Song, CJ., Lee, SP., Seo, KH., Park, H. (2011). The Method of Main Vocal Melody Extraction Based on Harmonic Structure Analysis from Popular Song. In: Park, J., Arabnia, H., Chang, HB., Shon, T. (eds) IT Convergence and Services. Lecture Notes in Electrical Engineering, vol 107. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-2598-0_37

Download citation

DOI: https://doi.org/10.1007/978-94-007-2598-0_37
Published: 01 November 2011
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-007-2597-3
Online ISBN: 978-94-007-2598-0
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics