Query by Humming
The goal of a Query by Humming system is to allow a user to find a song by humming part of the tune. No musical training is needed. The problem is still unsolved. Some systems have low retrieval precision because they rely on melodic contour information from the hum tune, which in turn relies on the error-prone note segmentation process. Some systems yield better precision when matching the melody directly from audio, but they are slow because of their extensive use of Dynamic Time Warping (DTW) (see Chapter 4). HumFinder [106, 107] improves both the retrieval precision and speed compared to previous approaches. We treat music as a time series and exploit and improve well-developed techniques from time series databases to indexing the music for fast similarity queries. We improve on existing DTW indexes technique by introducing the concept of envelope transforms, which gives a general guideline for extending existing dimensionality reduction methods to DTW indexes. The net result is high scalability. We test our system through experiments. Please read this approach as a case study of the techniques you have seen, not as a complete solution to this hard problem.
KeywordsDiscrete Wavelet Transform Dynamic Time Warping Indexing Scheme Absolute Pitch Time Series Approach
Unable to display preview. Download preview PDF.