Tune Retrieval in the Multimedia Library

McNab, Rodger J.; Smith, Lloyd A.; Witten, Ian H.; Henderson, Clare L.

doi:10.1023/A:1009606600500

Tune Retrieval in the Multimedia Library

Published: April 2000

Volume 10, pages 113–132, (2000)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Rodger J. McNab¹,
Lloyd A. Smith²,
Ian H. Witten³ &
…
Clare L. Henderson⁴

88 Accesses
18 Citations
Explore all metrics

Abstract

Musical scores are traditionally retrieved by title, composer or subject classification. Just as multimedia computer systems increase the range of opportunities available for presenting musical information, so they also offer new ways of posing musically-oriented queries. This paper shows how scores can be retrieved from a database on the basis of a few notes sung or hummed into a microphone. The design of such a facility raises several interesting issues pertaining to music retrieval. We first describe an interface that transcribes acoustic input into standard music notation. We then analyze string matching requirements for ranked retrieval of music and present the results of an experiment which tests how accurately people sing well known melodies. The performance of several string matching criteria are analyzed using two folk song databases. Finally, we describe a prototype system which has been developed for retrieval of tunes from acoustic input and evaluate its performance.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

A. Askenfelt, “Automatic notation of played music: the Visa project,” IAML Conference, Lisbon 1978, pp. 109–121.
Google Scholar
J. Backus, The Acoustical Foundations of Music, Norton and Co., New York, 1969.
Google Scholar
D. Bainbridge and T.C. Bell, “An extensible optical music recognition system,” in Proc. 19th Australasian Computer Science Conf., Melbourne, January 1996, pp. 308-317.
B. Bauer, The New Real Book, Sher Music Co., Petaluma, CA, 1988.
Google Scholar
M.J. Bishop and E.A. Thompson, “Maximum likelihood alignment of DNA sequences,” J. Molecular Biology, Vol. 190, pp. 159–165, 1986.
Google Scholar
N.P. Carter, “Automatic recognition of printed music in the context of electronic publishing,” Ph.D. thesis, University of Surrey, UK, February 1989.
Google Scholar
A. Cohen and N. Cohen, “Tune evolution as an indicator of traditional musical norms,” J. American Folklore, Vol. 86, No. 339, pp. 37–47, 1973.
Google Scholar
D. Deutsch, “Octave generalization and tune recognition,” Perception and Psychophysics, Vol. 11, No. 6, pp. 411–412, 1972.
Google Scholar
W.J. Dowling, “Scale and contour: Two components of a theory of memory for melodies,” Psychological Review, Vol. 85, No. 4, pp. 341–354, 1978.
Google Scholar
Z. Galil and K. Park, “An improved algorithm for approximate string matching,” SIAM J. Comput., Vol. 19, No. 6, pp. 989–999, 1990.
Google Scholar
A. Ghias, J. Logan, D. Chamberlin, and B.C. Smith, “Query by humming,” in Proc. ACM Multimedia 95, San Francisco, November 1995.
B. Gold and L. Rabiner, “Parallel processing techniques for estimating pitch periods of speech in the time domain,” J. Acoust. Soc. Am., Vol. 46, No. 2, pp. 442–448, 1969.
Google Scholar
C.A. Goodrum and H.W. Dalrymple, Guide to the Library of Congress, Library of Congress, Washington, D.C., 1982.
M. Hawley, “The personal orchestra,” Computing Systems, Vol. 3, No. 2, pp. 289–329, 1990.
Google Scholar
W. Hess, Pitch Determination of Speech Signals, Springer-Verlag, New York, 1983.
Google Scholar
S. Loeb, “Architecting personalized delivery of multimedia information,” Commun. ACM, Vol. 35, No. 12, pp. 39–50, 1992.
Google Scholar
R. Lowrance and R.A. Wagner, “An extension of the string-to-string correction problem,” J. ACM, Vol. 22, No. 2, pp. 177–183, 1975.
Google Scholar
R.J. McNab, L.A. Smith, and I.H. Witten, “Signal processing for melody transcription,” in Proc. 19th Australasian Computer Science Conf., Melbourne, January 1996, pp. 301-307.
M. Mongeau and D. Sankoff, “Comparison of musical sequences,” Computers and the Humanities, Vol. 24, pp. 161–175, 1990.
Google Scholar
D. Parsons, The Directory of Tunes and Musical Themes, Spencer Brown, Cambridge, 1975.
Google Scholar
D. Sankoff and J.B. Kruskal (Eds.), TimeWarps, String Edits, and Macromolecules: The Theory and Practice of Sequence Comparison, Addison-Wesley, 1983.
R. Sedgewick, Algorithms, Addison-Wesley, Reading, Massachusetts, 1988.
Google Scholar
E. Selfridge-Field, “Optical recognition of music notation: A survey of current work,” Computing in Musicology, Vol. 9, pp. 109–145, 1994.
Google Scholar
J. Sloboda, “Music performance,” in The Psychology of Music, D. Deutsch (Ed.), Academic Press, 1982, pp. 479-496.
K. Steiglitz, T.W. Parks, and J.F. Kaiser, “METEOR: A constraint-based FIR filter design program,” IEEE Trans. Signal Proc., Vol. 40, No. 8, pp. 1901–1909, 1992.
Google Scholar
J. Sundberg and B. Lindblom, “Generative theories in language and music descriptions,” Cognition, Vol. 4, pp. 99–122, 1976.
Google Scholar
R.A.Wagner and M.J. Fischer, “The string-to-string correction problem,” J. ACM, Vol. 21, No. 1, pp. 168–173, 1974.
Google Scholar
A.Waibel and B. Yegnanaryana, “Comparative study of nonlinear warping techniques in isolated word speech recognition systems,” IEEE Trans. Acoustics, Speech, and Signal Proc., Vol. 31, No. 6, pp. 1582–1586, 1983.
Google Scholar
S. Wu and U. Manber, “Fast text searching allowing errors,” Commun. ACM, Vol. 35, No. 10, pp. 83–91 1992.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, University of Waikato, Hamilton, New Zealand
Rodger J. McNab
Department of Computer Science, University of Waikato, Hamilton, New Zealand
Lloyd A. Smith
Department of Computer Science, University of Waikato, Hamilton, New Zealand
Ian H. Witten
School of Education, University of Waikato, Hamilton, New Zealand
Clare L. Henderson

Authors

Rodger J. McNab
View author publications
You can also search for this author in PubMed Google Scholar
Lloyd A. Smith
View author publications
You can also search for this author in PubMed Google Scholar
Ian H. Witten
View author publications
You can also search for this author in PubMed Google Scholar
Clare L. Henderson
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

McNab, R.J., Smith, L.A., Witten, I.H. et al. Tune Retrieval in the Multimedia Library. Multimedia Tools and Applications 10, 113–132 (2000). https://doi.org/10.1023/A:1009606600500

Download citation

Issue Date: April 2000
DOI: https://doi.org/10.1023/A:1009606600500

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Tune Retrieval in the Multimedia Library

Abstract

Access this article

Similar content being viewed by others

Detecting Features for a Music Retrieval System

From Audio to Music Notation

Creating a Reliable Music Discovery and Recommendation System

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Navigation

Tune Retrieval in the Multimedia Library

Abstract

Access this article

Similar content being viewed by others

Detecting Features for a Music Retrieval System

From Audio to Music Notation

Creating a Reliable Music Discovery and Recommendation System

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation