Discovering Musical Structure in Audio Recordings

Dannenberg, Roger B.; Hu, Ning

doi:10.1007/3-540-45722-4_6

Roger B. Dannenberg³ &
Ning Hu³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2445))

Included in the following conference series:

International Conference on Music and Artificial Intelligence

1155 Accesses
10 Citations

Abstract

Music is often described in terms of the structure of repeated phrases. For example, many songs have the form AABA, where each letter represents an instance of a phrase. This research aims to construct descriptions or explanations of music in this form, using only audio recordings as input. A system of programs is described that transcribes the melody of a recording, identifies similar segments, clusters these segments to form patterns, and then constructs an explanation of the music in terms of these patterns. Additional work using spectral information rather than melodic transcription is also described. Examples of successful machine “listening” and music analysis are presented.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bartsch, M. and Wakefield, G.H., To Catch a Chorus: Using Chroma-Based Representations For Audio Thumbnailing. in Proceedings of the Workshop on Applications of Signal Processing to Audio and Acoustics, (2001), IEEE.
Google Scholar
Birmingham, W.P., Dannenberg, R.B., Wakefield, G.H., Bartsch, M., Bykowski, D., Mazzoni, D., Meek, C., Mellody, M. and Rand, W., MUSART: Music Retrieval Via Aural Queries. in International Symposium on Music Information Retrieval, (Bloomington, Indiana, 2001), 73–81.
Google Scholar
Coltrane, J. Naima Giant Steps, Atlantic Records, 1960.
Google Scholar
Conklin, D. and Anagnostopoulou, C., Representation and Discovery of Multiple Viewpoint Patterns. in Proceedings of the 2001 International Computer Music Conference, (2001), International Computer Music Association, 479–485.
Google Scholar
Cope, D. Experiments in Musical Intelligence. A-R Editions, Inc., Madison, Wisconsin, 1996.
Google Scholar
Foote, J. and Cooper, M., Visualizing Musical Structure and Rhythm via Self-Similarity. in Proceedings of the 2001 International Computer Music Conference, (Havana, Cuba, 2001), International Computer Music Association, 419–422.
Google Scholar
Goto, M., A Predominant-F0 Estimation Method for CD Recordings: MAP Estimation using EM Algorithm for Adaptive Tone Models. in 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing, (2001), IEEE, V–3365–3368.
Google Scholar
Lartillot, O., Dubnov, S., Assayag, G. and Bejerano, G., Automatic Modeling of Musical Style. in Proceedings of the 2001 International Computer Music Conference, (2001), International Computer Music Association, 447–454.
Google Scholar
Leyton, M. A Generative Theory of Shape. Springer, Berlin, 2001.
MATH Google Scholar
Mazzoni, D. and Dannenberg, R.B., Melody Matching Directly From Audio. in 2nd Annual International Symposium on Music Information Retrieval, (2001), Indiana University, 17–18.
Google Scholar
Mongeau, M. and Sankoff, D. Comparison of Musical Sequences. in Hewlett, W. and Selfridge-Field, E. eds. Melodic Similarity Concepts, Procedures, and Applications, MIT Press, Cambridge, 1990.
Google Scholar
Mont-Reynaud, B. and Goldstein, M., On Finding Rhythmic Patterns in Musical Lines. in Proceedings of the International Computer Music Conference 1985, (Vancouver, 1985), International Computer Music Association, 391–397.
Google Scholar
Narmour, E. Music Expectation by Cognitive Rule-Mapping. Music Perception, 17 (3). 329–398.
Google Scholar
Rabiner, L. On the use of autocorrelation analysis for pitch detection. IEEE Transactions on Acoustics, Speech, and Signal Processing, ASSP-25(1). 24–33.
Google Scholar
Roads, C. Autocorrelation Pitch Detection. in The Computer Music Tutorial, MIT Press, 1996, 509–511.
Google Scholar
Rodet, X. and Jaillet, F., Detection and Modeling of Fast Attack Transients. in Proceedings of the 2001 International Computer Music Conference, (2001), International Computer Music Association, 30–33.
Google Scholar
Rolland, P.-Y. and Ganascia, J.-G. Musical pattern extraction and similarity assessment. in Miranda, E. ed. Readings in Music and Artificial Intelligence, Harwood Academic Publishers, 2000, 115–144.
Google Scholar
Simon, H.A. and Sumner, R.K. Pattern in Music. in Kleinmuntz, B. ed. Formal Representation of Human Judgment, Wiley, New York, 1968.
Google Scholar
Stammen, D. and Pennycook, B., Real-Time Recognition of Melodic Fragments Using the Dynamic Timewarp Algorithm. in Proceedings of the 1993 International Computer Music Conference, (Tokyo, 1993), International Computer Music Association, 232–235.
Google Scholar
Wakefield, G.H., Mathematical Representation of Joint Time-Chroma Distributions. in International Symposium on Optical Science, Engineering, and Instrumentation, SPIE’99, (Denver, 1999).
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computer Science, Carnegie Mellon University, 15217, Pittsburgh, PA, USA
Roger B. Dannenberg & Ning Hu

Authors

Roger B. Dannenberg
View author publications
You can also search for this author in PubMed Google Scholar
Ning Hu
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Music, University of Edinburgh, 12 Nicolson Square, EH8 9DF, Edinburgh, UK
Christina Anagnostopoulou & Miguel Ferrand &
Division of Informatics, University of Edinburgh, 80 South Bridge, EH1 1HN, Edinburgh, UK
Alan Smaill

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Dannenberg, R.B., Hu, N. (2002). Discovering Musical Structure in Audio Recordings. In: Anagnostopoulou, C., Ferrand, M., Smaill, A. (eds) Music and Artificial Intelligence. ICMAI 2002. Lecture Notes in Computer Science(), vol 2445. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45722-4_6

Download citation

DOI: https://doi.org/10.1007/3-540-45722-4_6
Published: 27 August 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44145-8
Online ISBN: 978-3-540-45722-0
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics