Abstract
Music is often described in terms of the structure of repeated phrases. For example, many songs have the form AABA, where each letter represents an instance of a phrase. This research aims to construct descriptions or explanations of music in this form, using only audio recordings as input. A system of programs is described that transcribes the melody of a recording, identifies similar segments, clusters these segments to form patterns, and then constructs an explanation of the music in terms of these patterns. Additional work using spectral information rather than melodic transcription is also described. Examples of successful machine “listening” and music analysis are presented.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bartsch, M. and Wakefield, G.H., To Catch a Chorus: Using Chroma-Based Representations For Audio Thumbnailing. in Proceedings of the Workshop on Applications of Signal Processing to Audio and Acoustics, (2001), IEEE.
Birmingham, W.P., Dannenberg, R.B., Wakefield, G.H., Bartsch, M., Bykowski, D., Mazzoni, D., Meek, C., Mellody, M. and Rand, W., MUSART: Music Retrieval Via Aural Queries. in International Symposium on Music Information Retrieval, (Bloomington, Indiana, 2001), 73–81.
Coltrane, J. Naima Giant Steps, Atlantic Records, 1960.
Conklin, D. and Anagnostopoulou, C., Representation and Discovery of Multiple Viewpoint Patterns. in Proceedings of the 2001 International Computer Music Conference, (2001), International Computer Music Association, 479–485.
Cope, D. Experiments in Musical Intelligence. A-R Editions, Inc., Madison, Wisconsin, 1996.
Foote, J. and Cooper, M., Visualizing Musical Structure and Rhythm via Self-Similarity. in Proceedings of the 2001 International Computer Music Conference, (Havana, Cuba, 2001), International Computer Music Association, 419–422.
Goto, M., A Predominant-F0 Estimation Method for CD Recordings: MAP Estimation using EM Algorithm for Adaptive Tone Models. in 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing, (2001), IEEE, V–3365–3368.
Lartillot, O., Dubnov, S., Assayag, G. and Bejerano, G., Automatic Modeling of Musical Style. in Proceedings of the 2001 International Computer Music Conference, (2001), International Computer Music Association, 447–454.
Leyton, M. A Generative Theory of Shape. Springer, Berlin, 2001.
Mazzoni, D. and Dannenberg, R.B., Melody Matching Directly From Audio. in 2nd Annual International Symposium on Music Information Retrieval, (2001), Indiana University, 17–18.
Mongeau, M. and Sankoff, D. Comparison of Musical Sequences. in Hewlett, W. and Selfridge-Field, E. eds. Melodic Similarity Concepts, Procedures, and Applications, MIT Press, Cambridge, 1990.
Mont-Reynaud, B. and Goldstein, M., On Finding Rhythmic Patterns in Musical Lines. in Proceedings of the International Computer Music Conference 1985, (Vancouver, 1985), International Computer Music Association, 391–397.
Narmour, E. Music Expectation by Cognitive Rule-Mapping. Music Perception, 17 (3). 329–398.
Rabiner, L. On the use of autocorrelation analysis for pitch detection. IEEE Transactions on Acoustics, Speech, and Signal Processing, ASSP-25(1). 24–33.
Roads, C. Autocorrelation Pitch Detection. in The Computer Music Tutorial, MIT Press, 1996, 509–511.
Rodet, X. and Jaillet, F., Detection and Modeling of Fast Attack Transients. in Proceedings of the 2001 International Computer Music Conference, (2001), International Computer Music Association, 30–33.
Rolland, P.-Y. and Ganascia, J.-G. Musical pattern extraction and similarity assessment. in Miranda, E. ed. Readings in Music and Artificial Intelligence, Harwood Academic Publishers, 2000, 115–144.
Simon, H.A. and Sumner, R.K. Pattern in Music. in Kleinmuntz, B. ed. Formal Representation of Human Judgment, Wiley, New York, 1968.
Stammen, D. and Pennycook, B., Real-Time Recognition of Melodic Fragments Using the Dynamic Timewarp Algorithm. in Proceedings of the 1993 International Computer Music Conference, (Tokyo, 1993), International Computer Music Association, 232–235.
Wakefield, G.H., Mathematical Representation of Joint Time-Chroma Distributions. in International Symposium on Optical Science, Engineering, and Instrumentation, SPIE’99, (Denver, 1999).
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Dannenberg, R.B., Hu, N. (2002). Discovering Musical Structure in Audio Recordings. In: Anagnostopoulou, C., Ferrand, M., Smaill, A. (eds) Music and Artificial Intelligence. ICMAI 2002. Lecture Notes in Computer Science(), vol 2445. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45722-4_6
Download citation
DOI: https://doi.org/10.1007/3-540-45722-4_6
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44145-8
Online ISBN: 978-3-540-45722-0
eBook Packages: Springer Book Archive