Skip to main content

Discovering Musical Structure in Audio Recordings

  • Conference paper
  • First Online:
Music and Artificial Intelligence (ICMAI 2002)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2445))

Included in the following conference series:

Abstract

Music is often described in terms of the structure of repeated phrases. For example, many songs have the form AABA, where each letter represents an instance of a phrase. This research aims to construct descriptions or explanations of music in this form, using only audio recordings as input. A system of programs is described that transcribes the melody of a recording, identifies similar segments, clusters these segments to form patterns, and then constructs an explanation of the music in terms of these patterns. Additional work using spectral information rather than melodic transcription is also described. Examples of successful machine “listening” and music analysis are presented.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bartsch, M. and Wakefield, G.H., To Catch a Chorus: Using Chroma-Based Representations For Audio Thumbnailing. in Proceedings of the Workshop on Applications of Signal Processing to Audio and Acoustics, (2001), IEEE.

    Google Scholar 

  2. Birmingham, W.P., Dannenberg, R.B., Wakefield, G.H., Bartsch, M., Bykowski, D., Mazzoni, D., Meek, C., Mellody, M. and Rand, W., MUSART: Music Retrieval Via Aural Queries. in International Symposium on Music Information Retrieval, (Bloomington, Indiana, 2001), 73–81.

    Google Scholar 

  3. Coltrane, J. Naima Giant Steps, Atlantic Records, 1960.

    Google Scholar 

  4. Conklin, D. and Anagnostopoulou, C., Representation and Discovery of Multiple Viewpoint Patterns. in Proceedings of the 2001 International Computer Music Conference, (2001), International Computer Music Association, 479–485.

    Google Scholar 

  5. Cope, D. Experiments in Musical Intelligence. A-R Editions, Inc., Madison, Wisconsin, 1996.

    Google Scholar 

  6. Foote, J. and Cooper, M., Visualizing Musical Structure and Rhythm via Self-Similarity. in Proceedings of the 2001 International Computer Music Conference, (Havana, Cuba, 2001), International Computer Music Association, 419–422.

    Google Scholar 

  7. Goto, M., A Predominant-F0 Estimation Method for CD Recordings: MAP Estimation using EM Algorithm for Adaptive Tone Models. in 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing, (2001), IEEE, V–3365–3368.

    Google Scholar 

  8. Lartillot, O., Dubnov, S., Assayag, G. and Bejerano, G., Automatic Modeling of Musical Style. in Proceedings of the 2001 International Computer Music Conference, (2001), International Computer Music Association, 447–454.

    Google Scholar 

  9. Leyton, M. A Generative Theory of Shape. Springer, Berlin, 2001.

    MATH  Google Scholar 

  10. Mazzoni, D. and Dannenberg, R.B., Melody Matching Directly From Audio. in 2nd Annual International Symposium on Music Information Retrieval, (2001), Indiana University, 17–18.

    Google Scholar 

  11. Mongeau, M. and Sankoff, D. Comparison of Musical Sequences. in Hewlett, W. and Selfridge-Field, E. eds. Melodic Similarity Concepts, Procedures, and Applications, MIT Press, Cambridge, 1990.

    Google Scholar 

  12. Mont-Reynaud, B. and Goldstein, M., On Finding Rhythmic Patterns in Musical Lines. in Proceedings of the International Computer Music Conference 1985, (Vancouver, 1985), International Computer Music Association, 391–397.

    Google Scholar 

  13. Narmour, E. Music Expectation by Cognitive Rule-Mapping. Music Perception, 17 (3). 329–398.

    Google Scholar 

  14. Rabiner, L. On the use of autocorrelation analysis for pitch detection. IEEE Transactions on Acoustics, Speech, and Signal Processing, ASSP-25(1). 24–33.

    Google Scholar 

  15. Roads, C. Autocorrelation Pitch Detection. in The Computer Music Tutorial, MIT Press, 1996, 509–511.

    Google Scholar 

  16. Rodet, X. and Jaillet, F., Detection and Modeling of Fast Attack Transients. in Proceedings of the 2001 International Computer Music Conference, (2001), International Computer Music Association, 30–33.

    Google Scholar 

  17. Rolland, P.-Y. and Ganascia, J.-G. Musical pattern extraction and similarity assessment. in Miranda, E. ed. Readings in Music and Artificial Intelligence, Harwood Academic Publishers, 2000, 115–144.

    Google Scholar 

  18. Simon, H.A. and Sumner, R.K. Pattern in Music. in Kleinmuntz, B. ed. Formal Representation of Human Judgment, Wiley, New York, 1968.

    Google Scholar 

  19. Stammen, D. and Pennycook, B., Real-Time Recognition of Melodic Fragments Using the Dynamic Timewarp Algorithm. in Proceedings of the 1993 International Computer Music Conference, (Tokyo, 1993), International Computer Music Association, 232–235.

    Google Scholar 

  20. Wakefield, G.H., Mathematical Representation of Joint Time-Chroma Distributions. in International Symposium on Optical Science, Engineering, and Instrumentation, SPIE’99, (Denver, 1999).

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Dannenberg, R.B., Hu, N. (2002). Discovering Musical Structure in Audio Recordings. In: Anagnostopoulou, C., Ferrand, M., Smaill, A. (eds) Music and Artificial Intelligence. ICMAI 2002. Lecture Notes in Computer Science(), vol 2445. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45722-4_6

Download citation

  • DOI: https://doi.org/10.1007/3-540-45722-4_6

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-44145-8

  • Online ISBN: 978-3-540-45722-0

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics