Abstract
Music can be described and represented in many different ways including sheet music, symbolic representations, and audio recordings. For each of these representations, there may exist different versions that correspond to the same musical work. For example, for Beethoven’s Fifth Symphony one can find a large number of music recordings performed by different orchestras and conductors. The general goal of music synchronization is to automatically link the various data streams, thus interrelating the multiple information sets related to a given musical work. More precisely, synchronization is taken to mean a procedure which, for a given position in one representation of a piece of music, determines the corresponding position within another representation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
V. ARIFI, M. CLAUSEN, F. KURTH, AND M. MÜLLER, Synchronization of music data in score-, MIDI- and PCM-format, Computing in Musicology, 13 (2004), pp. 9–33.
M. A. BARTSCH AND G. H. WAKEFIELD, Audio thumbnailing of popular music using chroma-based representations, IEEE Transactions on Multimedia, 7 (2005), pp. 96–104.
J. P. BELLO AND J. PICKENS, A robust mid-level representation for harmonic content in music signals, in Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), London, UK, 2005, pp. 304–311.
H.-J. BÖCKENHAUER AND D. BONGARTZ, Algorithmische Grundlagen der Bioinformatik: Modelle, Methoden und Komplexität, Teubner, 2003.
J. C. BROWN AND M. S. PUCKETTE, An efficient algorithm for the calculation of a constant Q transform, Journal of the Acoustic Society of America (JASA), 92 (1992), pp. 2698–2701.
A. CONT, A coupled duration-focused architecture for real-time music-to-score alignment, IEEE Transactions on Pattern Analysis and Machine Intelligence, 32 (2010), pp. 974–987.
T. H. CORMEN, C. E. LEISERSON, R. L. RIVEST, AND C. STEIN, Introduction to Algorithms, McGraw-Hill Higher Education, 2001.
D. DAMM, A Digital Library Framework for Heterogeneous Music Collections–from Document Acquisition to Cross-Modal Interaction, PhD thesis, University of Bonn, 2013.
D. DAMM, C. FREMEREY, V. THOMAS, M. CLAUSEN, F. KURTH, AND M. MÜLLER, A digital library framework for heterogeneous music collections: from document acquisition to cross-modal interaction, International Journal on Digital Libraries: Special Issue on Music Digital Libraries, 12 (2012), pp. 53–71.
R. B. DANNENBERG, An on-line algorithm for real-time accompaniment, in Proceedings of the International Computer Music Conference (ICMC), Paris, France, 1984, pp. 193–198.
R. B. DANNENBERG AND N. HU, Polyphonic audio matching for score following and intelligent audio editors, in Proceedings of the International Computer Music Conference (ICMC), San Francisco, USA, 2003, pp. 27–34.
R. B. DANNENBERG AND C. RAPHAEL, Music score alignment and computer accompaniment, Communications of the ACM, Special Issue: Music Information Retrieval, 49 (2006), pp. 38–43.
S. DIXON AND G. WIDMER, MATCH: A music alignment tool chest, in Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), London, UK, 2005.
Z. DUAN AND B. PARDO, A state space model for online polyphonic audio-score alignment, in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Prague, Czech Republic, 2011, pp. 197–200.
D. P. ELLIS AND G. E. POLINER, Identifying ‘cover songs’ with chroma features and dynamic programming beat tracking, in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 4, Honolulu, Hawaii, USA, 2007.
S. EWERT, M. MÜLLER, AND P. GROSCHE, High resolution audio synchronization using chroma onset features, in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Taipei, Taiwan, 2009, pp. 1869–1872.
C. FREMEREY, Automatic Organization of Digital Music Documents – Sheet Music and Audio, PhD thesis, University of Bonn, 2010.
C. FREMEREY, F. KURTH, M. MÜLLER, AND M. CLAUSEN, A demonstration of the SyncPlayer system, in Proceedings of the International Conference on Music Information Retrieval (ISMIR), Vienna, Austria, 2007, pp. 131–132.
C. FREMEREY, M. MÜLLER, AND M. CLAUSEN, Handling repeats and jumps in score-performance synchronization, in Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), Utrecht, The Netherlands, 2010, pp. 243–248.
H. FUJIHARA, M. GOTO, J. OGATA, AND H. G. OKUNO, LyricSynchronizer: Automatic synchronization system between musical audio signals and lyrics, IEEE Journal of Selected Topics in Signal Processing, 5 (2011), pp. 1252–1261.
T. FUJISHIMA, Realtime chord recognition of musical sound: A system using common lisp music, in Proceedings of the International Computer Music Conference (ICMC), Beijing, 1999, pp. 464–467.
E. GÓMEZ, Tonal Description of Music Audio Signals, PhD thesis, UPF Barcelona, 2006.
M. GOTO, A chorus-section detecting method for musical audio signals, in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Hong Kong, China, 2003, pp. 437–440.
——, A chorus section detection method for musical audio signals and its application to a music listening station, IEEE Transactions on Audio, Speech, and Language Processing, 14 (2006), pp. 1783–1794.
N. HU, R. B. DANNENBERG, AND G. TZANETAKIS, Polyphonic audio matching and alignment for music retrieval, in Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, NY, USA, 2003.
Ö. IZMIRLI AND R. B. DANNENBERG, Understanding features and distance functions for music sequence alignment, in Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), Utrecht, The Netherlands, 2010, pp. 411–416.
C. JODER, S. ESSID, AND G. RICHARD, A comparative study of tonal acoustic features for a symbolic level music-to-score alignment, in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Dallas, Texas, USA, 2010.
——, A conditional random field framework for robust and scalable audio-to-score matching, IEEE Transactions on Audio, Speech, and Language Processing, 19 (2011), pp. 2385–2397.
——, Optimizing the mapping from a symbolic to an audio representation for music-to-score alignment, in Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, NY, USA, 2011, pp. 121–124.
M.-Y. KAN, Y. WANG, D. ISKANDAR, T. L. NWE, AND A. SHENOY, LyricAlly: Automatic synchronization of textual lyrics to acoustic music signals, IEEE Transactions on Audio, Speech, and Language Processing, 16 (2008), pp. 338–349.
E. KEOGH, Exact indexing of dynamic time warping, in Proceedings of the VLDB Conference, Hong Kong, 2002, pp. 406–417.
E. KEOGH AND M. PAZZANI, Iterative deepening dynamic time warping for time series, in Proceedings of the SIAM International Conference on Data Mining, Arlington, Virginia, USA, 2002.
A. P. KLAPURI, Multipitch analysis of polyphonic music and speech signals using an auditory model, IEEE Transactions on Audio, Speech, and Language Processing, 16 (2008), pp. 255–266.
C. L. KRUMHANSL, Cognitive foundations of musical pitch, Oxford University Press, 1990.
F. KURTH, D. DAMM, C. FREMEREY, M. MÜLLER, AND M. CLAUSEN, A framework for managing multimodal digitized music collections, in ECDL, 2008, pp. 334–345.
F. KURTH AND M. MÜLLER, Efficient index-based audio matching, IEEE Transactions on Audio, Speech, and Language Processing, 16 (2008), pp. 382–395.
F. KURTH, M. MÜLLER, C. FREMEREY, Y. HA CHANG, AND M. CLAUSEN, Automated synchronization of scanned sheet music with audio recordings, in Proceedings of the International Conference on Music Information Retrieval (ISMIR), Vienna, Austria, 2007, pp. 261–266.
J. LANGNER AND W. GOEBL, Visualizing expressive performance in tempo-loudness space, Computer Music Journal, 27 (2003), pp. 69–83.
M. LAST, A. KANDEL, AND H. BUNKE, eds., Data Mining in Time Series Databases, World Scientific, 2004.
V. I. LEVENSHTEIN, Binary codes capable of correcting deletions, insertions, and reversals, Soviet Physics Doklady, 10 (1966), pp. 707–710.
R. MACRAE, J. NEUMANN, X. ANGUERA, N. OLIVER, AND S. DIXON, Real-time synchronisation of multimedia streams in a mobile device, in Proceedings of the IEEE International Conference on Multimedia and Expo (ICME), Barcelona, Spain, 2011, pp. 1–6.
M. MAUCH AND S. DIXON, Simultaneous estimation of chords and musical context from audio, IEEE Transactions on Audio, Speech, and Language Processing, 18 (2010), pp. 1280–1289.
M. MAUCH, H. FUJIHARA, AND M. GOTO, Integrating additional chord information into HMM-based lyrics-to-audio alignment, IEEE Transactions on Audio, Speech, and Language Processing, 20 (2012), pp. 200–210.
N. MONTECCHIO AND A. CONT, A unified approach to real time audio-to-score and audio-to-audio alignment using sequential Montecarlo inference techniques, in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Prague, Czech Republic, 2011, pp. 193–196.
M. MÜLLER, Information Retrieval for Music and Motion, Springer Verlag, 2007.
M. MÜLLER AND D. APPELT, Path-constrained partial music synchronization, in Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Las Vegas, Nevada, USA, 2008, pp. 65–68.
M. MÜLLER AND M. CLAUSEN, Transposition-invariant self-similarity matrices, in Proceedings of the International Conference on Music Information Retrieval (ISMIR), Vienna, Austria, 2007, pp. 47–50.
M. MÜLLER AND S. EWERT, Towards timbre-invariant audio features for harmony-based music, IEEE Transactions on Audio, Speech, and Language Processing, 18 (2010), pp. 649–662.
——, Chroma Toolbox: MATLAB implementations for extracting variants of chroma-based audio features, in Proceedings of the International Conference on Music Information Retrieval (ISMIR), Miami, Florida, USA, 2011, pp. 215–220.
M. MÜLLER, V. KONZ, N. JIANG, AND Z. ZUO, A multi-perspective user interface for music signal analysis, in Proceedings of the International Computer Music Conference (ICMC), Huddersfield, UK, 2011, pp. 205–211.
M. MÜLLER, V. KONZ, A. SCHARFSTEIN, S. EWERT, AND M. CLAUSEN, Towards automated extraction of tempo parameters from expressive music recordings, in Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), Kobe, Japan, 2009, pp. 69–74.
M. MÜLLER, F. KURTH, AND M. CLAUSEN, Chroma-based statistical audio features for audio matching, in Proceedings of the Workshop on Applications of Signal Processing (WASPAA), New Paltz, New York, USA, 2005, pp. 275–278.
M. MÜLLER, H. MATTES, AND F. KURTH, An efficient multiscale approach to audio synchronization, in Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), Victoria, Canada, 2006, pp. 192–197.
N. ORIO, S. LEMOUTON, AND D. SCHWARZ, Score following: State of the art and new developments, in Proceedings of the International Conference on New Interfaces for Musical Expression (NIME), Montreal, Canada, 2003, pp. 36–41.
J. PAULUS, M. MÜLLER, AND A. P. KLAPURI, Audio-based music structure analysis, in Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), Utrecht, The Netherlands, 2010, pp. 625–636.
L. RABINER AND B.-H. JUANG, Fundamentals of Speech Recognition, Prentice Hall Signal Processing Series, 1993.
C. RAPHAEL, A probabilistic expert system for automatic musical accompaniment, Journal of Computational and Graphical Statistics, 10 (2001), pp. 487–512.
——, A hybrid graphical model for aligning polyphonic audio with musical scores, in Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), Barcelona, Spain, 2004, pp. 387–394.
S. SALVADOR AND P. CHAN, FastDTW: Toward accurate dynamic time warping in linear time and space, in Proceedings of the KDD Workshop on Mining Temporal and Sequential Data, 2004.
C. S. SAPP, Comparative analysis of multiple musical performances, in Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), Vienna, Austria, 2007, pp. 497–500.
C. SCHÖRKHUBER AND A. P. KLAPURI, Constant-Q transform toolbox for music processing, in Sound and Music Computing Conference (SMC), Barcelona, Spain, 2010.
J. SERRÀ, E. GÓMEZ, P. HERRERA, AND X. SERRA, Chroma binary similarity and local alignment applied to cover song identification, IEEE Transactions on Audio, Speech, and Language Processing, 16 (2008), pp. 1138–1151.
D. STOWELL AND M. PLUMBLEY, Adaptive whitening for improved real-time audio onset detection, in Proceedings of the International Computer Music Conference (ICMC), Copenhagen, Denmark, 2007.
V. THOMAS, Music Synchronization, Audio Matching, Pattern Detection, and User Interfaces for a Digital Music Library System, PhD thesis, University of Bonn, 2013.
V. THOMAS, C. FREMEREY, D. DAMM, AND M. CLAUSEN, SLAVE: a Score-Lyrics-Audio-Video-Explorer, in Proceedings of the International Conference on Music Information Retrieval (ISMIR), Kobe, Japan, 2009, pp. 717–722.
V. THOMAS, C. FREMEREY, M. MÜLLER, AND M. CLAUSEN, Linking sheet music and audio - challenges and new approaches, in Multimodal Music Processing, M. Müller, M. Goto, and M. Schedl, eds., vol. 3 of Dagstuhl Follow-Ups, Schloss Dagstuhl–Leibniz-Zentrum für Informatik, Dagstuhl, Germany, 2012, pp. 1–22.
R. J. TURETSKY AND D. P. ELLIS, Ground-truth transcriptions of real music from force-aligned MIDI syntheses, in Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), Baltimore, Maryland, USA, 2003, pp. 135–141.
M. VLACHOS, D. GUNOPULOS, AND G. KOLLIOS, Discovering similar multidimensional trajectories, in Proceedings of the International Conference on Data Engineering (ICDE), San Jose, California, USA, 2002, pp. 673–684.
H. VON LOESCH AND S. WEINZIERL, eds., Gemessene Interpretation – Computergestützte Aufführungsanalyse im Kreuzverhör der Disziplinen, Schott Verlag, Mainz, 2011.
G. WIDMER, Using AI and machine learning to study expressive music performance: project survey and first report, AI Communications, 14 (2001), pp. 149–162.
G. WIDMER, S. DIXON, W. GOEBL, E. PAMPALK, AND A. TOBUDIC, In search of the Horowitz factor, AI Magazine, 24 (2003), pp. 111–130.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Müller, M. (2015). Music Synchronization. In: Fundamentals of Music Processing. Springer, Cham. https://doi.org/10.1007/978-3-319-21945-5_3
Download citation
DOI: https://doi.org/10.1007/978-3-319-21945-5_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-21944-8
Online ISBN: 978-3-319-21945-5
eBook Packages: Computer ScienceComputer Science (R0)