Transcribing Bach Chorales Using Particle Swarm Optimisations
This paper reports a novel application of particle swarm optimisation (PSO) to the task of polyphonic transcription. The system transforms input audio into activation strengths of pitches in the desired range. This transformation takes the audio from the time domain to the frequency domain and, finally, to activation strengths of pitches (a.k.a. a piano-roll representation). We can infer the likely sounding pitches by comparing the observed activation strengths of the input audio to reference Tone-models. Although each Tone-model is learned offline from the pitches one wishes to transcribe, the learned model only approximates the characteristics of a sounding note, because volume and other effects vary with the manner in which notes are executed. Hence, predicting sounding notes based solely on Tone-models gives inaccurate predictions. Here, we apply PSO to search for an optimum aggregation of different predicted pitches that best represents the input activation strengths. We describe our problem formulation and the design of our approach. The experimental results suggest that the approach has potential for polyphonic transcription.
Keywords: Particle swarm optimisation, Polyphonic transcription, Tone-models, Transcribing Bach’s Chorales
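The search described in the abstract can be illustrated with a minimal sketch. It is not the authors' formulation, which the abstract does not specify. It assumes that each Tone-model is a fixed activation template, that an aggregation is a vector of per-pitch weights in [0, 1], and that fitness is the squared error between the observed activation and the weighted sum of templates. The templates, the PSO parameters (`inertia`, `c1`, `c2`), the 0.3 detection threshold and the function names are all hypothetical choices made for illustration.

```python
import random


def reconstruction_error(weights, templates, observed):
    """Squared error between the observed activation and a weighted sum of templates."""
    error = 0.0
    for k, target in enumerate(observed):
        estimate = sum(w * t[k] for w, t in zip(weights, templates))
        error += (estimate - target) ** 2
    return error


def pso_fit(templates, observed, n_particles=30, n_iters=200,
            inertia=0.7, c1=1.5, c2=1.5, seed=0):
    """Global-best PSO over per-pitch weights in [0, 1] (assumed search space)."""
    rng = random.Random(seed)
    dim = len(templates)
    pos = [[rng.random() for _ in range(dim)] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    best_pos = [p[:] for p in pos]
    best_err = [reconstruction_error(p, templates, observed) for p in pos]
    g = min(range(n_particles), key=best_err.__getitem__)
    gbest, gbest_err = best_pos[g][:], best_err[g]

    for _ in range(n_iters):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                # Standard velocity update: inertia + cognitive pull + social pull.
                vel[i][d] = (inertia * vel[i][d]
                             + c1 * r1 * (best_pos[i][d] - pos[i][d])
                             + c2 * r2 * (gbest[d] - pos[i][d]))
                # Keep each weight inside the assumed [0, 1] range.
                pos[i][d] = min(1.0, max(0.0, pos[i][d] + vel[i][d]))
            err = reconstruction_error(pos[i], templates, observed)
            if err < best_err[i]:
                best_pos[i], best_err[i] = pos[i][:], err
                if err < gbest_err:
                    gbest, gbest_err = pos[i][:], err
    return gbest, gbest_err


# Toy Tone-models over six pitch bins (C4, E4, G4, C5, E5, G5); values are invented.
pitches = ["C4", "E4", "G4"]
templates = [
    [1.0, 0.0, 0.0, 0.5, 0.0, 0.2],  # C4 with energy leaking into C5 and G5
    [0.0, 1.0, 0.0, 0.0, 0.4, 0.0],  # E4 with energy leaking into E5
    [0.0, 0.0, 1.0, 0.0, 0.0, 0.3],  # G4 with energy leaking into G5
]
# Observed frame: C4 and G4 sounding at volumes that differ from the templates.
observed = [0.8 * c + 0.6 * g for c, g in zip(templates[0], templates[2])]

weights, err = pso_fit(templates, observed)
detected = [p for p, w in zip(pitches, weights) if w > 0.3]
print(detected, [round(w, 2) for w in weights], err)
```

In this toy frame the fitted weights absorb the volume mismatch between the learned templates and the played notes, which is the kind of variation the abstract says makes direct Tone-model matching unreliable.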