Abstract
The emergence of multimedia data in databases requires adequate methods for information retrieval. In a music data retrieval system by humming, the first stage is to extract exact pitch periods from a flow of signals. Due to the complexity of speech signals, it is difficult to make a robust and practical pitch tracking system. We adopt genetic algorithm in optimizing the control parameters for note segmentation and pitch determination. We applied the results to HumSearch, a commercialized product, as a pitch tracking engine. Experimental results showed that the proposed engine notably improved the performance of the existing engine in HumSearch.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Stephen Handel. Listening: An introduction to the perception of auditory events. In The MIT Press, 1989.
Ta Chou, Arbee L.P., and C.C. Liu Chen. Music database: indexing technique and implementation. In Proc. of IEEE, International Workshop on Multimedia data base Menagement System, 1996.
A. Ghais, J. Logan, D. Chamberlin, and B.C. Smith. Query by humming. In Proceeding ACM Multimedia 95, 1995.
Rodger J. McNab, Lloyd A. Smith, Ian H. Witten, Cclar L. Henderson, and Sally J. Cunningham. Towards the digital music library: Tune retrieval from acoustic input. In Proc. ACM Digital Libraries, 1996.
Sassan Ahmadi and Andreas S. Spanias. Cepstrum-based pitch detection using a new statistical v/uv classification algorithm. IEEE Trans. Speech and Audio Processing, 7(3):333–338, 1999.
Mingyang Wu, DeLiang Wang, and G.J. Brown. Pitch tracking based on statistical anticipation. In Proceedings. IJCNN’ 01. International Joint Conference, volume 2, pages 866–871, 2001.
L.R. Rabiner. On the use of autocorrelation analysis for pitch detection. IEEE Trans. Acoust, Speech, Signal Processing, ASSP-25(1):24–33, 1977.
Ross. M, J, H.L. Shaffer, A. Cohen, R. Freudberg, and H. J. Manley. Average magnitude difference function pitch extractor. IEEE Trans. Acoust, Speech, Signal Processing, ASSP-22(1):353–362, 1977.
Noll A.M. Cepstrum pitch determination. Journal of Acoust, Soc, Amer., 41:293–309, 1967.
T. Shimomuro and H. Kobayashi. Weighted autocorrelation for pitch extraction of noisy speech. IEEE Trans. Speech and Audio Processing, 9(7):727–730, 2001.
Shimamura T. Kunieda N. and Suzuki J. Robust method of measurement of fundamental frequency by aclos: autocorrelation of log spectrum. In IEEE International Conference Acoustics, Speech, and Signal Processing, pages 232–235, 1996.
J.K. Kaiser. On a simple algorithm to calculate the ‘energy’ of a signal. In Proceeding IEEE ICASSP, pages 381–384, 1990.
D. E. Goldberg, K. Deb, and B. Korb. Do not worry, be messy. In Proceedings of the Fourth International Conference on Genetic Algorithms, pages 24–30, 1991.
Syswerda. Uniform crossover in genetic algorithms. In International Conference on Genetic Algorithms, pages 2–9, 1989.
D. Cavicchio. Adaptive Search Using Simulated Evolution. PhD thesis, University of Michigan, Ann Arbor, MI, 1970. Unpublished.
D. Whitley and J. Kauth. GENITOR: A different genetic algorithm. In Proceedings of Rocky Mountain Conference on Artificial Intelligence, pages 118–130, 1988.
B. Efron and R. Tibshirani. Cross-validation and the bootstrap: Estimating the error rate of a prediction rule. In Technical Report (TR-477), Dept. of Statistics, Stanford University., 1995.
R. Kohavi. A study of cross-validation and bootstrap of accuracy estimation and model selection. In Proceedings of hte Rourteenth International Joint Conference on Artificial Intelligence, pages 1137–1143, 1995.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Choi, YS., Moon, BR. (2003). Parameter Optimization by a Genetic Algorithm for a Pitch Tracking System. In: Cantú-Paz, E., et al. Genetic and Evolutionary Computation — GECCO 2003. GECCO 2003. Lecture Notes in Computer Science, vol 2724. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45110-2_99
Download citation
DOI: https://doi.org/10.1007/3-540-45110-2_99
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40603-7
Online ISBN: 978-3-540-45110-5
eBook Packages: Springer Book Archive