Abstract
Content-based indexing of audio (and multimedia) data has become more important since conventional databases cannot provide the necessary efficiency and performance [1,2]. However, there are three main difficult problems. First, the content of audio data is subjective information; it is hard to give the descriptions in words. The recognition of data content requires prior knowledge and special techniques in Signal Processing and Pattern Recognition, which usually require long computing time. Second, since several audio features can be used as indices [3] (such as pitch, amplitude, and frequency), a method or processing technique designed and developed for one feature may not be appropriate for another. Third, the extremely large data size and the use of a similarity search require extensive computation. Similarity matching is based upon the computation of the distance between a query and each record in the database; the best match is in the data set with the smallest distances. To solve these three problems, we use a histogram-based feature model to represent subjective features[4], a unified model [5] to represent the data structures of the multimedia data, and a fast, generalized comparison algorithm to reduce the retrieval time.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
V. Gudivada and V. Raghavan, “Special issue on content-based image retrieval systems,” IEEE Computers, Vol. 28, No. 9, September 1995.
Z. Kemp, “Multimedia and spatial information systems,” IEEE Multimedia, Vol. 2, No. 4, 1995.
E. Wold et al., “Content-based classification, search and retrieval of audio data,” IEEE Multimedia, 1996.
P. Piamsa-nga, N. A. Alexandridis, S. Srakaew, G. Blankenship, G. Papakonstantinou, P. Tsanakas, and S. Tzafestas, “Multi-feature content based image retrieval,” in International Conference on Computer Graphics and imaging, 1998
P. Piamsa-nga, N. A. Alexandridis, G. Blankenship, G. Papakonstantinou, P. Tsanakas, and S. Tzafestas, “A unified k-tree model for multimedia retrieval,” in International Conference on Computers and their applications, Hawaii, March 1998.
S. R. Subramanya, P. Piamsa-nga, N. A. Alexandridis, and A. Youssef, “A Scheme for Content-Based Image Retrievals for Unrestricted Query Formats,” International Conference on Imaging Science, Systems and Technology (CISST′98), Las Vegas, July 1998
M. J. Swain and D. H. Ballard, “Color Indexing” International Journal of Computer Vision, 7:1, 1991.
J. R. Smith, “Integrated spatial and feature image systems: retrieval, analysis, and compression,” Ph.D. Thesis, Columbia University, 1997
Sunsite at University of North Carolina, “FTP archive,” available URL: http://sunsite.unc.edu/pub/multimedia/pub/multimedia
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer Science+Business Media Dordrecht
About this chapter
Cite this chapter
Piamsa-Nga, P. et al. (1999). Content-Based Audio Retrieval Using a Generalized Algorithm. In: Tzafestas, S.G. (eds) Advances in Intelligent Systems. International Series on Microprocessor-Based and Intelligent Systems Engineering, vol 21. Springer, Dordrecht. https://doi.org/10.1007/978-94-011-4840-5_21
Download citation
DOI: https://doi.org/10.1007/978-94-011-4840-5_21
Publisher Name: Springer, Dordrecht
Print ISBN: 978-1-4020-0393-6
Online ISBN: 978-94-011-4840-5
eBook Packages: Springer Book Archive