Abstract
Audio is available from various sources like recordings of meetings, newscast, telephonic conversations, etc. In this era of information technology, with the technological progress, more and more digital audio, video, and images are being captured and stored day by day. The amount of audio data is increasing exponentially on the web and other information storehouses. In order to efficiently use this huge multimedia data, there should be an effective search technique.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Gaikwad, B. M. G. P. Different indexing techniques.
Foote, J. (1999). An overview of audio information retrieval. Multimedia Systems, 7(1), 2–10.
Müller, M. (2015). Content-based audio retrieval. In Fundamentals of music processing (pp. 355–413). Cham: Springer.
Leavitt, N. (2002). Let’s hear it for audio mining. Computer, 35(10), 23–25.
Mand, M. K., & Nagpal, D. (2013). Gunjan, “An Analytical Approach for Mining Audio Signals”. International Journal of Advanced Research in Computer and Communication Engineering, 2(9).
Retrieved September 06, 2018, from http://www.rsystems.com/CommonResource/KnowledgeRepository/Deciphering-Voice-of-Customer-through-Speech-Analytics.pdf.
Logan, B., Goddeau, D., & Van Thong, J. M. (2005, March). Real-world audio indexing systems. In IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005. Proceedings. (ICASSP 2005) (Vol. 5, pp. v-1001). IEEE.
Cardillo, P. S., Clements, M., & Miller, M. S. (2002). Phonetic searching vs. LVCSR: How to find what you really want in audio archives. International Journal of Speech Technology, 5(1), 9–22.
Van Thong, J. M., Goddeau, D., Litvinova, A., Logan, B., Moreno, P., & Swain, M. (2000, April). Speechbot: A speech recognition based audio indexing system for the web. In Content-based multimedia information access-Volume 1 (pp. 106–115). The Centre de Hautes Etudes Internationales d’informatique Documentaire.
Van Thong, J. M., Moreno, P. J., Logan, B., Fidler, B., Maffey, K., & Moores, M. (2002). Speechbot: An experimental speech-based search engine for multimedia content on the web. IEEE Transactions on Multimedia, 4(1), 88–96.
Logan, B., Moreno, P., Thong, J. M. V., & Whittaker, E. (2000). An experimental study of an audio indexing system for the web. In Sixth International Conference on Spoken Language Processing.
Trieu, H. L., Nguyen, L. M., & Nguyen, P. T. (2016). Dealing with out-of-vocabulary problem in sentence alignment using word similarity. In Proceedings of the 30th Pacific Asia Conference on Language, Information and Computation: Oral Papers (pp. 259–266).
Yusnita, M. A., Paulraj, M. P., Yaacob, S., Bakar, S. A., Saidatul, A., & Abdullah, A. N. (2011, March). Phoneme-based or isolated-word modeling speech recognition system? An overview. In 2011 IEEE 7th International Colloquium on Signal Processing and its Applications (CSPA) (pp. 304–309). IEEE.
Dey, N., & Ashour, A. S. (2018). Challenges and future perspectives in speech-sources direction of arrival estimation and localization. In Direction of arrival estimation and localization of multi-speech sources (pp. 49–52). Cham: Springer.
Dey, N., & Ashour, A. S. (2018). Direction of arrival estimation and localization of multi-speech sources. Springer International Publishing.
Dey, N., & Ashour, A. S. (2018). Applied examples and applications of localization and tracking problem of multiple speech sources. In Direction of arrival estimation and localization of multi-speech sources (pp. 35–48). Cham: Springer.
Dey, N., & Ashour, A. S. (2018). Microphone array principles. In Direction of arrival estimation and localization of multi-speech sources (pp. 5–22). Cham: Springer.
Karaa, W. B. A., & Dey, N. (2017). Mining multimedia documents. CRC Press.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2019 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this chapter
Cite this chapter
Sen, S., Dutta, A., Dey, N. (2019). Audio Indexing. In: Audio Processing and Speech Recognition. SpringerBriefs in Applied Sciences and Technology(). Springer, Singapore. https://doi.org/10.1007/978-981-13-6098-5_1
Download citation
DOI: https://doi.org/10.1007/978-981-13-6098-5_1
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-6097-8
Online ISBN: 978-981-13-6098-5
eBook Packages: EngineeringEngineering (R0)