Audio Indexing

Sen, Soumya; Dutta, Anjan; Dey, Nilanjan

doi:10.1007/978-981-13-6098-5_1

Chapter
First Online: 31 January 2019

Part of the book series: SpringerBriefs in Applied Sciences and Technology (BRIEFSINTELL)

Abstract

Audio is available from various sources, such as recordings of meetings, newscasts, and telephone conversations. As technology progresses, more and more digital audio, video, and images are captured and stored every day. The amount of audio data on the web and in other information repositories is increasing exponentially. Using this huge volume of multimedia data efficiently requires an effective search technique.

Author information

Authors and Affiliations

A.K. Choudhury School of Information Technology, University of Calcutta, Kolkata, West Bengal, India
Soumya Sen
Department of Information Technology, Techno India College of Technology, Kolkata, West Bengal, India
Anjan Dutta
Department of Information Technology, Techno India College of Technology, Kolkata, West Bengal, India
Nilanjan Dey

About this chapter

Cite this chapter

Sen, S., Dutta, A., Dey, N. (2019). Audio Indexing. In: Audio Processing and Speech Recognition. SpringerBriefs in Applied Sciences and Technology. Springer, Singapore. https://doi.org/10.1007/978-981-13-6098-5_1

DOI: https://doi.org/10.1007/978-981-13-6098-5_1
Published: 31 January 2019
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-6097-8
Online ISBN: 978-981-13-6098-5
eBook Packages: Engineering, Engineering (R0)