Skip to main content

Audio Indexing

  • Chapter
  • First Online:

Part of the book series: SpringerBriefs in Applied Sciences and Technology ((BRIEFSINTELL))

Abstract

Audio is available from various sources like recordings of meetings, newscast, telephonic conversations, etc. In this era of information technology, with the technological progress, more and more digital audio, video, and images are being captured and stored day by day. The amount of audio data is increasing exponentially on the web and other information storehouses. In order to efficiently use this huge multimedia data, there should be an effective search technique.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Gaikwad, B. M. G. P. Different indexing techniques.

    Google Scholar 

  2. Foote, J. (1999). An overview of audio information retrieval. Multimedia Systems, 7(1), 2–10.

    Article  Google Scholar 

  3. Müller, M. (2015). Content-based audio retrieval. In Fundamentals of music processing (pp. 355–413). Cham: Springer.

    Chapter  Google Scholar 

  4. Leavitt, N. (2002). Let’s hear it for audio mining. Computer, 35(10), 23–25.

    Article  Google Scholar 

  5. Mand, M. K., & Nagpal, D. (2013). Gunjan, “An Analytical Approach for Mining Audio Signals”. International Journal of Advanced Research in Computer and Communication Engineering, 2(9).

    Google Scholar 

  6. Retrieved September 06, 2018, from http://www.rsystems.com/CommonResource/KnowledgeRepository/Deciphering-Voice-of-Customer-through-Speech-Analytics.pdf.

  7. Logan, B., Goddeau, D., & Van Thong, J. M. (2005, March). Real-world audio indexing systems. In IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005. Proceedings. (ICASSP 2005) (Vol. 5, pp. v-1001). IEEE.

    Google Scholar 

  8. Cardillo, P. S., Clements, M., & Miller, M. S. (2002). Phonetic searching vs. LVCSR: How to find what you really want in audio archives. International Journal of Speech Technology5(1), 9–22.

    Article  Google Scholar 

  9. Van Thong, J. M., Goddeau, D., Litvinova, A., Logan, B., Moreno, P., & Swain, M. (2000, April). Speechbot: A speech recognition based audio indexing system for the web. In Content-based multimedia information access-Volume 1 (pp. 106–115). The Centre de Hautes Etudes Internationales d’informatique Documentaire.

    Google Scholar 

  10. Van Thong, J. M., Moreno, P. J., Logan, B., Fidler, B., Maffey, K., & Moores, M. (2002). Speechbot: An experimental speech-based search engine for multimedia content on the web. IEEE Transactions on Multimedia, 4(1), 88–96.

    Article  Google Scholar 

  11. Logan, B., Moreno, P., Thong, J. M. V., & Whittaker, E. (2000). An experimental study of an audio indexing system for the web. In Sixth International Conference on Spoken Language Processing.

    Google Scholar 

  12. Trieu, H. L., Nguyen, L. M., & Nguyen, P. T. (2016). Dealing with out-of-vocabulary problem in sentence alignment using word similarity. In Proceedings of the 30th Pacific Asia Conference on Language, Information and Computation: Oral Papers (pp. 259–266).

    Google Scholar 

  13. Yusnita, M. A., Paulraj, M. P., Yaacob, S., Bakar, S. A., Saidatul, A., & Abdullah, A. N. (2011, March). Phoneme-based or isolated-word modeling speech recognition system? An overview. In 2011 IEEE 7th International Colloquium on Signal Processing and its Applications (CSPA) (pp. 304–309). IEEE.

    Google Scholar 

  14. Dey, N., & Ashour, A. S. (2018). Challenges and future perspectives in speech-sources direction of arrival estimation and localization. In Direction of arrival estimation and localization of multi-speech sources (pp. 49–52). Cham: Springer.

    Google Scholar 

  15. Dey, N., & Ashour, A. S. (2018). Direction of arrival estimation and localization of multi-speech sources. Springer International Publishing.

    Google Scholar 

  16. Dey, N., & Ashour, A. S. (2018). Applied examples and applications of localization and tracking problem of multiple speech sources. In Direction of arrival estimation and localization of multi-speech sources (pp. 35–48). Cham: Springer.

    Google Scholar 

  17. Dey, N., & Ashour, A. S. (2018). Microphone array principles. In Direction of arrival estimation and localization of multi-speech sources (pp. 5–22). Cham: Springer.

    Google Scholar 

  18. Karaa, W. B. A., & Dey, N. (2017). Mining multimedia documents. CRC Press.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2019 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Sen, S., Dutta, A., Dey, N. (2019). Audio Indexing. In: Audio Processing and Speech Recognition. SpringerBriefs in Applied Sciences and Technology(). Springer, Singapore. https://doi.org/10.1007/978-981-13-6098-5_1

Download citation

  • DOI: https://doi.org/10.1007/978-981-13-6098-5_1

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-13-6097-8

  • Online ISBN: 978-981-13-6098-5

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics