Approaches for the Detection of the Keywords in Spoken Documents Application for the Field of E-Libraries

  • Conference paper
Neural Information Processing (ICONIP 2012)

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 7666)

Abstract

Automatic indexing of multimedia documents spans several application tasks, including spoken-word search, keyword detection, and audio information retrieval. Despite the progress made in speech indexing, much remains to be done, particularly for keyword search in spontaneous speech. Although spoken-word search and audio retrieval have been well studied, significant limitations remain, especially with respect to the resources available on the web today.

The goal of this paper is to propose a document management approach based on multimedia indexing techniques for detecting speech and keywords. We present the various indexing methods together with the keyword detection techniques. These methods fall into three principal approaches to speech indexing: keyword detection, keyword detection on the phonetic stream (PSPL, CN, ...), and indexing based on large-vocabulary recognition (LVR). We then present the proposed procedure for an approach based on the combination of two techniques (PSPL, S-PSPL and CN), as well as on the LVR technique.
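As an illustration only, the idea behind position-based soft indexing such as PSPL can be sketched as follows. This is a toy simplification and not the authors' system: the data in `LATTICES`, the function names, and the scoring rule (sum over start positions of the product of posteriors at consecutive positions) are all assumptions made for the example.

```python
from collections import defaultdict

# Hypothetical toy data: each spoken document is a list of position bins,
# and each bin maps candidate words to their posterior probabilities.
LATTICES = {
    "lecture_01": [
        {"digital": 0.9, "visual": 0.1},
        {"library": 0.7, "liberty": 0.3},
        {"catalog": 0.6, "catalogue": 0.4},
    ],
    "lecture_02": [
        {"public": 0.8, "republic": 0.2},
        {"liberty": 0.6, "library": 0.4},
    ],
}


def build_index(lattices):
    """Invert the lattices into word -> doc_id -> {position: posterior}."""
    index = defaultdict(lambda: defaultdict(dict))
    for doc_id, bins in lattices.items():
        for position, candidates in enumerate(bins):
            for word, posterior in candidates.items():
                index[word][doc_id][position] = posterior
    return index


def phrase_score(index, words, doc_id):
    """Score a keyword or phrase in one document.

    Simplified rule (an assumption of this sketch): for every position where
    the first word may start, multiply the posteriors of the query words at
    consecutive positions, then sum over all start positions.
    """
    starts = index.get(words[0], {}).get(doc_id, {})
    total = 0.0
    for start in starts:
        prob = 1.0
        for offset, word in enumerate(words):
            prob *= index.get(word, {}).get(doc_id, {}).get(start + offset, 0.0)
        total += prob
    return total


def search(index, words, threshold=0.5):
    """Return (doc_id, score) pairs at or above the threshold, best first."""
    doc_ids = set(index.get(words[0], {}))
    hits = [(doc_id, phrase_score(index, words, doc_id)) for doc_id in doc_ids]
    return sorted((h for h in hits if h[1] >= threshold), key=lambda h: -h[1])


if __name__ == "__main__":
    index = build_index(LATTICES)
    print(search(index, ["digital", "library"]))
    print(search(index, ["library"]))
```

With this toy data, the phrase "digital library" can only match `lecture_01`, because "digital" never appears as a candidate in `lecture_02`. Keeping the competing hypotheses ("library" versus "liberty") with their posteriors, rather than only the single best transcript, is what lets a keyword be found even when the recognizer's top choice is wrong.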

This indexing and information retrieval approach is currently being validated for the field of e-libraries.

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Issam, B., Ridda, L.M. (2012). Approaches for the Detection of the Keywords in Spoken Documents Application for the Field of E-Libraries. In: Huang, T., Zeng, Z., Li, C., Leung, C.S. (eds) Neural Information Processing. ICONIP 2012. Lecture Notes in Computer Science, vol 7666. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34478-7_25

  • DOI: https://doi.org/10.1007/978-3-642-34478-7_25

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-34477-0

  • Online ISBN: 978-3-642-34478-7

  • eBook Packages: Computer Science, Computer Science (R0)
