Skrybot – A System for Automatic Speech Recognition of Polish Language

Pawlaczyk, Lesław; Bosky, Paweł

doi:10.1007/978-3-642-00563-3_40

Skrybot – A System for Automatic Speech Recognition of Polish Language

Lesław Pawlaczyk⁴ &
Paweł Bosky

Conference paper

1019 Accesses
6 Citations

Part of the book series: Advances in Intelligent and Soft Computing ((AINSC,volume 59))

Abstract

In this article we present a system for clustering and indexing of automatically recognised radio and television news spoken in Polish language. The aim of the system is to quickly navigate and search for information which is not available in standard internet search engines. The system comprises of speech recognition, alignment and indexing module. The recognition part is trained using dozens of hours of transcribed audio and millions of words representing modern Polish language. The training audio and text is then converted into acoustic and language model, where we apply techniques such as Hidden Markov Models and statistical language processing. The audio is decoded and later submitted into indexing engine which extracts summary information about the spoken topic. The system presents a significant potential in many areas such as media monitoring, university lectures indexing, automated telephone centres and security enhancements.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 259.00; Price excludes VAT (USA)

Softcover Book: USD 329.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Clarkson, P.R., Rosenfeld, R.: Statistical language modeling using the CMU-Cambridge toolkit. In: Proceedings of the European Conference on Speech Communication and Technology (1997)
Google Scholar
Formey, G.D.: The Viterbi algorithm. Proceedings of the IEEE 61, 268–278 (1973)
Article Google Scholar
Jurafsky, D., Martin, J.H.: Machine translation. In: Ward, N., Jurafsky, D. (eds.) Speech and Language Processing. Prentice-Hall, Englewood Cliffs (2000)
Google Scholar
Lee, A., Kawahar, T., Shikano, K.: Julius – an open source real-time large vocabulary recognition engine. In: Proceedings of the European Conference on Speech Communication and Technology, pp. 1691–1694 (2001)
Google Scholar
Young, S., et al.: The HTK book (for HTK version 3.4). Cambridge University Engineering Department (2006)
Google Scholar

Download references

Author information

Authors and Affiliations

Insitute of Informatics, Silesian University of Technology, Akademicka 16, 44-100, Gliwice, Poland
Lesław Pawlaczyk

Authors

Lesław Pawlaczyk
View author publications
You can also search for this author in PubMed Google Scholar
Paweł Bosky
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Silesian University of Technology, Gliwice, Poland
Krzysztof A. Cyran , Stanisław Kozielski , Urszula Stańczyk & Alicja Wakulicz-Deja , , &
University of Manitoba, Winnipeg, Canada
James F. Peters

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Pawlaczyk, L., Bosky, P. (2009). Skrybot – A System for Automatic Speech Recognition of Polish Language. In: Cyran, K.A., Kozielski, S., Peters, J.F., Stańczyk, U., Wakulicz-Deja, A. (eds) Man-Machine Interactions. Advances in Intelligent and Soft Computing, vol 59. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-00563-3_40

Download citation

DOI: https://doi.org/10.1007/978-3-642-00563-3_40
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-00562-6
Online ISBN: 978-3-642-00563-3
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics