Automatic Speech Segmentation for Automatic Speech Translation

Kłosowski, Piotr; Dustor, Adam

doi:10.1007/978-3-642-38865-1_47

Piotr Kłosowski³ &
Adam Dustor³

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 370))

Included in the following conference series:

International Conference on Computer Networks

1555 Accesses
6 Citations

Abstract

The article presents selected, effective speech signal processing algorithms and their use in order to improve the automatic speech translation. Automatic speech translation uses natural language processing techniques implemented using algorithms of automatic speech recognition, speaker recognition, automatic text translation and text-to-speech synthesis. It is very possible to improve the process of automatic speech translation by using effective algorithms for automatic segmentation of speech signals based on speaker recognition and language recognition.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Dziwoki, G.: An analysis of the unsupervised phase correction method in quadrature amplitude modulation systems. Przeglad Elektrotechniczny 88(7a), 245–249 (2012)
Google Scholar
Izydorczyk, J., Izydorczyk, M.: Limits to microprocessor scaling. Computer 43(8), 20–26 (2010)
Article Google Scholar
Sułek, W.: Pipeline processing in low-density parity-check codes hardware decoder. Bulletin of the Polish Academy of Sciences Technical Sciences 59(2), 149–155 (2011)
Google Scholar
Zawadzki, P.: Security of ping-pong protocol based on pairs of completely entangled qudits. Quantum Information Processing 11(6), 1419–1430 (2012)
Article MATH Google Scholar
Kucharczyk, M.: Blind signatures in electronic voting systems. In: Kwiecień, A., Gaj, P., Stera, P. (eds.) CN 2010. CCIS, vol. 79, pp. 349–358. Springer, Heidelberg (2010)
Chapter Google Scholar
Dustor, A.: Speaker verification based on fuzzy classifier. In: Cyran, K.A., Kozielski, S., Peters, J.F., Stańczyk, U., Wakulicz-Deja, A. (eds.) Man-Machine Interactions. AISC, vol. 59, pp. 389–397. Springer, Heidelberg (2009)
Chapter Google Scholar
Kłosowski, P.: Speech processing application based on phonetics and phonology of the polish language. In: Kwiecień, A., Gaj, P., Stera, P. (eds.) CN 2010. CCIS, vol. 79, pp. 236–244. Springer, Heidelberg (2010)
Chapter Google Scholar
Kłosowski, P., Pułka, A.: Polish Semantic Speech Recognition Expert System Supporting Electronic Design System. In: Prooccedings of The International Conference on Human Systems Interactions, HSI 2008, Kraków, Poland. IEEE Eurographics Technical Report Series, pp. 479–484 (2008)
Google Scholar
Stuker, S., Herrmann, T., Kolss, M., Niehues, J., Wolfel, M.: Research Opportunities In Automatic Speech-To-Speech Translation. IEEE Potentials 31(3), 26–33 (2012)
Article Google Scholar
Koehn, P.: Statistical Machine Translation. Cambridge Univ. Press, Cambridge (2009)
Book Google Scholar
Waibel, A., Fügen, C.: Spoken language translation-enabling crosslingual human-human communication. IEEE Signal Processing Mag. 25(3), 70–79 (2008)
Article Google Scholar
Huang, X., Acero, A., Hon, H.W.: Spoken Language Processing. Prentice-Hall, Englewood Cliffs (2001)
Google Scholar
Gordon Jr., R.G.: Ethnologue, Languages of the World, 15th edn. SIL International, Dallas (2005)
Google Scholar
Janson, T.: Speak-A Short History of Languages. Oxford Univ. Press, London (2002)
Google Scholar
Hutchins, J.: International Association for Machine Translation compendium of translation software (2010), http://www.hutchinsweb.me.uk/Compendium.htm
A new framework strategy for multilingualism, Communication from the Commission to the Council, the European Parliament, the European Economic and Social Committee, and the Committee of the Regions. Commission of the European Communities (November 2005)
Google Scholar
Steinbiss, V.: Human language technologies for Europe. Work Comissioned by ITC-irst, Trento, Italy to Accipio Consulting, Aachen, Germany (April 2006)
Google Scholar
Rabiner, L.R., Juang, B.H.: Fundamentals of speech recognition. Prentice-Hall (1993)
Google Scholar
Reynolds, D.A., Rose, R.C.: Robust text-independent speaker identification using Gaussian mixture speaker models. IEEE Transactions on Speech and Audio Processing 3(1), 72–82 (1995)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Silesian University of Technology, Akademicka Str. 16, 44-100, Gliwice, Poland
Piotr Kłosowski & Adam Dustor

Authors

Piotr Kłosowski
View author publications
You can also search for this author in PubMed Google Scholar
Adam Dustor
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute of Informatics, Silesian University of Technology, ul. Akademicka 16, 41-100, Gliwice, Poland
Andrzej Kwiecień
Institute of Informatics, Silesian University of Technology, ul. Akademicka 16, 44-100, Gliwice, Poland
Piotr Gaj & Piotr Stera &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kłosowski, P., Dustor, A. (2013). Automatic Speech Segmentation for Automatic Speech Translation. In: Kwiecień, A., Gaj, P., Stera, P. (eds) Computer Networks. CN 2013. Communications in Computer and Information Science, vol 370. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38865-1_47

Download citation

DOI: https://doi.org/10.1007/978-3-642-38865-1_47
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38864-4
Online ISBN: 978-3-642-38865-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics