Statistical Analysis of the Prosodic Parameters of a Spontaneous Arabic Speech Corpus for Speech Synthesis

Hadj Ali, Ikbel; Mnasri, Zied

doi:10.1007/978-3-319-45925-7_5

Statistical Analysis of the Prosodic Parameters of a Spontaneous Arabic Speech Corpus for Speech Synthesis

Ikbel Hadj Ali¹⁵ &
Zied Mnasri¹⁵

Conference paper
First Online: 21 September 2016

505 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9918))

Abstract

In this paper, we present the analysis of a normalized and representative spontaneous Arabic speech corpus with labeling and annotation, in order to provide a complete library of voice segments and their respective prosodic parameters at different levels (phonemic or syllabic). A statistical analysis was conducted afterwards to determine and normalize the distribution of the collected data. The obtained results were then compared to those of a prepared Arabic speech corpus, in order to determine the characteristics of each kind of speech corpus and its suitable application area.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 34.99; Price excludes VAT (USA)

Softcover Book: USD 44.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Abdel-Hamid, O., Abdou, S.M., Rashwan, M.: Improving Arabic HMM based speech synthesis quality. In: INTERSPEECH (2006)
Google Scholar
Al-Ani, S.: Arabic phonology: an acoustical and a physilogical investigation. Walter de Gruyter (1970)
Google Scholar
Boersma, P., Weenink, D.: Praat: doing phonetics by computer (2010)
Google Scholar
Boudraa, M., Boudraa, B., Guerin, B.: Elaboration d’une base de données arabe phonétiquement équilibrée. In: Actes du colloque Langue Arabe et Technologies Informatiques Avancées, pp. 171–187 (1993)
Google Scholar
Campbell, W.N.: Predicting segmental durations for accommodation within a syllable-level timing framework. In: 3rd European Conference on Speech Communication and Technology (1993)
Google Scholar
Ghasemi, A., Zahediasl, S.: Normality tests for statistical analysis: a guide for non-statisticians. Int. J. Endocrinol. Metabol. 10(2), 486–489 (2012)
Article Google Scholar
Ladd, D.R.: Intonational Phonology. Cambrige University Press, Cambrige (1986)
Google Scholar
Markose, S., Alentorn, A.: The Generalized extreme value distribution and extreme economic value at risk (EE-VaR). In: Kontoghiorghes, E., Rustem, B., Winker, P. (eds.) Computational Methods in Financial Engineering. Springer, Berlin (2008)
Google Scholar
Mixdorff, H., Jokisch, O.: An integrated approach to modeling German prosody. Int. J. Speech Technol. 6(1), 45–55 (2003)
Article MATH Google Scholar
Mnasri, Z., Boukadida, F., Ellouze, N.: Design and development of a prosody generator for Arabic TTS systems. Int. J. Comput. Appl. 12(1), 24–31 (2010)
Google Scholar
Vainio, M., et al.: Artificial neural networks based prosody models for Finnish text-to-speech synthesis. Ph.D. thesis, Helsinky University of Technology (2001)
Google Scholar
Van Santen, J.: Assignement of segmental duration in text-to-speech synthesis. Comput. Speech Lang. 8(2), 95–128 (1994)
Article Google Scholar
Zen, H., Nose, T., Yamagishi, J., Sako, S., Masuko, T., Black, A.W., Tokuda, K.: The HMM-based speech synthesis system (HTS) version 2.0. In: SSW, pp. 294–299. Citeseer (2007)
Google Scholar

Download references

Author information

Authors and Affiliations

Signal, Image and Technology of Information Laboratory, Ecole Nationale d’Ingenieurs de Tunis-ENIT, University Tunis El-Manar, Tunis, Tunisia
Ikbel Hadj Ali & Zied Mnasri

Authors

Ikbel Hadj Ali
View author publications
You can also search for this author in PubMed Google Scholar
Zied Mnasri
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Zied Mnasri .

Editor information

Editors and Affiliations

University of West Bohemia , Plzen, Czech Republic
Pavel Král
Rovira i Virgili University , Tarragona, Spain
Carlos Martín-Vide

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hadj Ali, I., Mnasri, Z. (2016). Statistical Analysis of the Prosodic Parameters of a Spontaneous Arabic Speech Corpus for Speech Synthesis. In: Král, P., Martín-Vide, C. (eds) Statistical Language and Speech Processing. SLSP 2016. Lecture Notes in Computer Science(), vol 9918. Springer, Cham. https://doi.org/10.1007/978-3-319-45925-7_5

Download citation

DOI: https://doi.org/10.1007/978-3-319-45925-7_5
Published: 21 September 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-45924-0
Online ISBN: 978-3-319-45925-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics