Skip to main content

Statistical Analysis of the Prosodic Parameters of a Spontaneous Arabic Speech Corpus for Speech Synthesis

  • Conference paper
  • First Online:
  • 505 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9918))

Abstract

In this paper, we present the analysis of a normalized and representative spontaneous Arabic speech corpus with labeling and annotation, in order to provide a complete library of voice segments and their respective prosodic parameters at different levels (phonemic or syllabic). A statistical analysis was conducted afterwards to determine and normalize the distribution of the collected data. The obtained results were then compared to those of a prepared Arabic speech corpus, in order to determine the characteristics of each kind of speech corpus and its suitable application area.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   34.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   44.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Abdel-Hamid, O., Abdou, S.M., Rashwan, M.: Improving Arabic HMM based speech synthesis quality. In: INTERSPEECH (2006)

    Google Scholar 

  2. Al-Ani, S.: Arabic phonology: an acoustical and a physilogical investigation. Walter de Gruyter (1970)

    Google Scholar 

  3. Boersma, P., Weenink, D.: Praat: doing phonetics by computer (2010)

    Google Scholar 

  4. Boudraa, M., Boudraa, B., Guerin, B.: Elaboration d’une base de données arabe phonétiquement équilibrée. In: Actes du colloque Langue Arabe et Technologies Informatiques Avancées, pp. 171–187 (1993)

    Google Scholar 

  5. Campbell, W.N.: Predicting segmental durations for accommodation within a syllable-level timing framework. In: 3rd European Conference on Speech Communication and Technology (1993)

    Google Scholar 

  6. Ghasemi, A., Zahediasl, S.: Normality tests for statistical analysis: a guide for non-statisticians. Int. J. Endocrinol. Metabol. 10(2), 486–489 (2012)

    Article  Google Scholar 

  7. Ladd, D.R.: Intonational Phonology. Cambrige University Press, Cambrige (1986)

    Google Scholar 

  8. Markose, S., Alentorn, A.: The Generalized extreme value distribution and extreme economic value at risk (EE-VaR). In: Kontoghiorghes, E., Rustem, B., Winker, P. (eds.) Computational Methods in Financial Engineering. Springer, Berlin (2008)

    Google Scholar 

  9. Mixdorff, H., Jokisch, O.: An integrated approach to modeling German prosody. Int. J. Speech Technol. 6(1), 45–55 (2003)

    Article  MATH  Google Scholar 

  10. Mnasri, Z., Boukadida, F., Ellouze, N.: Design and development of a prosody generator for Arabic TTS systems. Int. J. Comput. Appl. 12(1), 24–31 (2010)

    Google Scholar 

  11. Vainio, M., et al.: Artificial neural networks based prosody models for Finnish text-to-speech synthesis. Ph.D. thesis, Helsinky University of Technology (2001)

    Google Scholar 

  12. Van Santen, J.: Assignement of segmental duration in text-to-speech synthesis. Comput. Speech Lang. 8(2), 95–128 (1994)

    Article  Google Scholar 

  13. Zen, H., Nose, T., Yamagishi, J., Sako, S., Masuko, T., Black, A.W., Tokuda, K.: The HMM-based speech synthesis system (HTS) version 2.0. In: SSW, pp. 294–299. Citeseer (2007)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zied Mnasri .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing AG

About this paper

Cite this paper

Hadj Ali, I., Mnasri, Z. (2016). Statistical Analysis of the Prosodic Parameters of a Spontaneous Arabic Speech Corpus for Speech Synthesis. In: Král, P., Martín-Vide, C. (eds) Statistical Language and Speech Processing. SLSP 2016. Lecture Notes in Computer Science(), vol 9918. Springer, Cham. https://doi.org/10.1007/978-3-319-45925-7_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-45925-7_5

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-45924-0

  • Online ISBN: 978-3-319-45925-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics