Performance rules in a text-to-speech system

Abstract

Speech synthesis, as a research tool, has been instrumental in increasing the understanding of the speech communication process. With the help of rules in our text-to-speech system, we are able to model the transformation of symbols to sounds (Carlson et al., 1990). This approach has been used for both speech and music, and there are many interesting parallels, which we discussed in an earlier paper (Carlson et al., 1989). In this contribution, we refer to some experiments on speech designed to better understand the rules of speech performance.

References

  • Bruce, G. and Granström, B. (1989): “Modelling Swedish intonation in a text-to-speech system”, in Proc. of Fonetik-89, STL-QPSR 1/1989, pp. 17–21.

  • Bruce, G. and Granström, B. (1990): “Modelling Swedish prosody in text-to-speech: phrasing”, to appear in Nordic Prosody V, Åbo University, Finland.

  • Carlson, R. and Granström, B. (1986): “A search for durational rules in a real-speech data base,” Phonetica 43, pp. 140–154.

  • Carlson, R. and Granström, B. (1989): “Modelling duration for different text materials”, Proc. ‘Eurospeech 89’, European Conf. on Speech Comm. and Technology, Paris, September 26–28, Vol. 2, pp. 328–331.

  • Carlson, R. and Granström, B. (1975): “Perception of segment duration”, in Structure and Process in speech perception, ed. A. Cohen and S. Nooteboom, Springer Verlag, Heidelberg.

  • Carlson, R., Friberg, A., Frydén, L., Granström, B. and Sundberg, J. (1989): “Speech and music performance,” Contemporary Music Review 4, pp. 389–402; also translated into French: “La parole et l’exécution de la musique: parallèles et contrastes,” to be publ. in Proc. of the Symposium La Musique et les Sciences cognitives.

  • Carlson, R., Granström, B. and Hunnicutt, S. (1990): “Multilingual text-to-speech development and applications”, in W.A. Ainsworth (ed.), Advances in speech, hearing and language processing, JAI Press, London.

  • Carlson, R., Granström, B., and Klatt, D.H. (1979): “Some notes on the perception of temporal patterns in speech”, in (B. Lindblom and S. Öhman, eds.) Frontiers in Speech Communication Research, Academic Press, New York.

  • Fant, G. and Kruckenberg, A. (1988): “Some durational correlates of Swedish prosody,” pp. 495–502 in Proc. SPEECH ‘88, Book 2 (7th FASE-Symposium), Institute of Acoustics, Edinburgh.

  • Fant, G., Nord, L., and Kruckenberg, A. (1986): “Individual variations in text reading. A data-bank pilot study”, STL-QPSR 4/1986, pp. 1–17.

  • Fujisaki, H., Nakamura, K. and Imoto, T. (1975): “Auditory perception of duration of speech and non-speech stimuli”, in Auditory analysis and perception of speech, ed G. Fant and M. Tatham, pp. 197–200, Academic Press, New York.

  • Klatt, D.H. (1976): “Linguistic uses of segmental duration in English: Acoustic and perceptual evidence,” J. Acoust. Soc. Am. 59, pp. 1208–1221.

  • Klatt, D.H. (1979): “Synthesis by rule of segmental durations in English sentences,” in (B. Lindblom and S. Öhman, eds.) Frontiers in Speech Communication Research, Academic Press, New York.

  • Lehiste, I. (1987): “Phonetic manifestations of linguistic hierarchies”, Proc. Symposium on Language Universals, Tallinn.

  • Nord, L. (1988): “Acoustic-phonetic studies in a Swedish speech databank”, pp. 225–231 in Proc. Speech ‘88, Book 3, (7th FASE symposium), Institute of Acoustics, Edinburgh.

Copyright information

© 1991 The Wenner-Gren Center

About this chapter

Cite this chapter

Carlson, R., Granström, B. (1991). Performance rules in a text-to-speech system. In: Sundberg, J., Nord, L., Carlson, R. (eds) Music, Language, Speech and Brain. Wenner-Gren Center International Symposium Series. Palgrave, London. https://doi.org/10.1007/978-1-349-12670-5_11

  • DOI: https://doi.org/10.1007/978-1-349-12670-5_11

  • Publisher Name: Palgrave, London

  • Print ISBN: 978-1-349-12672-9

  • Online ISBN: 978-1-349-12670-5

  • eBook Packages: Engineering (R0)
