Performance rules in a text-to-speech system

Carlson, R.; Granström, B.

doi:10.1007/978-1-349-12670-5_11

Part of the book series: Wenner-Gren Center International Symposium Series

Abstract

Speech synthesis, as a research tool, has been instrumental in increasing the understanding of the speech communication process. With the help of rules in our text-to-speech system we are able to model the transformation of symbols to sounds (Carlson et al., 1990). This approach has been used for both speech and music, and there are many interesting parallels that we discussed in an earlier paper (Carlson et al., 1989). In this contribution, we will refer to some experiments on speech, designed to better understand the rules of speech performance.

References

Bruce, G. and Granström, B. (1989): “Modelling Swedish intonation in a text-to-speech system,” in Proc. of Fonetik-89, STL-QPSR 1/1989, pp. 17–21.
Bruce, G. and Granström, B. (1990): “Modelling Swedish prosody in text-to-speech: phrasing,” to appear in Nordic Prosody V, Åbo University, Finland.
Carlson, R. and Granström, B. (1986): “A search for durational rules in a real-speech data base,” Phonetica 43, pp. 140–154.
Carlson, R. and Granström, B. (1989): “Modelling duration for different text materials,” Proc. ‘Eurospeech 89’, European Conf. on Speech Comm. and Technology, Paris, September 26–28, Vol. 2, pp. 328–331.
Carlson, R. and Granström, B. (1975): “Perception of segment duration,” in (A. Cohen and S. Nooteboom, eds.) Structure and Process in Speech Perception, Springer Verlag, Heidelberg.
Carlson, R., Friberg, A., Frydén, L., Granström, B. and Sundberg, J. (1989): “Speech and music performance,” Contemporary Music Review 4, pp. 389–402; also translated into French: “La parole et l’exécution de la musique: parallèles et contrastes,” to be publ. in Proc. of the Symposium La Musique et les Sciences cognitives.
Carlson, R., Granström, B. and Hunnicutt, S. (1990): “Multilingual text-to-speech development and applications,” in (A.W. Ainsworth, ed.) Advances in Speech, Hearing and Language Processing, JAI Press, London.
Carlson, R., Granström, B. and Klatt, D.H. (1979): “Some notes on the perception of temporal patterns in speech,” in (B. Lindblom and S. Öhman, eds.) Frontiers in Speech Communication Research, Academic Press, New York.
Fant, G. and Kruckenberg, A. (1988): “Some durational correlates of Swedish prosody,” pp. 495–502 in Proc. SPEECH ’88, Book 2 (7th FASE Symposium), Institute of Acoustics, Edinburgh.
Fant, G., Nord, L. and Kruckenberg, A. (1986): “Individual variations in text reading. A data-bank pilot study,” STL-QPSR 4/1986, pp. 1–17.
Fujisaki, H., Nakamura, K. and Imoto, T. (1975): “Auditory perception of duration of speech and non-speech stimuli,” in (G. Fant and M. Tatham, eds.) Auditory Analysis and Perception of Speech, pp. 197–200, Academic Press, New York.
Klatt, D.H. (1976): “Linguistic uses of segmental duration in English: Acoustic and perceptual evidence,” J. Acoust. Soc. Am. 59, pp. 1208–1221.
Klatt, D.H. (1979): “Synthesis by rule of segmental durations in English sentences,” in (B. Lindblom and S. Öhman, eds.) Frontiers in Speech Communication Research, Academic Press, New York.
Lehiste, I. (1987): “Phonetic manifestations of linguistic hierarchies,” Proc. Symposium on Language Universals, Tallinn.
Nord, L. (1988): “Acoustic-phonetic studies in a Swedish speech databank,” pp. 225–231 in Proc. SPEECH ’88, Book 3 (7th FASE Symposium), Institute of Acoustics, Edinburgh.

Editors and Affiliations

Johan Sundberg, Lennart Nord & Rolf Carlson
Dept. of Speech Comm & Music Acoustics, Royal Inst. of Technology, 70014, S-100 44, Stockholm, Sweden

Carlson, R., Granström, B. (1991). Performance rules in a text-to-speech system. In: Sundberg, J., Nord, L., Carlson, R. (eds) Music, Language, Speech and Brain. Wenner-Gren Center International Symposium Series. Palgrave, London. https://doi.org/10.1007/978-1-349-12670-5_11

DOI: https://doi.org/10.1007/978-1-349-12670-5_11
Publisher Name: Palgrave, London
Print ISBN: 978-1-349-12672-9
Online ISBN: 978-1-349-12670-5