Abstract
Text-to-speech synthesis, long confined to small industrial applications, is now opening up to many new areas of general interest, especially since the tremendous upsurge of multimedia applications. Endowing speech synthesis with the kind of quality that makes it acceptable and attractive to the general public is thus one of the major challenges of today’s speech technology research. A second challenge is the ability to process unrestricted running text, regardless of its length or content.
Copyright information
© 2000 Springer Science+Business Media Dordrecht
About this chapter
Cite this chapter
di Cristo, A., di Cristo, P., Campione, E., Véronis, J. (2000). A Prosodic Model for Text-to-speech Synthesis in French. In: Botinis, A. (eds) Intonation. Text, Speech and Language Technology, vol 15. Springer, Dordrecht. https://doi.org/10.1007/978-94-011-4317-2_14
DOI: https://doi.org/10.1007/978-94-011-4317-2_14
Publisher Name: Springer, Dordrecht
Print ISBN: 978-0-7923-6723-9
Online ISBN: 978-94-011-4317-2
eBook Packages: Springer Book Archive