Duration Study for the Bell Laboratories Mandarin Text-to-Speech System

  • Chilin Shih
  • Benjamin Ao


We present in this chapter the methodology and results of a duration study designed for the Mandarin Chinese text-to-speech system of Bell Laboratories. A greedy algorithm is used to select text from on-line corpora to maximize the coverage of factors that are important to the study of duration. The duration model and some interesting results are discussed.
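The greedy selection mentioned above can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes each candidate sentence has already been reduced to the set of factor combinations it contains (for example, a segment identity paired with a tone and a position in the phrase), and it repeatedly picks the sentence that covers the most combinations not yet covered. The function name `greedy_select`, the tuple encoding of a factor combination, and the toy corpus are all hypothetical.

```python
def greedy_select(sentences, max_sentences=None):
    """Greedily choose sentences to cover as many factor combinations as possible.

    sentences: dict mapping a sentence id to the set of factor combinations
               found in that sentence (hypothetical encoding, see lead-in).
    Returns (chosen ids in selection order, combinations left uncovered).
    """
    # Everything that appears anywhere in the candidate pool needs covering.
    uncovered = set().union(*sentences.values())
    remaining = dict(sentences)
    chosen = []

    while uncovered and remaining:
        if max_sentences is not None and len(chosen) >= max_sentences:
            break
        # Pick the sentence adding the most new combinations;
        # ties go to the earliest candidate in insertion order.
        best = max(remaining, key=lambda sid: len(remaining[sid] & uncovered))
        gain = remaining[best] & uncovered
        if not gain:
            break  # no remaining sentence adds coverage
        chosen.append(best)
        uncovered -= gain
        del remaining[best]

    return chosen, uncovered


# Toy corpus: each tuple is (segment, tone, position), purely illustrative.
corpus = {
    "s1": {("a", "tone1", "final"), ("m", "-", "initial")},
    "s2": {("a", "tone1", "final"), ("i", "tone4", "medial"), ("sh", "-", "initial")},
    "s3": {("m", "-", "initial")},
}
selected, missing = greedy_select(corpus)
```

In this sketch every factor combination counts equally; a selection procedure could instead weight rare combinations more heavily, which is a design choice the abstract does not specify.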


Keywords: Greedy Algorithm, Bell Laboratory, Closure Duration, Vowel Duration, Syllable Type



Copyright information

© Springer Science+Business Media New York 1997
