Skip to main content

English Letter-Phoneme Conversion by Stochastic Transducers

  • Chapter

Part of the book series: Telecommunications Technology & Applications Series ((TTAP))

Abstract

This chapter describes the use of stochastic transducers to model and to perform the conversion of English word spellings to phonemic equivalents. Generic word structures can be described by a simple regular grammar which usually overgenerates, producing many candidate translations. The ‘best’ candidate is selected based on the maximum likelihood criterion and the stochastic translation is assumed to be a Markov chain. The initial grammar allows any input string to translate to any output string. A set of example translations is used to refine this grammar to a more specific form: the Kleene closure of letter-phoneme correspondences. These correspondences were inferred by segmenting the maximum likelihood alignment of example translations when a special type of transducer movement is found, or when a segmentation point is found in one of the two (orthographic or phonemic) domains. For efficient translation, the transducers were implemented as stochastic generalised sequential Moore machines so that useless intermediate states in translation and Markov probabilities can be eliminated and reduced, respectively. The current translation accuracy on a per symbol basis is 93.7% on a representative 1667 word test set selected from the Oxford Advanced Learners Dictionary.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2001 Springer Science+Business Media Dordrecht

About this chapter

Cite this chapter

Luk, R.W.P., Damper, R.I. (2001). English Letter-Phoneme Conversion by Stochastic Transducers. In: Damper, R.I. (eds) Data-Driven Techniques in Speech Synthesis. Telecommunications Technology & Applications Series. Springer, Boston, MA. https://doi.org/10.1007/978-1-4757-3413-3_5

Download citation

  • DOI: https://doi.org/10.1007/978-1-4757-3413-3_5

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-1-4419-4733-8

  • Online ISBN: 978-1-4757-3413-3

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics