Skip to main content

Myanmar Number Normalization for Text-to-Speech

  • Conference paper
  • First Online:
  • 866 Accesses

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 781))

Abstract

Text Normalization is an essential module for Text-to-Speech (TTS) system as TTS systems need to work on real text. This paper describes Myanmar number normalization designed for Myanmar Text-to-Speech system. Semiotic classes for Myanmar language are identified by the study of Myanmar text corpus and Weighted Finite State Transducers (WFST) based Myanmar number normalization is implemented. Number suffixes and prefixes are also applied for token classification and finally, post-processing has been done for tokens that cannot be classified. This approach achieves average tag accuracy of 93.5% for classification phase and average Word Error Rate (WER) 0.95% for overall performance which is 5.65% lower than rule-based system. The results show that this approach can be used in Myanmar TTS system, and to our knowledge, this is the first published work of Myanmar number normalization system designed for Myanmar TTS system.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    http://www2.nict.go.jp/astrec-att/member/mutiyama/ALT/index.html.

  2. 2.

    http://www.openfst.org/twiki/bin/view/GRM/Thrax.

References

  1. Taylor, P.: Text-to-Speech Synthesis. Cambridge University Press, Cambridge (2009)

    Book  Google Scholar 

  2. Sproat, R., Black, A.W., Chen, S., Kumar, S., Ostendorf, M., Richards, C.: Normalization of non-standard words. Comput. Speech Lang. 15(3), 287–333 (2001)

    Article  Google Scholar 

  3. Ebden, P., Sproat, R.: The kestrel TTS text normalization system. Nat. Lang. Eng. 21(03), 333–353 (2015)

    Article  Google Scholar 

  4. Thu, Y.K., Pa, W.P., Ni, J., Shiga, Y., Finch, A., Hori, C., Kawai, H., Sumita, E.: Hmm based myanmar text to speech system. In: Sixteenth Annual Conference of the International Speech Communication Association (2015)

    Google Scholar 

  5. Beliga, S., Martinčić-Ipšić, S.: Text normalization for croatian speech synthesis. In: MIPRO, 2011 Proceedings of the 34th International Convention, pp. 1664–1669. IEEE (2011)

    Google Scholar 

  6. Alam, F., Habib, S., Khan, M.: Text normalization system for bangla. Technical report, BRAC University (2008)

    Google Scholar 

  7. Zhou, T., Dong, Y., Huang, D., Liu, W., Wang, H.: A three-stage text normalization strategy for mandarin text-to-speech systems. In: 6th International Symposium on Chinese Spoken Language Processing, ISCSLP 2008, pp. 1–4. IEEE (2008)

    Google Scholar 

  8. Panchapagesan, K., Talukdar, P.P., Krishna, N.S., Bali, K., Ramakrishnan, A.: Hindi text normalization. In: Fifth International Conference on Knowledge Based Computer Systems (KBCS), pp. 19–22. Citeseer (2004)

    Google Scholar 

  9. Sproat, R.: Lightly supervised learning of text normalization: Russian number names. In: 2010 IEEE Spoken Language Technology Workshop (SLT), pp. 436–441. IEEE (2010)

    Google Scholar 

  10. Nguyen, T.T.T., Pham, T.T., Tran, D.D.: A method for vietnamese text normalization to improve the quality of speech synthesis. In: Proceedings of the 2010 Symposium on Information and Communication Technology, pp. 78–85. ACM (2010)

    Google Scholar 

  11. Sproat, R., Jaitly, N.: RNN approaches to text normalization: a challenge. arXiv preprint arXiv:1611.00068 (2016)

  12. Riza, H., Purwoadi, M., Gunarso, Uliniansyah, T., et al.: Introduction of the asian language treebank. Oriental COCOSDA (2016)

    Google Scholar 

  13. Roark, B., Sproat, R., Allauzen, C., Riley, M., Sorensen, J., Tai, T.: The opengrm open-source finite-state grammar software libraries. In: Proceedings of the ACL 2012 System Demonstrations, pp. 61–66. Association for Computational Linguistics (2012)

    Google Scholar 

  14. Sproat, R.: Multilingual text analysis for text-to-speech synthesis. In: Proceedings of the Fourth International Conference on Spoken Language, ICSLP 1996, vol. 3, pp. 1365–1368. IEEE (1996)

    Google Scholar 

Download references

Acknowledgements

This work is partly supported by the ASEAN IVO project “Open Collaboration for Developing and Using Asian Language Treebank”.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Aye Mya Hlaing .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Hlaing, A.M., Pa, W.P., Thu, Y.K. (2018). Myanmar Number Normalization for Text-to-Speech. In: Hasida, K., Pa, W. (eds) Computational Linguistics. PACLING 2017. Communications in Computer and Information Science, vol 781. Springer, Singapore. https://doi.org/10.1007/978-981-10-8438-6_21

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-8438-6_21

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-8437-9

  • Online ISBN: 978-981-10-8438-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics