Myanmar Number Normalization for Text-to-Speech

Hlaing, Aye Mya; Pa, Win Pa; Thu, Ye Kyaw

doi:10.1007/978-981-10-8438-6_21

Myanmar Number Normalization for Text-to-Speech

Aye Mya Hlaing¹¹,
Win Pa Pa¹¹ &
Ye Kyaw Thu¹²

Conference paper
First Online: 04 March 2018

866 Accesses

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 781))

Abstract

Text Normalization is an essential module for Text-to-Speech (TTS) system as TTS systems need to work on real text. This paper describes Myanmar number normalization designed for Myanmar Text-to-Speech system. Semiotic classes for Myanmar language are identified by the study of Myanmar text corpus and Weighted Finite State Transducers (WFST) based Myanmar number normalization is implemented. Number suffixes and prefixes are also applied for token classification and finally, post-processing has been done for tokens that cannot be classified. This approach achieves average tag accuracy of 93.5% for classification phase and average Word Error Rate (WER) 0.95% for overall performance which is 5.65% lower than rule-based system. The results show that this approach can be used in Myanmar TTS system, and to our knowledge, this is the first published work of Myanmar number normalization system designed for Myanmar TTS system.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

References

Taylor, P.: Text-to-Speech Synthesis. Cambridge University Press, Cambridge (2009)
Book Google Scholar
Sproat, R., Black, A.W., Chen, S., Kumar, S., Ostendorf, M., Richards, C.: Normalization of non-standard words. Comput. Speech Lang. 15(3), 287–333 (2001)
Article Google Scholar
Ebden, P., Sproat, R.: The kestrel TTS text normalization system. Nat. Lang. Eng. 21(03), 333–353 (2015)
Article Google Scholar
Thu, Y.K., Pa, W.P., Ni, J., Shiga, Y., Finch, A., Hori, C., Kawai, H., Sumita, E.: Hmm based myanmar text to speech system. In: Sixteenth Annual Conference of the International Speech Communication Association (2015)
Google Scholar
Beliga, S., Martinčić-Ipšić, S.: Text normalization for croatian speech synthesis. In: MIPRO, 2011 Proceedings of the 34th International Convention, pp. 1664–1669. IEEE (2011)
Google Scholar
Alam, F., Habib, S., Khan, M.: Text normalization system for bangla. Technical report, BRAC University (2008)
Google Scholar
Zhou, T., Dong, Y., Huang, D., Liu, W., Wang, H.: A three-stage text normalization strategy for mandarin text-to-speech systems. In: 6th International Symposium on Chinese Spoken Language Processing, ISCSLP 2008, pp. 1–4. IEEE (2008)
Google Scholar
Panchapagesan, K., Talukdar, P.P., Krishna, N.S., Bali, K., Ramakrishnan, A.: Hindi text normalization. In: Fifth International Conference on Knowledge Based Computer Systems (KBCS), pp. 19–22. Citeseer (2004)
Google Scholar
Sproat, R.: Lightly supervised learning of text normalization: Russian number names. In: 2010 IEEE Spoken Language Technology Workshop (SLT), pp. 436–441. IEEE (2010)
Google Scholar
Nguyen, T.T.T., Pham, T.T., Tran, D.D.: A method for vietnamese text normalization to improve the quality of speech synthesis. In: Proceedings of the 2010 Symposium on Information and Communication Technology, pp. 78–85. ACM (2010)
Google Scholar
Sproat, R., Jaitly, N.: RNN approaches to text normalization: a challenge. arXiv preprint arXiv:1611.00068 (2016)
Riza, H., Purwoadi, M., Gunarso, Uliniansyah, T., et al.: Introduction of the asian language treebank. Oriental COCOSDA (2016)
Google Scholar
Roark, B., Sproat, R., Allauzen, C., Riley, M., Sorensen, J., Tai, T.: The opengrm open-source finite-state grammar software libraries. In: Proceedings of the ACL 2012 System Demonstrations, pp. 61–66. Association for Computational Linguistics (2012)
Google Scholar
Sproat, R.: Multilingual text analysis for text-to-speech synthesis. In: Proceedings of the Fourth International Conference on Spoken Language, ICSLP 1996, vol. 3, pp. 1365–1368. IEEE (1996)
Google Scholar

Download references

Acknowledgements

This work is partly supported by the ASEAN IVO project “Open Collaboration for Developing and Using Asian Language Treebank”.

Author information

Authors and Affiliations

Natural Language Processing Lab, UCSY, Yangon, Myanmar
Aye Mya Hlaing & Win Pa Pa
Artificial Intelligence Lab, Okayama Prefectural University, Okayama, Japan
Ye Kyaw Thu

Authors

Aye Mya Hlaing
View author publications
You can also search for this author in PubMed Google Scholar
Win Pa Pa
View author publications
You can also search for this author in PubMed Google Scholar
Ye Kyaw Thu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Aye Mya Hlaing .

Editor information

Editors and Affiliations

Graduate School of Information Science and Technology, The University of Tokyo, Tokyo, Japan
Kôiti Hasida
Natural Language Processing Lab, University of Computer Studies, Yangon, Yangon, Myanmar
Win Pa Pa

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hlaing, A.M., Pa, W.P., Thu, Y.K. (2018). Myanmar Number Normalization for Text-to-Speech. In: Hasida, K., Pa, W. (eds) Computational Linguistics. PACLING 2017. Communications in Computer and Information Science, vol 781. Springer, Singapore. https://doi.org/10.1007/978-981-10-8438-6_21

Download citation

DOI: https://doi.org/10.1007/978-981-10-8438-6_21
Published: 04 March 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-8437-9
Online ISBN: 978-981-10-8438-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics