A System for the Conversion of Digital Gujarati Text-to-Speech for Visually Impaired People

Jariwala, Nikisha; Patel, Bankim

doi:10.1007/978-981-10-6626-9_8

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 664)

Abstract

In an era of rapid technological development, research on Text-to-Speech conversion has advanced remarkably over the last couple of decades. Visually impaired people are not able to read text, so a Text-to-Speech system serves as a reading aid by letting them hear it. In this paper, we present the development of a computer-based Gujarati Text-to-Speech system that delivers text in Gujarati audio form. Arbitrary digital Gujarati text is taken as input to the system; conversion is done with regard to the Akshara of the Gujarati language, and sound is produced in the form of phonemes, diphones, or syllables as required. A single audio file is created from the text so that it can also be heard at a later stage. The detailed algorithm, along with the format of the speech database, is also presented in the paper. The proposed system was tested on documents collected from online news Web sites, and it gives satisfactory results.
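
The pipeline outlined in the abstract (split the text into Akshara, look up a recorded unit for each, and write one audio file) can be illustrated with a minimal sketch. This is not the authors' algorithm, which appears only in the full paper. The simplified Akshara rule, the names `split_aksharas` and `synthesize`, the `speech_db` dictionary mapping each Akshara to raw 16-bit mono PCM bytes, and the 16 kHz sample rate are all assumptions made for illustration.

```python
import io
import re
import wave

# Character classes from the Gujarati Unicode block (U+0A80-U+0AFF).
CONSONANT = "[\u0A95-\u0AB9]"
VOWEL = "[\u0A85-\u0A94]"      # independent vowels
MATRA = "[\u0ABE-\u0ACC]"      # dependent vowel signs
MODIFIER = "[\u0A81-\u0A83]"   # candrabindu, anusvara, visarga
NUKTA = "\u0ABC"
VIRAMA = "\u0ACD"              # joins consonants into conjuncts

# Simplified rule (an assumption, not the paper's): an Akshara is either
# a consonant cluster with an optional vowel sign and modifiers, or an
# independent vowel with optional modifiers.
AKSHARA = re.compile(
    f"(?:{CONSONANT}{NUKTA}?{VIRAMA})*{CONSONANT}{NUKTA}?"
    f"(?:{MATRA}|{VIRAMA})?{MODIFIER}*"
    f"|{VOWEL}{MODIFIER}*"
)


def split_aksharas(text):
    """Return the Akshara-like units found in text, skipping other characters."""
    return AKSHARA.findall(text)


def synthesize(text, speech_db, sample_rate=16000):
    """Concatenate the PCM bytes of each unit into one mono 16-bit WAV.

    speech_db is a hypothetical dict: Akshara string -> raw PCM bytes.
    A unit missing from speech_db raises KeyError.
    """
    frames = b"".join(speech_db[unit] for unit in split_aksharas(text))
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as out:
        out.setnchannels(1)
        out.setsampwidth(2)
        out.setframerate(sample_rate)
        out.writeframes(frames)
    return buffer.getvalue()
```

Returning the WAV as bytes keeps the sketch free of file paths; writing those bytes to disk would give the single stored audio file the abstract mentions. A real system would also need the phoneme and diphone fallbacks the abstract refers to, which this sketch omits.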

Ms. Nikisha B. Jariwala, Ph.D. Scholar & Asst. Professor of Smt. Tanuben & Dr. Manubhai Trivedi College of Information Science, affiliated to Veer Narmad South Gujarat University, Surat, Gujarat, India.

Dr. Bankim Patel, Director, Shrimad Rajchandra Institute of Management & Computer Application, Uka Tarsadia University, Maliba Campus, Gujarat, India.


Acknowledgements

We would like to thank the trustee Shri. Anand Chokhavala, the Principal Mrs. Manisha Gajjar, and the recording studio In-charge Ms. Jyoti Jariwala of Ambaben Maganlal Andhjan Shala, Surat, for their cooperation in supporting the audio recording and providing the resources needed for this work. We would also like to thank Dr. Naren Burade for motivating us to work on Text-to-Speech conversion for visually impaired people.

Author information

Authors and Affiliations

Smt. Tanuben & Dr. Manubhai Trivedi College of Information Science, Surat, Gujarat, India
Nikisha Jariwala
Shrimad Rajchandra Institute of Management & Computer Application, Uka Tarsadia University, Surat, Gujarat, India
Bankim Patel

Corresponding author

Correspondence to Nikisha Jariwala.

Editor information

Editors and Affiliations

KIIT, Gurgaon, Haryana, India
S. S. Agrawal
Bhai Parmanand Institute of Business Studies, New Delhi, Delhi, India
Amita Devi
MCA Department, Bharati Vidyapeeth’s Institute of Computer Applications and Management (BVICAM), New Delhi, Delhi, India
Ritika Wason
Maharaja Surajmal Institute of Technology, GGSIP University, New Delhi, Delhi, India
Poonam Bansal

About this paper

Cite this paper

Jariwala, N., Patel, B. (2018). A System for the Conversion of Digital Gujarati Text-to-Speech for Visually Impaired People. In: Agrawal, S., Devi, A., Wason, R., Bansal, P. (eds) Speech and Language Processing for Human-Machine Communications. Advances in Intelligent Systems and Computing, vol 664. Springer, Singapore. https://doi.org/10.1007/978-981-10-6626-9_8

DOI: https://doi.org/10.1007/978-981-10-6626-9_8
Published: 16 November 2017
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-6625-2
Online ISBN: 978-981-10-6626-9
eBook Packages: Engineering, Engineering (R0)
