Abstract
As automatic speech recognition (ASR) technology has matured, particularly through statistical language models built from web-scale data and the use of the Hidden Markov Model probabilistic framework, speech recognition has become applicable to many domains and usage scenarios. One such task is Chinese postal address recognition. This paper presents the first attempt, in either academic or commercial settings, to create an ASR-based input method for postal address recognition in Mandarin Chinese. By customizing the statistical language model to this domain and incorporating the structural information provided by geo-topology, our language model captures signals from geographical context and self-corrects possible misrecognitions. Experimental results provide evidence that our speech-recognition-based approach yields a faster and more accurate input method than traditional keyboard-based input.
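The abstract does not spell out how geo-topological structure is used for self-correction. As an illustration only, and not the authors' implementation, the sketch below shows one way a geographic containment hierarchy could rescore a recognizer's n-best list so that geographically impossible addresses are penalized. The gazetteer `GEO_CHILDREN`, the functions `is_consistent` and `rescore`, the penalty value, and the scores are all hypothetical; place names are romanized for readability.

```python
# Hypothetical toy gazetteer: each region maps to the districts it contains.
GEO_CHILDREN = {
    "Beijing": {"Haidian", "Chaoyang"},
    "Shanghai": {"Pudong", "Xuhui"},
}


def is_consistent(path):
    """Return True if each element is a known child of the element before it."""
    return all(
        child in GEO_CHILDREN.get(parent, set())
        for parent, child in zip(path, path[1:])
    )


def rescore(nbest, penalty=5.0):
    """Return the best address path after penalizing impossible hypotheses.

    nbest: list of (path, log_score) pairs, where path is a tuple of address
    levels and log_score is the recognizer's score (higher is better).
    penalty: assumed fixed amount subtracted from inconsistent hypotheses.
    """
    def adjusted(item):
        path, score = item
        return score if is_consistent(path) else score - penalty

    return max(nbest, key=adjusted)[0]


# The top-scoring hypothesis places Xuhui in Beijing, which the gazetteer
# rules out, so the lower-scoring but consistent hypothesis is selected.
nbest = [
    (("Beijing", "Xuhui"), -1.0),
    (("Shanghai", "Xuhui"), -1.5),
]
best = rescore(nbest)
```

A fixed penalty is the simplest possible choice here; a domain-specific statistical language model, as the abstract describes, would instead fold this kind of geographic context into the probabilities it assigns during decoding.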
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Wei, L.F., Maosong, S. (2014). ASR-Based Input Method for Postal Address Recognition in Chinese Mandarin. In: Sun, M., Liu, Y., Zhao, J. (eds) Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data. NLP-NABD CCL 2014. Lecture Notes in Computer Science, vol 8801. Springer, Cham. https://doi.org/10.1007/978-3-319-12277-9_27
DOI: https://doi.org/10.1007/978-3-319-12277-9_27
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-12276-2
Online ISBN: 978-3-319-12277-9
eBook Packages: Computer Science (R0)