Abstract
A study on acoustic-phonetic features for the obstruent detection and classification based on the knowledge of Mandarin speech is proposed. Seneff auditory model is used as the front-end processor for extracting acoustic-phonetic features. These features are rich in their information content in a hierarchical decision process to detect and classify the Mandarin obstruents. The preliminary experiments showed that accuracy of obstruent detection is about 84%. An algorithm based on the information of feature distribution is applied to further classify the obstruents into stops, fricatives, and affricates. The average accuracy of obstruent classification is about 80%. The proposed approach based on the feature distribution is simple and effective. It could be a very promising method for improving the phone detection in continuous speech recognition.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Lee, C.-H.: From Knowledge- Ignorant to Knowledge-Rich Modeling: A New Speech Research Paradigm for Next Generation Automatic Speech Recognition. In: International Conference on Spoken Language Processing, ICSLP 2004, Plenary Session, Jeju, Korea (2004)
Stevens, K.N.: Toward a model for lexical access base on acoustic landmarks and distinctive features. J. Acoust. Soc. Am. 111(4), 1872–1891 (2002)
Seneff, S.: A Joint Synchrony/ Mean Rate Model of Auditory Speech Processing. J. Phonetics 16, 55–76 (1988)
Seneff, S.: A Computational Model for the Peripheral Auditory System: Application to Speech Recognition Research. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 1983–1986 (1986)
Abdelatty Ali, A.M.: Auditory-Based Speech Processing Based on the Average Localized Synchrony Detection. In: IEEE International Conference on Acoustics, Speech, and Signal Processing (2000)
Abdelatty Ali, A.M.: Segmentation and Categorization of Phonemes in Continuous Speech. Technical Report, TRCST25JUL98, Center for Sensor Technologies, University of Pennsylvania (1998)
Aversano, G.: A New Text-Independent Method for Phoneme Segmentation. In: IEEE International Conference on Circuit and System (2001)
Hongtao, H.: Temporal pre-classification for Chinese voiceless consonant speech. In: IEEE International Conference on Signal Processing (1996)
Abdelatty Ali, A.M.: An Acoustic-Phonetic Feature-Based System for the Automatic Recognition of Fricative Consonants. In: IEEE International Conference on Acoustics, Speech, and Signal Processing (1988)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sung, KT., Wang, HC. (2006). A Study of Knowledge-Based Features for Obstruent Detection and Classification in Continuous Mandarin Speech. In: Huo, Q., Ma, B., Chng, ES., Li, H. (eds) Chinese Spoken Language Processing. ISCSLP 2006. Lecture Notes in Computer Science(), vol 4274. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11939993_14
Download citation
DOI: https://doi.org/10.1007/11939993_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-49665-6
Online ISBN: 978-3-540-49666-3
eBook Packages: Computer ScienceComputer Science (R0)