Tone Recognition of Isolated Mandarin Syllables

  • Zhaoqiang Xie
  • Zhenjiang Miao
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6134)


Mandarin is tonal language. For Mandarin, tone identification is very important for speech recognition and pronunciation evaluation. Mandarin tone behavior varies greatly from speaker to speaker and it presents the greatest challenge to any speaker-independent tone recognition system. This paper presents a speaker normalizing algorithm which is designed to reduce this influence. Then a basic neural network tone recognizer using recognition features extracted from the processing syllable is introduced. The system employs an improved pitch detector and a powerful tone identification method. Finally, the experimental results show that the tone recognition system classifies four tones very well.


Mandarin speech tone recognition pitch normalization pitch contour BP neural network 


  1. 1.
    Lee, L.S.: Voice dictation of Mandarin Chinese. IEEE Signal Processing Magazine 14(4), 63–101 (1997)CrossRefGoogle Scholar
  2. 2.
    Rabiner, L.R., Cheng, M.L., Rosenberg, A.E., McGonegal, C.A.: A comparative performance study of several pitch detection algorithms. IEEE Trans. Acoust. Speech, Signal Processing, ASSP 24, 399–417 (1976)CrossRefGoogle Scholar
  3. 3.
    Xie, X., Xie, L.: A new approach for tone recognition of isolated Mandarin syllables. Communications and Information Technologies (2006)Google Scholar
  4. 4.
    Markel, J.D.: The SIFT Algorithm for Fundamental Frequency Estimation. IEEE Trans. AU 20, 361–377 (1972)Google Scholar
  5. 5.
    Shilin, X.: Fuzzy Tone Recognition for Chinese Speech. Acta Electronica Sinica, China 24(1), 119–121 (1996)MathSciNetGoogle Scholar
  6. 6.
    Jun, C., Kechu, Y., Bingbing, L.: A tone recognizer using wavelet transform and Hidden Markov Model. Journal of Electronics and Information Technology, China 19(2), 177–182 (1997)Google Scholar
  7. 7.
    Fang, S., Guangrui, H.: Chinese Tones Recognition Based on a New Neural Network. Journal of Shanghai JiaoTong University 31(5), 36–38 (1997)Google Scholar
  8. 8.
    Tang, L., Yin, J.: Mandarin Tone Recognition Based on Pre-Classification. In: Proceedings of the 6th World Congress on Intelligent Control and Automation, June 21-23 (2006)Google Scholar
  9. 9.
    Rabiner, L.R., Schafer, R.W.: Digital Processing of Speech Signals. Science Press, Beijing (1983)Google Scholar
  10. 10.
    Seneff, S.: Real-time harmonic pitch detector. IEEE Trans. Acoustics, Speech and Signal Processing 26(4), 358–365 (1978)CrossRefGoogle Scholar
  11. 11.
    Qing, S., Lin, T.: Introduction of Pattern Recognition.National Defense University of Technology Press, Changsha (1991)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Zhaoqiang Xie
    • 1
  • Zhenjiang Miao
    • 1
  1. 1.Institute of Information ScienceBeijing Jiao Tong UniversityBeijingP.R. China

Personalised recommendations