Abstract
The concept of fuzzy sets and fuzzy logic is widely used to propose of several methods applied to systems modeling, classification and pattern recognition problem. This paper proposes a genetic-fuzzy recognition system for speech recognition. In addition to pre-processing, with mel-cepstral coefficients, the Discrete Cosine Transform (DCT) is used to generate a two-dimensional time matrix for each pattern to be recognized. A genetic algorithms is used to optimize a Mamdani fuzzy inference system in order to obtain the best model for final recognition. The speech recognition system used in this paper was named Hybrid DCT-Genetic-Fuzzy Inference System for Speech Recognition (HGFIS).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Picone, J.W.: Signal modeling techiniques in speech recognition. IEEE Transactions on Computer 79(4), 1214–1247 (1991)
Rabiner, L., Biing-Hwang, J.: Fundamentals of Speech Recognition. Prentice Hall, New Jersey (1993)
Andrews, H.C.: Multidimensional Rotations in Feature Selection. IEEE Transaction on Computers (September 1971)
Abushariah, A.A.M., Gunawan, T.S., Khalifa, O.O.: English Digits Speech Recognition System Based on Hidden Markov Models. In: International Conference on Computer and Communication Engineer (ICCCE 2010), Kuala Lumpur, Malaysia (May 2010)
De Wachter, M., Matton, M., Demuynck, K., Wambacq, P., Cools, R., Compernolle, D.V.: Template-Based Continuous Speech Recognition. IEEE Transactions on Audio, Speech, and Language Processing 15(4) (May 2007)
Revathi, A., Venkataramani, Y.: Speaker Independent Continuous Speech and Isolated Digit Recognition using VQ and HMM. In: International Conference on Communications and Signal Processing, ICCSP (February 2011)
Ahmed, T.N.N., Rao, K.: Discrete Cosine Trasnform. IEEE Transaction on Computers c-24 (January 1974)
Jianqin Zhou, P.C.: Generalized Discrete Cosine Transform. In: 2009- Pacific-Asia Conference on Circuits,Communications and System, Chegdu, China (May 2009)
Zeng, J., Liu, Z.-Q.: Type-2 Fuzzy Hidden Markov Models and Their Application to Speech Recognition. IEEE Transactions on Fuzzy Systems 14(3) (June 2006)
Silva, W.L.S., Serra, G.L.O.: Proposta de Metodologia TCD-Fuzzy para reconhecimentos de Voz. X SBAI – Simposio Brasileiro de Automacao Inteligente. Sao Joao del-Rei - MG - Brasil, pp. 1054–1059 (September 2011)
Milner, B.P., Vaseghi, S.V.: Speech Modeling using Cepstral-Time Feature and Hidden Markov Models. In: Proceedings of In. Conference on Acustic Speech and Signal Processing, Adelaide, vol. I, pp. 601–604 (1994)
Chen, G.: Discussion of Approximation Properties of Minimum Inference Fuzzy System. In: Proceedings of the 29th Chinese Control Conference, Beijing, China, July 29-31 (2010)
Seki, H., Ishii, H., Mizumoto, M.: On the Monotonicity of Fuzzy-Inference Methods Related to T–S Inference Method. IEEE Transactions on Fuzzy Systems 18(3) (June 2010)
Zhou, E., Khotand, A.: Fuzzy Classifier Design Genetic Algorithms (2007)
Zhang, X., Wang, X., Zhang, S., Yu, F.: Approximating the True Domain of Fuzzy Inference Sentence with Genetic Algorithm. In: Seventh International Conference on Fuzzy Systems and Knowledge Discovery, FSKD 2010 (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Silva, W., Serra, G. (2012). A Hybrid Approach Based on DCT-Genetic-Fuzzy Inference System for Speech Recognition. In: Yin, H., Costa, J.A.F., Barreto, G. (eds) Intelligent Data Engineering and Automated Learning - IDEAL 2012. IDEAL 2012. Lecture Notes in Computer Science, vol 7435. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32639-4_7
Download citation
DOI: https://doi.org/10.1007/978-3-642-32639-4_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32638-7
Online ISBN: 978-3-642-32639-4
eBook Packages: Computer ScienceComputer Science (R0)