Abstract
This paper discusses the use of speech recognition techniques in non-speech sound recognition. It analyses the different techniques used for speech recognition and identifies those that can be used for non-speech sound recognition. It then performs benchmarks on these techniques and determines which technique is better suited for non-speech sound recognition. As a comparison, it also gives results for the use of learning vector quantization (LVQ) and artificial neural network (ANN) techniques in speech recognition.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
R. Brooks, C. Breazeal, M. Marjanovic, B. Scassellati, and M. Williamson, “The Cog Project: Building a Humanoid Robot,” C. Nehaniv, ed., Computation for Metaphors, Analogy and Agents, Springer Lecture Notes in Artificial Intelligence, Springer-Verlag, Vol. 1562, 1998
M. Cowling, R. Sitte, “Sound Identification and Direction Detection in MATLAB for Surveillance Applications,” Proc. MATLAB Users Conference, Melbourne, Australia, November 2000.
M. Cowling, R. Sitte, “Sound Identification and Direction Detection for Surveillance Applications,” Proc. of ICICS 2001, Singapore, October, 2001.
B. Lilly, Robust Speech Recognition in Adverse Environments, PhD Thesis, Faculty of Engineering, Griffith University, Nathan Campus, May 2000.
B. Gold, N. Morgan, Speech and Audio Signal Processing, John Wiley & Sons, Inc, 2000, New York, NY.
C. H. Lee, F. K. Soong, K. Paliwal, “An Overview of Automatic Speech Recognition,” in Automatic Speech and Speaker Recognition: Advanced Topics, Kluwer Academic Publishers, 1996, Norwell, MA.
C. H. Lee, F. K. Soong, K. Paliwal, “An Overview of Speaker Recognition Technology,” in Automatic Speech and Speaker Recognition: Advanced Topics, Kluwer Academic Publishers, 1996, Norwell, MA.
R. Rodman, Computer Speech Technology, Artech House, Inc. 1999, Norwood, MA 02062.
T. Kohonen, Self-Organizing Maps, 1997, Springer-Verlag Berlin, Germany.
R. S. Goldhor, “Recognition of Environmental Sounds,” Proc. ICASSP, Vol. 1, pp 149–152, New York, NY, USA, April 1993
M. Slaney, “Auditory Toolbox”TM, Interval Research Coporation, version 2, 1998.
M. J. Castro, J. C. Perez, “Comparison of Geometric, Connectionist and Structural Techniques on a Difficult Isolated Word Recognition Task,” Proc. European Conference on Speech Comm. and Tech., ESCA, Vol. 3, pp 1599–1602, Berlin, Germany, 1993.
G. Van de Wouver, P. Scheunders, D. Van Dyck, “Wavelet-FILVQ Classifier for Speech Analysis,” Proc. Int. Conference Pattern Recognition, pp. 214–218, Vienna, 1996.
M. Orr, D. Pham, B. Lithgow, R. Mahony, “Speech perception based algorithm for the separation of overlapping speech signal,” Proc. The Seventh Australian and New Zealand Intelligent Information Systems Conference, pp. 341–344 Perth, Western Australia, 2001.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Kluwer Academic Publishers
About this chapter
Cite this chapter
Cowling, M., Sitte, R. (2002). Recognition of Environmental Sounds Using Speech Recognition Techniques. In: Wysocki, T.A., Darnell, M., Honary, B. (eds) Advanced Signal Processing for Communication Systems. The International Series in Engineering and Computer Science, vol 703. Springer, Boston, MA. https://doi.org/10.1007/0-306-47791-2_3
Download citation
DOI: https://doi.org/10.1007/0-306-47791-2_3
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4020-7202-4
Online ISBN: 978-0-306-47791-1
eBook Packages: Springer Book Archive