Recognition of Environmental Sounds Using Speech Recognition Techniques

Cowling, Michael; Sitte, Renate

doi:10.1007/0-306-47791-2_3

Michael Cowling⁴ &
Renate Sitte⁴

Part of the book series: The International Series in Engineering and Computer Science ((SECS,volume 703))

276 Accesses
7 Citations

Abstract

This paper discusses the use of speech recognition techniques in non-speech sound recognition. It analyses the different techniques used for speech recognition and identifies those that can be used for non-speech sound recognition. It then performs benchmarks on these techniques and determines which technique is better suited for non-speech sound recognition. As a comparison, it also gives results for the use of learning vector quantization (LVQ) and artificial neural network (ANN) techniques in speech recognition.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

R. Brooks, C. Breazeal, M. Marjanovic, B. Scassellati, and M. Williamson, “The Cog Project: Building a Humanoid Robot,” C. Nehaniv, ed., Computation for Metaphors, Analogy and Agents, Springer Lecture Notes in Artificial Intelligence, Springer-Verlag, Vol. 1562, 1998
Google Scholar
M. Cowling, R. Sitte, “Sound Identification and Direction Detection in MATLAB for Surveillance Applications,” Proc. MATLAB Users Conference, Melbourne, Australia, November 2000.
Google Scholar
M. Cowling, R. Sitte, “Sound Identification and Direction Detection for Surveillance Applications,” Proc. of ICICS 2001, Singapore, October, 2001.
Google Scholar
B. Lilly, Robust Speech Recognition in Adverse Environments, PhD Thesis, Faculty of Engineering, Griffith University, Nathan Campus, May 2000.
Google Scholar
B. Gold, N. Morgan, Speech and Audio Signal Processing, John Wiley & Sons, Inc, 2000, New York, NY.
Google Scholar
C. H. Lee, F. K. Soong, K. Paliwal, “An Overview of Automatic Speech Recognition,” in Automatic Speech and Speaker Recognition: Advanced Topics, Kluwer Academic Publishers, 1996, Norwell, MA.
Google Scholar
C. H. Lee, F. K. Soong, K. Paliwal, “An Overview of Speaker Recognition Technology,” in Automatic Speech and Speaker Recognition: Advanced Topics, Kluwer Academic Publishers, 1996, Norwell, MA.
Google Scholar
R. Rodman, Computer Speech Technology, Artech House, Inc. 1999, Norwood, MA 02062.
Google Scholar
T. Kohonen, Self-Organizing Maps, 1997, Springer-Verlag Berlin, Germany.
Google Scholar
R. S. Goldhor, “Recognition of Environmental Sounds,” Proc. ICASSP, Vol. 1, pp 149–152, New York, NY, USA, April 1993
Google Scholar
M. Slaney, “Auditory Toolbox”^TM, Interval Research Coporation, version 2, 1998.
Google Scholar
M. J. Castro, J. C. Perez, “Comparison of Geometric, Connectionist and Structural Techniques on a Difficult Isolated Word Recognition Task,” Proc. European Conference on Speech Comm. and Tech., ESCA, Vol. 3, pp 1599–1602, Berlin, Germany, 1993.
Google Scholar
G. Van de Wouver, P. Scheunders, D. Van Dyck, “Wavelet-FILVQ Classifier for Speech Analysis,” Proc. Int. Conference Pattern Recognition, pp. 214–218, Vienna, 1996.
Google Scholar
M. Orr, D. Pham, B. Lithgow, R. Mahony, “Speech perception based algorithm for the separation of overlapping speech signal,” Proc. The Seventh Australian and New Zealand Intelligent Information Systems Conference, pp. 341–344 Perth, Western Australia, 2001.
Google Scholar

Download references

Author information

Authors and Affiliations

Griffith University, Gold Coast, Qld, 9726, Australia
Michael Cowling & Renate Sitte

Authors

Michael Cowling
View author publications
You can also search for this author in PubMed Google Scholar
Renate Sitte
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

University of Wollongong, Australia
Tadeusz A. Wysocki
The University of Leeds, UK
Michael Darnell
Lancaster University, UK
Bahram Honary

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Cowling, M., Sitte, R. (2002). Recognition of Environmental Sounds Using Speech Recognition Techniques. In: Wysocki, T.A., Darnell, M., Honary, B. (eds) Advanced Signal Processing for Communication Systems. The International Series in Engineering and Computer Science, vol 703. Springer, Boston, MA. https://doi.org/10.1007/0-306-47791-2_3

Download citation

DOI: https://doi.org/10.1007/0-306-47791-2_3
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4020-7202-4
Online ISBN: 978-0-306-47791-1
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics