Abstract
This proposed research work presents acoustic scene classification (ASC) which is an errand to relate a semantic name to a sound stream that distinguishes the environment in which it has been delivered. ASC can be applied in many areas including mobile robot navigation systems and context-aware devices, such as an automatically mode-switching smart phones according to the current acoustic environment. Proposing a strong ASC system is difficult because the sound from natural setting compromises numerous audio sources and also the microphones do not seem to be organized in a very controlled condition. Furthermore, not all sounds from long-duration audio data are relevant for identifying scene label. The dataset for this assignment is that the DCASE 2018 dataset collected from Tampere University of Technology, comprising of sound recordings from different scenes like airport, metro station, shopping mall, etc. For each location, there are 5–6 min of audio files. We propose to implement the ASC task using convolutional neural network (CNN) that performs the task of classification. The audio files are converted to log mel-spectrograms which are provided as input to CNN. Upon training the CNN model by varying the number of layers and the hyperparameters, it is observed that significant accuracy of 78.4 and 73.84% has been achieved for the inputs RGB scale spectrograms and grayscale spectrograms, respectively.
Keywords
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Dang, A., . Vu, T.-H., Wang, J.-C.: Acoustic scene classification using convolutional neural networks and multi-scale multi-feature extraction. In: Proceedings of Detection and Classification of Acoustic Scenes and Events (2017)
Mesaros, A., Heittola, T., Benetos, E., Foster, P., Lagrange, M., Virtane, T., Plumbley, M.D: Detection and classification of acoustic scenes and events: outcome of the DCASE 2016 challenge. In: IEEE/ACM Trans. Audio Speech Lang. Process. (2016)
Mesaros, A., Heittola, T., Virtanen, T.: Tut database for acoustic scene classification and sound event detection. In: Proceedings of Signal Processing Conference (EUSIPCO) (2016)
Li, D., Tam, J., Toub, D.: Auditory scene classification using machine learning techniques. In: Proceedings of IEEE AASP Challenge on Detection and Classification of Acoustic Scenes and Events (2013)
Barchiesi, D., Giannoulis, D., Stowell, D., Plumbley, M.D.: Acoustic scene classification. In: Proceedings of Detection and Classification of Acoustic Scenes and Events, pp. 4–12 (2014)
Takahashi, G., Yamada, T., Ono, N., Makino, S: Performance evaluation of acoustic scene classification using DNN-GMM and frame-concatenated acoustic features. In: Proceedings of APSIPA Annual Summit and Conference (2017)
Jiang, H., Bai, J., Zhang, S., Xu, B: SVM-based audio scene classification. In: Proceedings of IEEE International Conference on Natural Language Processing and Knowledge Engineering (2005)
Battaglino, D., Lepauloux, L., Evans, N.: Acoustic scene classification using convolutional neural networks. In: Proceedings of Detection and Classification of Acoustic Scenes and Events (2016)
D. Giannoulis, E. Benetos, D. Stowell, M. Rossignol, M. Lagrange, and M. D. Plumbley: Detection and classification of acoustic scenes and events: an IEEE AASP challenge. In: Proceedings of IEEE Applications of Signal Processing to Audio and Acoustics (WASPAA), pp. 1–4 (2013)
Alexander, G., Alexander, L: Acoustic scene classification using convolutional neural networks and different channels representations and its fusion. In: Proceedings of Detection and Classification of Acoustic Scenes and Events (2018)
Hussaina, K., Hussainb, M., Khanc, M.G.: An improved acoustic scene classification method using convolutional neural networks (CNNs). Am. Sci. Res. J. Eng. Technol. Sci. (2018)
http://dcase.community/challenge2018/task-acoustic-scene-classification
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Akshara, S., Hemapriyalakshmi, R., Keerthana, S., Bharathi, B., Kavitha, S. (2020). Acoustic Scene Classification Using Convolutional Neural Network. In: Reddy, V., Prasad, V., Wang, J., Reddy, K. (eds) Soft Computing and Signal Processing. ICSCSP 2019. Advances in Intelligent Systems and Computing, vol 1118. Springer, Singapore. https://doi.org/10.1007/978-981-15-2475-2_28
Download citation
DOI: https://doi.org/10.1007/978-981-15-2475-2_28
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-2474-5
Online ISBN: 978-981-15-2475-2
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)