Advertisement

Acoustic Scene Classification Using Convolutional Neural Network

  • S. AksharaEmail author
  • R. Hemapriyalakshmi
  • S. Keerthana
  • B. Bharathi
  • S. Kavitha
Conference paper
  • 17 Downloads
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 1118)

Abstract

This proposed research work presents acoustic scene classification (ASC) which is an errand to relate a semantic name to a sound stream that distinguishes the environment in which it has been delivered. ASC can be applied in many areas including mobile robot navigation systems and context-aware devices, such as an automatically mode-switching smart phones according to the current acoustic environment. Proposing a strong ASC system is difficult because the sound from natural setting compromises numerous audio sources and also the microphones do not seem to be organized in a very controlled condition. Furthermore, not all sounds from long-duration audio data are relevant for identifying scene label. The dataset for this assignment is that the DCASE 2018 dataset collected from Tampere University of Technology, comprising of sound recordings from different scenes like airport, metro station, shopping mall, etc. For each location, there are 5–6 min of audio files. We propose to implement the ASC task using convolutional neural network (CNN) that performs the task of classification. The audio files are converted to log mel-spectrograms which are provided as input to CNN. Upon training the CNN model by varying the number of layers and the hyperparameters, it is observed that significant accuracy of 78.4 and 73.84% has been achieved for the inputs RGB scale spectrograms and grayscale spectrograms, respectively.

Keywords

Acoustic scene classification DCASE 2018 Machine learning Convolutional neural networks Log mel-spectrograms 

References

  1. 1.
    Dang, A., . Vu, T.-H., Wang, J.-C.: Acoustic scene classification using convolutional neural networks and multi-scale multi-feature extraction. In: Proceedings of Detection and Classification of Acoustic Scenes and Events (2017)Google Scholar
  2. 2.
    Mesaros, A., Heittola, T., Benetos, E., Foster, P., Lagrange, M., Virtane, T., Plumbley, M.D: Detection and classification of acoustic scenes and events: outcome of the DCASE 2016 challenge. In: IEEE/ACM Trans. Audio Speech Lang. Process. (2016)Google Scholar
  3. 3.
    Mesaros, A., Heittola, T., Virtanen, T.: Tut database for acoustic scene classification and sound event detection. In: Proceedings of Signal Processing Conference (EUSIPCO) (2016)Google Scholar
  4. 4.
    Li, D., Tam, J., Toub, D.: Auditory scene classification using machine learning techniques. In: Proceedings of IEEE AASP Challenge on Detection and Classification of Acoustic Scenes and Events (2013)Google Scholar
  5. 5.
    Barchiesi, D., Giannoulis, D., Stowell, D., Plumbley, M.D.: Acoustic scene classification. In: Proceedings of Detection and Classification of Acoustic Scenes and Events, pp. 4–12 (2014)Google Scholar
  6. 6.
    Takahashi, G., Yamada, T., Ono, N., Makino, S: Performance evaluation of acoustic scene classification using DNN-GMM and frame-concatenated acoustic features. In: Proceedings of APSIPA Annual Summit and Conference (2017)Google Scholar
  7. 7.
    Jiang, H., Bai, J., Zhang, S., Xu, B: SVM-based audio scene classification. In: Proceedings of IEEE International Conference on Natural Language Processing and Knowledge Engineering (2005)Google Scholar
  8. 8.
    Battaglino, D., Lepauloux, L., Evans, N.: Acoustic scene classification using convolutional neural networks. In: Proceedings of Detection and Classification of Acoustic Scenes and Events (2016)Google Scholar
  9. 9.
    D. Giannoulis, E. Benetos, D. Stowell, M. Rossignol, M. Lagrange, and M. D. Plumbley: Detection and classification of acoustic scenes and events: an IEEE AASP challenge. In: Proceedings of IEEE Applications of Signal Processing to Audio and Acoustics (WASPAA), pp. 1–4 (2013)Google Scholar
  10. 10.
    Alexander, G., Alexander, L: Acoustic scene classification using convolutional neural networks and different channels representations and its fusion. In: Proceedings of Detection and Classification of Acoustic Scenes and Events (2018)Google Scholar
  11. 11.
    Hussaina, K., Hussainb, M., Khanc, M.G.: An improved acoustic scene classification method using convolutional neural networks (CNNs). Am. Sci. Res. J. Eng. Technol. Sci. (2018)Google Scholar
  12. 12.

Copyright information

© Springer Nature Singapore Pte Ltd. 2020

Authors and Affiliations

  • S. Akshara
    • 1
    Email author
  • R. Hemapriyalakshmi
    • 1
  • S. Keerthana
    • 1
  • B. Bharathi
    • 1
  • S. Kavitha
    • 1
  1. 1.Department of CSESSN College of EngineeringChennaiIndia

Personalised recommendations