Advertisement

Instrumentals/Songs Separation for Background Music Removal

  • Himadri MukherjeeEmail author
  • Sk Md Obaidullah
  • K. C. Santosh
  • Teresa Gonçalves
  • Santanu Phadikar
  • Kaushik Roy
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 945)

Abstract

The music industry has come a long way since its inception. Music producers have also adhered to modern technology to infuse life into their creations. Systems capable of separating sounds based on sources especially vocals from songs have always been a necessity which has gained attention from researchers as well. The challenge of vocal separation elevates even more in the case of the multi-instrument environment. It is essential for a system to be first able to detect that whether a piece of music contains vocals or not prior to attempting source separation. In this paper, such a system is proposed being tested on a database of more than 99 h of instrumentals and songs. Using line spectral frequency-based features, we have obtained the highest accuracy of 99.78% from among six different classifiers, viz. BayesNet, Support Vector Machine, Multi Layer Perceptron, LibLinear, Simple Logistic and Decision Table.

Keywords

Background track Vocals Line spectral frequency Framing 

References

  1. 1.
    Leung, T.W., Ngo, C.W., Lau, R.W.: ICA-FX features for classification of singing voice and instrumental sound. In: Proceedings of the 17th International Conference on Pattern Recognition, ICPR 2004, vol. 2, pp. 367–370. IEEE (2004)Google Scholar
  2. 2.
    Chanrungutai, A., Ratanamahatana, C.A.: Singing voice separation for mono-channel music using non-negative matrix factorization. In: International Conference on Advanced Technologies for Communications, ATC 2008, pp. 243–246. IEEE (2008)Google Scholar
  3. 3.
    Rocamora, M., Herrera, P.: Comparing audio descriptors for singing voice detection in music audio files. In: 11th Brazilian Symposium on Computer Music, São Paulo, Brazil, vol. 26, p. 27 (2007)Google Scholar
  4. 4.
    Hsu, C.L., Jang, J.S.R.: On the improvement of singing voice separation for monaural recordings using the MIR-1K dataset. IEEE Trans. Audio Speech. Lang. Process. 18(2), 310–319 (2010)CrossRefGoogle Scholar
  5. 5.
    Rafii, Z., Pardo, B.: A simple music/voice separation method based on the extraction of the repeating musical structure. In: 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 221–224. IEEE (2011)Google Scholar
  6. 6.
    Rafii, Z., Pardo, B.: Repeating pattern extraction technique (REPET): a simple method for music/voice separation. IEEE Trans. Audio Speech Lang. Process. 21(1), 73–84 (2013)CrossRefGoogle Scholar
  7. 7.
    Liutkus, A., Rafii, Z., Badeau, R., Pardo, B., Richard, G.: Adaptive filtering for music/voice separation exploiting the repeating musical structure. In: 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 53–56. IEEE (2012)Google Scholar
  8. 8.
    Ghosal, A., Chakraborty, R., Dhara, B.C., Saha, S.K.: Song/instrumental classification using spectrogram based contextual features. In: Proceedings of the CUBE International Information Technology Conference, pp. 21–25. ACM (2012)Google Scholar
  9. 9.
    Mauch, M., Fujihara, H., Yoshii, K., Goto, M.: Timbre and melody features for the recognition of vocal activity and instrumental solos in polyphonic music. In: ISMIR, pp. 233–238 (2011)Google Scholar
  10. 10.
    https://www.youtube.com/. Accessed 1 Mar 2018
  11. 11.
    https://www.ethnologue.com/. Accessed 1 Mar 2018
  12. 12.
    Mukherjee, H., Obaidullah, S.M., Phadikar, S., Roy, K.: SMIL-a musical instrument identification system. In: International Conference on Computational Intelligence, Communications, and Business Analytics, pp. 129–140. Springer, Singapore (2017)Google Scholar
  13. 13.
    Mukherjee, H., Phadikar, S., Rakshit, P., Roy, K.: REARC-a Bangla Phoneme recognizer. In: 2016 International Conference on Accessibility to Digital World (ICADW), pp. 177–180. IEEE (2016)Google Scholar
  14. 14.
    Paliwal, K.K.: On the use of line spectral frequency parameters for speech recognition. Digit. Sig. Process. 2(2), 80–87 (1992)CrossRefGoogle Scholar
  15. 15.
    Friedman, N., Geiger, D., Goldszmidt, M.: Bayesian network classifiers. Mach. Learn. 29(2–3), 131–163 (1997)CrossRefGoogle Scholar
  16. 16.
    Cristianini, N., Shawe-Taylor, J.: An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods. Cambridge University Press, Cambridge (2000)CrossRefGoogle Scholar
  17. 17.
    Fan, R.E., Chang, K.W., Hsieh, C.J., Wang, X.R., Lin, C.J.: LIBLINEAR: a library for large linear classification. J. Mach. Learn. Res. 9, 1871–1874 (2008)zbMATHGoogle Scholar
  18. 18.
    Sumner, M., Frank, E., Hall, M.: Speeding up logistic model tree induction. In: European Conference on Principles of Data Mining and Knowledge Discovery, pp. 675–683. Springer, Heidelberg (2005)CrossRefGoogle Scholar
  19. 19.
    Kohavi, R.: The power of decision tables. In: European Conference on Machine Learning, pp. 174–189. Springer, Heidelberg (1995)Google Scholar
  20. 20.
    Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA data mining software: an update. ACM SIGKDD Explor. Newsl. 11(1), 10–18 (2009)CrossRefGoogle Scholar
  21. 21.
    Mukherjee, H., Halder, C., Phadikar, S., Roy, K.: READ-a Bangla Phoneme recognition system. In: Proceedings of the 5th International Conference on Frontiers in Intelligent Computing: Theory and Applications, pp. 599–607. Springer, Singapore (2017)Google Scholar
  22. 22.
    Dems̆ar, J.: Statistical comparisons of classifiers over multiple data sets. J. Mach. Learn. Res. 7, 1–30 (2006)MathSciNetGoogle Scholar
  23. 23.
    Mukherjee, H., Obaidullah, Sk.Md., Santosh, K.C., Gonçalves, T., Phadikar, S., Roy, K.: Instrumentals/songs separation for background music removal. In: Kulczycki, P., Kowalski, P.A., Lukasik, S. (eds.) Contemporary Computational Science, p. 204. AGH-UST Press, Cracow (2018)Google Scholar

Copyright information

© Springer Nature Switzerland AG 2020

Authors and Affiliations

  • Himadri Mukherjee
    • 1
    Email author
  • Sk Md Obaidullah
    • 2
  • K. C. Santosh
    • 3
  • Teresa Gonçalves
    • 2
  • Santanu Phadikar
    • 4
  • Kaushik Roy
    • 1
  1. 1.Department of Computer ScienceWest Bengal State UniversityKolkataIndia
  2. 2.Department of InformaticsUniversity of EvoraÉvoraPortugal
  3. 3.Department of Computer ScienceThe University of South DakotaVermillionUSA
  4. 4.Department of Computer Science and EngineeringMaulana Abul Kalam Azad University of TechnologyKolkataIndia

Personalised recommendations