Music Generation Using Deep Learning

Bhave, Aishwarya; Sharma, Mayank; Janghel, Rekh Ram

doi:10.1007/978-981-13-3393-4_21

Conference paper
First Online: 14 February 2019

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 898)

Abstract

Deep learning has recently been applied to many art-related tasks, such as the automatic generation of music and pictures. This paper addresses music generation from raw audio files represented in the frequency domain, using Restricted Boltzmann Machine (RBM) and Long Short-Term Memory (LSTM) architectures. The work does not use any information about musical structure to aid learning; instead, it learns from a previous permutation of notes and generates an optimal and pleasant permutation. It also serves as a comparative study of LSTM and RBM for music generation.
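As an illustration of what a frequency-domain representation of raw audio can look like, the following is a minimal sketch and not the authors' pipeline, which the abstract does not describe. It assumes fixed-size, non-overlapping blocks, and it uses a naive standard-library DFT in place of an FFT routine so that the example stays self-contained. The function names and the block size are hypothetical. Each block becomes one real-valued vector (real parts followed by imaginary parts) that a sequence model could consume, and the inverse step turns such vectors back into samples.

```python
import cmath
import math


def dft(block):
    """Naive O(n^2) discrete Fourier transform of a list of real samples."""
    n = len(block)
    return [
        sum(x * cmath.exp(-2j * cmath.pi * k * t / n) for t, x in enumerate(block))
        for k in range(n)
    ]


def idft(spectrum):
    """Inverse of dft(); keeps only the real part of each reconstructed sample."""
    n = len(spectrum)
    return [
        (sum(c * cmath.exp(2j * cmath.pi * k * t / n)
             for k, c in enumerate(spectrum)) / n).real
        for t in range(n)
    ]


def audio_to_features(samples, block_size):
    """Split samples into non-overlapping blocks and transform each one.

    Each feature vector is [real parts..., imaginary parts...], so its
    length is 2 * block_size. A trailing partial block is dropped.
    """
    features = []
    for start in range(0, len(samples) - block_size + 1, block_size):
        spectrum = dft(samples[start:start + block_size])
        features.append([c.real for c in spectrum] + [c.imag for c in spectrum])
    return features


def features_to_audio(features):
    """Invert audio_to_features(): rebuild time-domain samples block by block."""
    samples = []
    for vec in features:
        half = len(vec) // 2
        spectrum = [complex(re, im) for re, im in zip(vec[:half], vec[half:])]
        samples.extend(idft(spectrum))
    return samples


if __name__ == "__main__":
    # Hypothetical input: 32 samples of a sine wave, block size 8 (assumed values).
    wave = [math.sin(2 * math.pi * 3 * t / 32) for t in range(32)]
    feats = audio_to_features(wave, block_size=8)
    restored = features_to_audio(feats)
    print(len(feats), len(feats[0]))
    print(max(abs(a - b) for a, b in zip(wave, restored)) < 1e-9)
```

In a setup like this, a generative model would be trained on the sequence of feature vectors, and the vectors it produces would be passed through `features_to_audio` to obtain a waveform. With real audio, an FFT implementation would replace the naive transform for speed.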

Author information

Authors and Affiliations

National Institute of Technology, Raipur, India
Aishwarya Bhave, Mayank Sharma & Rekh Ram Janghel

Corresponding author

Correspondence to Aishwarya Bhave.

Editor information

Editors and Affiliations

Department of Computer Science and Software Engineering, Monmouth University, West Long Branch, NJ, USA
Jiacun Wang
Department of Information Technology, National Institute of Technology Karnataka, Surathkal, Mangaluru, Karnataka, India
G. Ram Mohana Reddy
Department of Computer Science and Engineering, JNTUH College of Engineering Hyderabad, Hyderabad, Telangana, India
V. Kamakshi Prasad
Department of Electronics and Communication Engineering, Malla Reddy College of Engineering & Technology, Secunderabad, Telangana, India
V. Sivakumar Reddy

Cite this paper

Bhave, A., Sharma, M., Janghel, R.R. (2019). Music Generation Using Deep Learning. In: Wang, J., Reddy, G., Prasad, V., Reddy, V. (eds) Soft Computing and Signal Processing. Advances in Intelligent Systems and Computing, vol 898. Springer, Singapore. https://doi.org/10.1007/978-981-13-3393-4_21

DOI: https://doi.org/10.1007/978-981-13-3393-4_21
Published: 14 February 2019
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-3392-7
Online ISBN: 978-981-13-3393-4
eBook Packages: Intelligent Technologies and Robotics (R0)
