Abstract
In the last ten years, our ways to listen to music have drastically changed: In earlier times, we went to record stores or had to use low bit-rate audio coding to get some music and to store it on PCs. Nowadays, millions of songs are within reach via on-line distributors. Some music lovers already got terabytes of music on their hard disc. Users are now no longer desparate to get music, but to select, to find the music they love. A number of technologies has been developed to adress these new requirements. There are techniques to identify music and ways to search for music. Recommendation today is a hot topic as well as organizing music into playlists.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
- 8.
- 9.
- 10.
- 11.
- 12.
- 13.
- 14.
- 15.
- 16.
- 17.
- 18.
- 19.
- 20.
- 21.
- 22.
- 23.
References
Abeßer, J., Dittmar, C., Großmann, H.: Automatic genre and artist classification by analyzing improvised solo parts from musical recordings. In: Proceedings of the Audio Mostly Conference (AMC). Piteå, Sweden (2008)
Allamanche, E., Herre, J., Hellmuth, O., Kastner, T., Ertel, C.: A multiple feature model for music similarity retrieval. In: Proceedings of the 4th International Symposium of Music Information Retrieval (ISMIR). Baltimore, Maryland, USA (2003)
Allamanche, E., Herre, J., Helmuth, O., Froba, B., Kastner, T., Cremer, M.: Content-based identification of audio material using MPEG-7 low level description. In: Proceedings of the 2nd International Symposium of Music Information Retrieval (ISMIR). Bloomington, Indiana, USA (2001)
Anderson, C.: The Long Tail: Why the Future of Business is Selling Less of More. Hyperion, New York, NY, USA (2006)
Aucouturier, J.J., Defreville, B., Pachet, F.: The bag-of-frame approach to audio pattern recognition: A sufficient model for urban soundscapes but not for polyphonic music. Journal of the Acoustical Society of America 122(2), 881–891 (2007)
Aucouturier, J.J., Pachet, F.: Music similarity measures: What’s the use? In: Proceedings of the 3rd International Conference on Music Information Retrieval (ISMIR). Paris, France (2002)
Aucouturier, J.J., Pachet, F.: Improving timbre similarity: How high is the sky? Journal of Negative Results in Speech and Audio Sciences 1(1), 1–13 (2004)
Aucouturier, J.J., Pachet, F., Sandler, M.: The way it sounds: timbre models for analysis and retrieval of music signals. IEEE Transactions on Multimedia 7(6), 1028–1035 (2005)
Bainbridge, D., Cunningham, S., Downie, J.: Visual collaging of music in a digital library. In: Proceedings of the International Conference on Music Information Retrieval (ISMIR). Barcelona, Spain (2004)
Bastuck, C., Dittmar, C.: An integrative framework for content-based music similarity retrieval. In: Proceedings of the 35th German Annual Conference on Acoustics (DAGA). Dresden, Germany (2008)
Bello, J.P., Pickens, J.: A robust mid-level representation for harmonic content in music signals. In: Proceedings of the 6th International Conference on Music Information Retrieval (ISMIR). London, UK (2005)
Brown, J.: Determination of the meter of musical scores by autocorrelation. Journal of the Acoustical Society of America 94(4), 1953–1957 (1993)
Casey, M.: MPEG-7 sound recognition. IEEE Transactions on Circuits and Systems Video Technology, special issue on MPEG-7 11, 737–747 (2001)
Celma, O.: Music recommendation and discovery in the long tail. Ph.D. thesis, Universitat Pompeu Fabra, Barcelona, Spain (2008)
Chen, P.H., Cheh-Jen, L., Schölkopf, B.: A turorial on ν-support vector machines. Tech. rep., Department of Computer Science and Information Engineering, Taipei, Max Planck Institute for Biological Cybernetics, Tübingen (2005)
Cunningham, S., Caulder, S., Grout, V.: Saturday night or fever? Context aware music playlists. In: Proceeding of the Audio Mostly Conference (AMC). Piteå, Sweden (2008)
Cunningham, S., Zhang, Y.: Development of a music organizer for children. In: Proceedings of the 9th International Conference on Music Information Retrieval (ISMIR). Philadelphia, Pennsylvania (2008)
Dempster, A.P., Laird, N.M., Rdin, D.B.: Maximum likelihood from incomplete data via the em algorithm. Journal of the Royal Statistical Society, Series B 39, 1–38 (1977)
Dittmar, C., Bastuck, C., Gruhne, M.: Novel mid-level audio features for music similarity. In: Proc. of the Intern. Conference on Music Communication Science (ICOMCS). Sydney, Australia (2007)
Dittmar, C., Dressler, K., Rosenbauer, K.: A toolbox for automatic transcription of polyphonic music. In: Proceedings of the Audio Mostly Conference (AMC). Ilmenau, Germany (2007)
Dittmar, C., Uhle, C.: Further steps towards drum transcription of polyphonic music. In: Proceedings of the AES 116th Convention (2004)
Dixon, S.: Onset detection revisited. In: Proceedings of the 9th International Conference on Digital Audio Effects (DAFx06). Montréal, Québec, Canada (2006)
Dunker, P., Nowak, S., Begau, A., Lanz, C.: Content-based mood classification for photos and music: A generic multi-modal classification framework and evaluation approach. In: Proceedings of the International Conference on Multimedia Information Retrieval (ACM MIR). Vancouver, Canada (2008)
Eck, D., Bertin-Mahieux, T., Lamere, P.: Autotagging music using supervised machine learning. In: Proceedings of the 8th International Conference on Music Information Retrieval (ISMIR). Vienna, Austria (2007)
Eerola, T., North, A.C.: Expectancy-based model of melodic complexity. In: Proceedings of the 6th International Conference of Music Perception and Cognition (ICMPC). Keele, Staffordshire, England (2000)
Ellis, D.: Classifying music audio with timbral and chroma features. In: Proceedings of the 8th International Conference on Music Information Retrieval (ISMIR). Vienna, Austria (2007)
Feng, Y., Zhuang, Y., Pan, Y.: Music information retrieval by detecting mood via computational media aesthetics. International Conference onWeb Intelligence (IEEE/WIC) pp. 235–241 (2003)
Flexer, A., Pampalk, E., Widmer, G.: Hidden markov models for spectral similarity of songs. In: Proceedings of the 8th International Conference on Digital Audio Effects (DAFX’05). Madrid, Spain (2008)
Foote, J.: Visualizing music and audio using self-similarity. In: Proceedings of the seventh ACM international conference on Multimedia (Part 1). New York, NY, USA (1999)
Foote, J.T.: Content-based retrieval of music and audio. In: Proceeding of SPIE Conference on Multimedia Storage and Archiving Systems II. Dallas, TX, USA (1997)
Fukunaga, K.: Introduction to Statistical Pattern Recognition, Second Edition (Computer Science and Scientific Computing Series). Academic Press (1990)
Gillet, O., Richard, G.: Enst-drums: an extensive audio-visual database for drum signals processing. In: Proceedings of the 7th International Conference on Music Information Retrieval (ISMIR). Victoria, BC, Canada (2006)
Goto, M.: A real-time music-scene-description system - predominant-f0 estimation for detecting melody and bass lines in real-world audio signals. Speech Communication 43, 311–329 (2004)
Goto, M.: AIST annotation for the RWC music database. In: Proceedings of the 7th International Conference on Music Information Retrieval (ISMIR). Victoria, BC, Canada (2006)
Goussevskaia, O., Kuhn, M., Lorenzi, M., Wattenhofer, R.: From Web to Map: Exploring the World of Music. In: Proceedings of IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT). Sydney, Australia (2008)
Gouyon, F., Fabig, L., Bonada, J.: Rhythmic expressiveness transformations of audio recordings - swing modifications. In: Proceedings of the 60th International Conference on Digital Audio Effects (DAFx). London, UK (2003)
Gouyon, F., Herrera, P.: Determination of the meter of musical audio signals: Seeking recurrences in beat segment descriptors. In: Proceedings of the 114th AES Convention. Amsterdam, Netherlands (2003)
Gouyon, F., Klapuri, A., Dixon, S., Alonso, M., Tzanetakis, G., Uhle, C., Cano, P.: An experimental comparison of audio tempo induction algorithms. IEEE Transactions on Speech and Audio Processing 14, 1832–1844 (2006)
Gouyon, F., Pachet, F., Delerue, O.: The use of zerocrossing rate for an application of classification of percussive sounds. In: COST G-6 Conference on Digital Audio Effects (DAFx). Verona, Italy (2000)
Hainsworth, S.W., Macleod, M.D.: Automatic bass line transcription from polyphonic music. In: Proceedings of the International Computer Music Conference (ICMC). Havana, Cuba (2001)
Hanjalic, A.: Extracting moods from pictures and sounds. IEEE Signal Processing Magazine 23(2), 90–100 (2006)
Harte, C.A., Sandler, M.B.: Automatic chord identification using a quantised chromagram. In: Proceedings of the 118th AES Convention. Barcelona, Spain (2005)
Herre, J., Allamanche, E., Ertel, C.: How similar do songs sound? In: Proceedings of the IEEE Workshop on Applications of Singal Processing to Audio and Acoustics (WASPAA). Mohonk, New York, USA (2003)
Herrera, P., Sandvold, V., Gouyon, F.: Percussion-related semantic descriptors of music audio files. In: Proceedings of the 25th International AES Conference. London, UK (2004)
Hevner, K.: Experimental studies of the elements of expression in music. American Journal of Psychology 48(2), 246–268 (1936)
Hilliges, O., Holzer, P., Kluber, R., Butz, A.: AudioRadar: A metaphorical visualization for the navigation of large music collections. Lecture Notes in Computer Science 4073, 82 (2006)
Hiraga, R., Mizaki, R., Fujishiro, I.: Performance visualization: a new challenge to music through visualization. In: Proceedings of the 10th ACM international conference on Multimedia. New York, NY, USA (2002)
Hsu, C., Chang, C., Lin, C., et al.: A practical guide to support vector classification. Tech. rep., National Taiwan University, Taiwan (2003)
Hsu, J.L., Liu, C.C., Chen, A.L.P.: Discovering nontrivial repeating patterns in music data. IEEE Transactions on Multimedia 3(3), 311–325 (2001)
Hu, X., Downie1, J.S., Laurier, C., Bay, M., Ehmann, A.F.: The 2007 MIREX Audio Mood Classification Task: Lessons Learned. In: Proceedings of the 9th International Conference on Music Information Retrieval (ISMIR). Philadelphia, Pennsylvania, USA (2008)
Hunt, M.J., Lennig, M., Mermelstein, P.: Experiments in syllable-based recognition of continuous speech. In: Proceedings of the International Conference on Acoustics and Signal Processing (ICASSP). Denver, Colorado, USA (1980)
Isaacson, E.: What you see is what you get: on visualizing music. In: Proceedings of the International Conference on Music Information Retrieval. London, UK (2005)
ISO/IEC: ISO/IEC 15938-4 (MPEG-7 Audio). ISO (2002)
Jennings, D.: Net, Blogs and Rock ’n’ Roll: How Digital Discovery Works and What it Means for Consumers. Nicholas Brealey Publishing (2007)
Johnston, J.: Transform coding of audio signals using perceptual noise criteria. IEEE Journal on Selected Areas in Communications 6(2), 314–322 (1988)
Kim, Y., Whitman, B.: Singer identification in popular music recordings using voice coding features. In: Proceedings of 3rd International Symposium on Music Information Retrieval (ISMIR). Paris, France (2002)
Kolhoff, P., Preuß, J., Loviscach, J.: Content-based icons for music files. Computers & Graphics 32(5), 550–560 (2008)
Kullback, S.: Information Theory and Statistics (Dover Books on Mathematics). Dover Publications (1997)
de Léon, P.J.P., Inesta, J.M.: Pattern recognition approach for music style identification using shallow statistical descriptors. IEEE Transactions on System, Man and Cybernetics - Part C : Applications and Reviews 37(2), 248–257 (2007)
Lew, M.S., Sebe, N., Lifl, C.D., Jain, R.: Content-based multimedia information retrieval: State of the art and challenges. ACM Transactions on Multimedia Computing, Communications, and Applications (2006)
Li, T., Ogihara, M.: Detecting emotion in music. Proceedings of the Fifth International Symposium on Music Information Retrieval pp. 239–240 (2003)
Li, T., Ogihara, M.: Content-based music similarity search and emotion detection. Proceedings IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). 5 (2004)
Licklider, J., Pollack, I.: Effects of differentiation, integration, and infinite peak clipping on the intelligibility of speech. Journal Acoustical Society of America 20, 42–51 (1948)
Lidy, T., Rauber, A., Pertusa, A., Iesta, J.M.: Improving genre classification by combination of audio and symbolic descriptors using a transcription system. In: Proceedings of the 8th International Conference on Music Information Retrieval (ISMIR). Vienna, Austria (2007)
Lillie, A.S.: Musicbox: Navigating the space of your music. Master’s thesis, Massachusetts Institute of Technology, USA (2008)
Liu, B.: Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data. Springer, New York, NY, USA (2008)
Liu, C., Yang, Y., Wu, P., Chen, H.: Detecting and classifying emotion in popular music. In: 9th Joint International Conference on Information Sciences (2006)
Liu, D., Lu, L., Zhang, H.: Automatic mood detection from acoustic music data. In: Proceedings International Symposium Music Information Retrieval (ISMIR), pp. 81–87 (2003)
Liu, Z., Huang, Q.: Content-based indexing and retrieval-by-example in audio. In: Proceedings of IEEE International Conference on Multimedia and Expo (ICME). New York City, NY, USA (2000)
Logan, B.: Mel frequency cepstral coefficients for music modeling. In: Proceedings of 1st International Symposium on Music Information Retrieval (ISMIR). Plymouth, Massachusetts, USA (2000)
Logan, B., Salomon, A.: A music similarity function based on signal analysis. In: Proceedings of IEEE International Conference on Multimedia and Expo (ICME). Tokyo, Japan (2001)
Lu, L., Liu, D., Zhang, H.: Automatic mood detection and tracking of music audio signals. IEEE Transactions on Audio, Speech & Language Processing 14(1), 5–18 (2006)
Lukashevich, H., Dittmar, C.: Applying statistical models and parametric distance measures for music similarity search. In: Proceedings of the 32nd Annual Conference of German Classification Society. Hamburg, Germany (2008)
Madsen, S.T., Widmer, G.: A complexity-based approach to melody track identification in midi files. In: Proceedings of the International Workshop on Artificial Intelligence and Music (MUSIC-AI). Hyderabad, India (2007)
Magno, T., Sable, C.: A comparison of signal-based music recommendation to genre labels, collaborative filtering, musicological analysis, human recommendation, and random baseline. In: Proceedings of the 9th International Conference on Music Information Retrieval (ISMIR). Philadelphia, USA (2008)
Mandel, M.I., Poliner, G.E., Ellis, D.P.: Support vector machine active learning for music retrieval. Multimedia Systems 12, 1–11 (2006)
de Mántaras, R.L., Arcos, J.L.: AI and music: From composition to expressive performances. AI Magazine 23, 43–57 (2002)
McKay, C., Fujinaga, I.: Automatic genre classification using large high-level musical feature sets. In: Proceedings of the International Conference in Music Information Retrieval (ISMIR). Barcelona, Spain (2004)
Mierswa, I., Morik, K.: Automatic feature extraction for classifying audio data. Machine Learning Journal 58, 127–149 (2005)
Mörchen, F., Ultsch, A., Nöcker, M., Stamm, C.: Databionic visualization of music collections according to perceptual distance. In: Proceedings of the 6th International Conference on Music Information Retrieval (ISMIR). London, UK (2005)
Moschou, V., Kotti, M., Benetos, E., Kotropoulos, C.: Systematic comparison of BIC-based speaker segmentation systems. In: Proceedings of IEEE 9th Workshop on Multimedia Signal Processing (MMSP). Crete, Greece (2007)
Müller, M., Appelt, D.: Path-constrained partial music synchronization. In: Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Las Vegas, USA (2008)
Neumayer, R., Dittenbach, M., Rauber, A.: PlaySOM and PocketSOMPlayer, alternative interfaces to large music collections. In: Proceedings of the 6th International Conference on Music Information Retrieval (ISMIR). London, UK (2005)
Nowak, S., Bastuck, C., Dittmar, C.: Exploring music collections through automatic similarity visualization. In: Tagungsband der DAGA Fortschritte der Akustik. Dresden, Germany (2008)
Pampalk, E.: Computational models of music similarity and their application in music information retrieval. Ph.D. thesis, Vienna University of Technology, Vienna, Austria (2006)
Pampalk, E., Pohle, T., Widmer, G.: Dynamic playlist generation based on skipping behaviour. In: Proceedings of the 6th International Conference on Music Information Retrieval (ISMIR). London, UK (2005)
Pampalk, E., Rauber, A., Merkl, D.: Content-based organization and visualization of music archives. In: Proceedings of the 10th ACM international conference on Multimedia. New York, NY, USA (2002)
Peeters, G.: A large set of audio features for sound description (similarity and classification) in the CUIDADO project. Tech. Rep. CUIDADO I.S.T. Project, Institut de Recherche et Coordination Acoustique/Musique (IRCAM), Paris, France (2004)
Poliner, G.E., Ellis, D.P.W., Ehmann, A.F., Gómez, E., Streich, S., Ong, B.: Melody transcription from music audio: Approaches and evaluation. IEEE Transactions on Audio, Speech, and Language Processing 15, 1247–1256 (2007)
Raimond, Y.: A distributed music information system. Ph.D. thesis, Queen Mary, University of London, London, UK (2008)
Russell, J.: A circumplex model of affect. Journal of Personality and Social Psychology 39(6), 1161–1178 (1980)
Ryyänen, M., Klapuri, A.: Automatic bass line transcription from streaming polyphonic audio. In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Honolulu, Hawaii, USA (2007)
Ryynnen, M.P., Klapuri, A.P.: Automatic transcription of melody, bass line, and chords in polyphonic music. Computer Music Journal 32, 72–86 (2008)
Saunders, C., Hardoon, D.R., Shawe-Taylor, J., Widmer, G.: Using string kernels to identify famous performers from ther playing style. In: Proceedings of the 15th European Conference on Machine Learning (ECML). Pisa, Italy (2004)
Schein, A.I., Popescul, R., Ungar, L.H., Pennock, D.M.: Methods and metrics for cold-start recommendations. In: Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. Tampere, Finland (2002)
Schuller, B., Eyben, F., Rigoll, G.: Tango or waltz?: Putting ballroom dance style into tempo detection. EURASIP Journal on Audio, Speech, and Music Processing (JASMP) 2008(6), 1–12 (2008)
Serra, J., Gomez, E.: Audio cover song identification based on tonal sequence alignment. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal processing (ICASSP). Las Vegas, USA (2008)
Sethares, W., Staley, T.: Meter and periodicity in musical performance. Journal of New Music Research 30(2), 149–158 (2001)
Shao, X., Xu, C., Kankanhalli, M.: Unsupervised classification of music genre using hidden markov model. In: Proceedings of the IEEE International Conference on Multimedia and Expo (ICME). Edinburgh, Scotland,United Kingdom (2004)
Smith, G.: Tagging: People-Powered Metadata for the Social Web. New Riders, Berkeley, CA, USA (2008)
Sordo, M., Celma, Ó., Blech, M., Guaus, E.: The quest for musical genres: Do the experts and the wisdom of crowds agree? In: Proceedings of the Ninth International Conference on Music Information Retrieval (ISMIR). Philadelphia, Pennsylvania, USA (2008)
Tellegen, A., Watson, D., Clark, L.: On the dimensional and hierarchical structure of affect. Psychological Science 10, 297–303 (1999)
Thayer, R.: The Biopsychology of Mood and Arousal. Oxford University Press (1989)
Tiemann, M., Pauws, S., Vignoli, F.: Ensemble learning for hybrid music recommendation. In: Proceedings of the 8th International Conference on Music Information Retrieval (ISMIR). Vienna, Austria (2007)
Tolos, M., Tato, R., Kemp, T.: Mood-based navigation through large collections of musical data. In: 2nd IEEE Consumer Communications and Networking Conference. Las Vegas, Nevada, USA (2005)
Torrens, M., Hertzog, P., Arcos, J.: Visualizing and exploring personal music libraries. In: Proceedings of the International Conference on Music Information Retrieval (ISMIR). Barcelona, Spain (2004)
Trohidis, K., Tsoumakas, G., Kalliris, G., Vlahavas, I.: Multilabel classification of music into emotions. In: Proceedings of the 9th International Conference on Music Information Retrieval (ISMIR). Philadelphia, Pennsylvania, USA (2008)
Tsai, W., Wang, H.: Automatic singer recognition of popular music recordings via estimation and modeling of solo vocal signals. IEEE Transactions on Audio, Speech, and Language Processing 14(1), 330–431 (2006)
Tzanetakis, G.: Manipulation, analysis and retrieval systems for audio signals. Ph.D. thesis, Princeton University, NJ, USA (2002)
Tzanetakis, G., Cook, P.: Musical genre classification of audio signals. IEEE transactions on Speech and Audio Processing 10(5), 293–302 (2002)
Uhle, C.: Automatisierte extraktion rhythmischer merkmale zur anwendung in music information retrieval-systemen. Ph.D. thesis, Ilmenau University, Ilmenau, Germany (2008)
Wang, M., Zhang, N., Zhu, H.: User-adaptive music emotion recognition. In: 7th International Conference on Signal Processing, vol. 2, pp. 1352–1355 (2004)
Webb, A.: Statistical Pattern Recognition, 2nd edn. John Wiley and Sons Ltd. (2002)
West, K., Cox, S.: Features and classifiers for the automatic classification of musical audio signals. In: Proceedings of the 5th International Conference on Music Information Retrieval (ISMIR). Barcelona, Spain (2004)
Wolter, K., Bastuck, C., Gärtner, D.: Adaptive user modeling for content-based music retrieval. In: Proceedings of the 6th Workshop on Adaptive Multimedia Retrieval (AMR). Paris, France (2008)
Wu, T., Jeng, S.: Probabilistic estimation of a novel music emotion model. In: 14th International Multimedia Modeling Conference. Springer (2008)
Yang, D., Lee, W.: Disambiguating music emotion using software agents. In: Proc. of the International Conference on Music Information Retrieval (ISMIR). Barcelona, Spain (2004)
Yoshii, K., Goto, M.: Music thumbnailer: Visualizing musical pieces in thumbnail images based on acoustic features. In: Proceedings of the 9th International Conference on Music Information Retrieval (ISMIR). Philadelphia, Pennsylvania, USA (2008)
Yoshii, K., Goto, M., Komatani, K., Ogata, T., Okuno, H.G.: Hybrid collaborative and content-based music recommendation using probabilistic model with latent user preferences. In: Proceedings of the 7th International Conference on Music Information Retrieval (ISMIR). Victoria, BC, Canada (2006)
Yoshii, K., Goto, M., Okuno, H.G.: Automatic drum sound description for real-world music using template adaption and matching methods. In: Proceedings of the 5th International Music Information Retrieval Conference (ISMIR). Barcelona, Spain (2004)
Zils, A., Pachet, F.: Features and classifiers for the automatic classification of musical audio signals. In: Proceedings of the 5th International Conference on Music Information Retrieval (ISMIR). Barcelona, Spain (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer Science+Business Media, LLC
About this chapter
Cite this chapter
Brandenburg, K. et al. (2009). Music Search and Recommendation. In: Furht, B. (eds) Handbook of Multimedia for Digital Entertainment and Arts. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-89024-1_16
Download citation
DOI: https://doi.org/10.1007/978-0-387-89024-1_16
Published:
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-89023-4
Online ISBN: 978-0-387-89024-1
eBook Packages: Computer ScienceComputer Science (R0)