Abstract
In the Big Data era, stretching a music piece to a given length is a common need in daily life, e.g., in audio–video synchronization and animation production. However, a stretched music piece is not guaranteed to be acceptable to a general audience, because stretching introduces perceptual artifacts. An overstretched music piece is uncomfortable to listen to. Research on music stretching resistance attempts to estimate the maximum stretchability of music pieces so that overstretching can be avoided. It has been observed that musical genre information can significantly improve the accuracy of automatic estimation of music stretching resistance, but how musical genres are related to music stretching resistance has never been explained or studied in detail in the literature. In this paper, the characteristics of music stretching resistance are compared across different musical genres. The experiments show that music stretching resistance has strong intra-genre cohesiveness and inter-genre discrepancies. Moreover, the ambiguity and the symmetry of music stretching resistance are observed in the experimental analysis. These findings lead to a new measure of the similarity between musical genres based on their music stretching resistance. In addition, an analysis of variance (ANOVA) supports these findings by verifying that musical genre has a significant effect on music stretching resistance.
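As an illustration of the kind of test the abstract refers to, the following is a minimal sketch of a one-way ANOVA F statistic with genre as the single factor. It is not the authors' procedure: the function name `one_way_anova_f`, the genre labels, and the stretch-rate values are all hypothetical and chosen only to make the example runnable.

```python
from statistics import mean


def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a one-way ANOVA.

    `groups` is a list of lists; each inner list holds the scores
    observed for one level of the factor (here: one musical genre).
    """
    k = len(groups)                      # number of genres
    n = sum(len(g) for g in groups)      # total number of observations
    grand_mean = mean(x for g in groups for x in g)

    # Variation of the genre means around the grand mean.
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    # Variation of individual scores around their own genre mean.
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)

    df_between, df_within = k - 1, n - k
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within


# Hypothetical maximum acceptable stretch rates per genre (made-up values,
# not data from the paper).
ratings = {
    "classical": [1.30, 1.35, 1.40, 1.32],
    "rock": [1.10, 1.15, 1.20, 1.12],
    "jazz": [1.20, 1.25, 1.30, 1.22],
}

f_stat, df_b, df_w = one_way_anova_f(list(ratings.values()))
print(f"F({df_b}, {df_w}) = {f_stat:.2f}")
```

A large F value relative to the F distribution with the reported degrees of freedom indicates that the between-genre variation is large compared with the within-genre variation. If SciPy is available, `scipy.stats.f_oneway` computes the same statistic together with a p-value.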
Acknowledgements
This work was supported in part by the National Natural Science Foundation of China (No. 61373023) and the Intelligent Manufacturing Comprehensive Standardization and New Pattern Application Project of the Ministry of Industry and Information Technology (Experimental validation of key technical standards for trusted services in industrial Internet). We would like to thank all the volunteers who participated in the listening experiments for their contributions, which form the basis of this paper.
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Chen, J., Wang, C. (2019). Investigating the Role of Musical Genre in Human Perception of Music Stretching Resistance. In: Lee, W., Leung, C. (eds) Big Data Applications and Services 2017. BIGDAS 2017. Advances in Intelligent Systems and Computing, vol 770. Springer, Singapore. https://doi.org/10.1007/978-981-13-0695-2_14
DOI: https://doi.org/10.1007/978-981-13-0695-2_14
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-0694-5
Online ISBN: 978-981-13-0695-2
eBook Packages: Intelligent Technologies and Robotics (R0)