Abstract
This paper presents an audio watermarking scheme in fast Fourier transform (FFT) domain based on singular value decomposition (SVD) and Cartesian-polar transformation (CPT). In our proposed scheme, initially the original audio is segmented into nonoverlapping frames. FFT is applied to each frame and low frequency FFT coefficients are selected. SVD is applied to the selected FFT coefficients of each frame represented in a matrix form. The highest singular values of each frame are selected and are decomposed into two components using CPT. Watermark information is embedded into each of these CPT components using an embedding function. Experimental results indicate that the proposed watermarking scheme is highly robust against various signal processing attacks. In addition, the proposed scheme has a high data payload. Moreover, it outperforms state-of-the-art audio watermarking methods in terms of imperceptibility, robustness, and data payload.
Similar content being viewed by others
References
Al-Nuaimy, W., El-Bendary, M. A. M., Shafik, A., Shawki, F., Abou-El-Azm, A. E., El-Fishawy, N. A., Elhalafawy, S. M., Diab, S. M., Sallam, B. M., El-Samie, F. E. A., & Kazemian, H. B. (2011). An SVD audio watermarking approach using chotic encrypted images. Digital Signal Processing, 21(6), 764–779.
Alghoniemy, M., & Tewfik, A. H. (2004). Geometric invariance in image watermarking. IEEE Transactions on Image Processing, 13(2), 145–153.
Ali, A. H., & Ahmad, M. (2010). Digital audio watermarking based on the discrete wavelet transform and singular value decomposition. European Journal of Scientific Research, 39(1), 6–21.
Bassia, P., Pitas, I., & Nikolaidis, N. (2001). Robust audio watermarking in the time domain. IEEE Transactions on Multimedia, 3(2), 232–241.
Baudry, S., Delaigle, J.-F., Sankur, B., Macq, B., & Maıtre, H. (2001). Analyses of error correction strategies for typical communication channels in watermarking. Signal Processing, 81(6), 1239–1250.
Bender, W., Gruhl, D., Morimoto, N., & Lu, A. (1996). Techniques of data hiding. IBM Systems Journal, 35(3/4), 313–336.
Bhat, V. K., Sengupta, I., & Das, A. (2010). An adaptive audio watermarking based on the singular value decomposition in the wavelet domain. Digital Signal Processing, 20(6), 1547–1558.
Bhat, V. K., Sengupta, I., & Das, A. (2011). An audio watermarking scheme using singular value decomposition and dither-modulation quantization. Multimedia Tools and Applications, 52(2/3), 369–383.
Campisi, P., Kundur, D., & Neri, A. (2004). Robust digital watermarking in the ridgelet domain. IEEE Signal Processing Letters, 11(10), 806–830.
Chan, P. W., Lyu, M. R., & Chin, R. T. (2005). A novel scheme for hybrid digital video watermarking: approach, evaluation and experimentation. IEEE Transactions on Circuits and Systems for Video Technology, 15(12), 1638–1649.
Chen, S. T., Wu, G. D., & Huang, H. N. (2010a). Wavelet-domain audio watermarking scheme using optimisation-based quantization. IET Signal Processing, 4(6), 720–727.
Chen, S. T., Huang, H. N., Chen, C. J., & Wu, G. D. (2010b). Energy-proportion based scheme for audio watermarking. IET Signal Processing, 4(5), 576–587.
Chu, W. C. (2003). DCT-based image watermarking using subsampling. IEEE Transactions on Multimedia, 5(1), 34–38.
Cox, I., Killian, J., Leighton, F., & Shamoon, T. (1997). Secure spread spectrum watermarking for multimedia. IEEE Transactions on Image Processing, 6(12), 1673–1687.
Cox, I., Miller, M., Bloom, J., Fridrich, J., & Kalker, T. (2007). The Morgan Kaufmann series in multimedia information systems. Digital watermarking and steganography. Amsterdam: Elsevier.
El-Samie, F. E. A. (2009). An efficient singular value decomposition algorithm for digital audio watermarking. International Journal of Speech Technology, 12(1), 27–45.
Erçelebi, E., & Batakçı, L. (2009). Audio watermarking scheme based on embedding strategy in low frequency components with a binary image. Digital Signal Processing, 19(2), 265–277.
Erfani, Y., & Siahpoush, S. (2009). Robust audio watermarking using improved TS echo hiding. Digital Signal Processing, 19(5), 809–814.
Fan, M., & Wang, H. (2009). Chaos-based discrete fractional sine transform domain audio watermarking scheme. Computers & Electrical Engineering, 35(3), 506–516.
Kang, H., & Jung, S.-H. (2006). An efficient audio watermark extraction in time domain. Journal of Information Processing Systems, 2(1), 13–17.
Khaldi, K., & Boudraa, A. O. (2013). Audio watermarking via EMD. IEEE Transactions on Audio, Speech, and Language Processing, 21(3), 675–680.
Kim, H. S., & Lee, H. S. (2003). Invariant image watermark using zernike moments. IEEE Transactions on Circuits and Systems for Video Technology, 13(8), 766–775.
Kirovski, D., & Malver, H. S. (2003). Spread-spectrum watermarking of audio signals. IEEE Transactions on Signal Processing, 54(4), 1020–1033.
Lei, B. Y., Soon, I. Y., & Li, Z. (2011). Blind and robust audio watermarking scheme based on SVD-DCT. Signal Processing, 91(8), 1973–1984.
Lie, W.-N., & Chang, L.-C. (2006). Robust high quality time domain audio watermarking based on low frequency amplitude modification. IEEE Transactions on Multimedia, 8(1), 46–59.
Liu, Z., & Huang, J. (2003). Audio watermarking techniques using sinusoidal patterns based on pseudorandom sequences. IEEE Transactions on Circuits and Systems for Video Technology, 13(8), 801–812.
Megías, D., Serra-Ruiz, J., & Fallahpour, M. (2010). Efficient self-synchronized blind audio watermarking system based on time domain and FFT amplitude modification. Signal Processing, 90(12), 3078–3092.
Noorkami, M., & Mersereau, R. M. (2008). Digital video watermarking in p-frames with controlled video bit rate increase. IEEE Transactions on Information Forensics and Security, 3(3), 441–455.
Ozer, H., Sankur, B., & Memon, N. (2005). An SVD-based audio watermarking technique. In Proceedings of the 7th ACM workshop on multimedia and security (pp. 51–56).
Swanson, M. D., Zhu, B., Tewfiq, A. H., & Boney, L. (1998). Robust audio watermarking using perceptual masking. Signal Processing, 66(3), 337–355.
Thiede, T., Treurniet, W. C., Bitto, R., Schmidmer, C., Sporer, T., Beerens, J. G., Colomes, C., Keyhl, M., Stoll, G., Brandenburg, K., & Feiten, B. (2000). PEAQ—the ITU standard for objective measurement of perceived audio quality. Journal of the Audio Engineering Society, 48(1/2), 3–29.
Wang, X.-Y., & Zhao, H. (2006). A novel synchronization invariant audio watermarking scheme based on DWT and DCT. IEEE Transactions on Signal Processing, 54(12), 4835–4840.
Wang, R., Xu, Chen, D. J., & Du, C. (2004). Digital audio watermarking algorithm based on linear predictive coding in wavelet domain. In Proceedings of 7th IEEE international conference on signal processing (Vol. 1, pp. 2393–2396).
Wang, J., Healy, R., & Timoney, J. (2011). A robust audio watermarking scheme based on reduced singular value decomposition and distortion removal. Signal Processing, 91(8), 1693–1708.
Wu, S., Huang, J., Huang, D., & Shi, Y. Q. (2005). Efficiently self-synchronized audio watermarking for assured audio data transmission. IEEE Transactions on Broadcasting, 51(1), 69–76.
Xiang, S., & Huang, J. (2007). Histogram based audio watermarking against time scale modification and cropping attacks. IEEE Transactions on Multimedia, 9(7), 1357–1372.
Xiang, S., Kim, H. J., & Huang, J. (2008). Audio watermarking robust against time scale modification and MP3 compression. Signal Processing, 88(10), 2372–2387.
Acknowledgements
This work was supported by Ministry of Education, Culture, Sports, Science, and Technology (MEXT), Japan.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Dhar, P.K., Shimamura, T. Audio watermarking in transform domain based on singular value decomposition and Cartesian-polar transformation. Int J Speech Technol 17, 133–144 (2014). https://doi.org/10.1007/s10772-013-9214-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10772-013-9214-4