Speech Watermarking

Nematollahi, Mohammad Ali; Vorakulpipat, Chalee; Rosales, Hamurabi Gamboa

doi:10.1007/978-981-10-2095-7_3

Speech Watermarking

Mohammad Ali Nematollahi⁶,
Chalee Vorakulpipat⁶ &
Hamurabi Gamboa Rosales⁷

Chapter
First Online: 09 August 2016

1467 Accesses
1 Citations

Part of the book series: Springer Topics in Signal Processing ((STSP,volume 11))

Abstract

Speech is the most important form of human communication which carries valuable information on who/what/how speaker speaks. Currently, applying speech signal for computer science is growing due to three major reasons [1]. First, speech is easy to be produced, captured, and transmitted as it has a lower cost compared to image. Second, speech signal can be captured from a distance (non-invasive). Third, speech carries other types of information such as emotion, age, and gender.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 129.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Nematollahi, M.A., and S. Al-Haddad. 2015. Distant speaker recognition: An overview. International Journal of Humanoid Robotics 1550032.
Google Scholar
William, S. 2006. Cryptography and network security, 4 edn. Pearson Education India.
Google Scholar
Huang, H.-C., and W.-C. Fang. 2010. Metadata-based image watermarking for copyright protection. Simulation Modelling Practice and Theory 18(4): 436–445.
Article Google Scholar
Huang, H.-C., et al. 2011. Tabu search based multi-watermarks embedding algorithm with multiple description coding. Information Sciences 181(16): 3379–3396.
Article Google Scholar
Faundez-Zanuy, M., J.J. Lucena-Molina, and M. Hagmüller. 2010. Speech watermarking: An approach for the forensic analysis of digital telephonic recordings*. Journal of Forensic Sciences 55(4): 1080–1087.
Article Google Scholar
Faundez-Zanuy, M. 2010. Digital watermarking: New speech and image applications. Advances in Nonlinear Speech Processing, 84–89.
Google Scholar
Faundez-Zanuy, M., M. Hagmüller, and G. Kubin. 2006. Speaker verification security improvement by means of speech watermarking. Speech Communication 48(12): 1608–1619.
Article MATH Google Scholar
Faundez-Zanuy, M., M. Hagmüller, and G. Kubin. 2007. Speaker identification security improvement by means of speech watermarking. Pattern Recognition 40(11): 3027–3034.
Article MATH Google Scholar
Hagmüller, M., et al. 2004. Speech watermarking for air traffic control. Watermark 8(9): 10.
Google Scholar
Hofbauer, K., G. Kubin, and W.B. Kleijn. 2009. Speech watermarking for analog flat-fading bandpass channels. IEEE Transactions on Audio, Speech, and Language Processing 17(8): 1624–1637.
Article Google Scholar
Hofbauer, K., H. Hering, and G. Kubin. 2005. Speech watermarking for the VHF radio channel. In Proceedings of the 4th Eurocontrol innovative research workshop.
Google Scholar
Rabiner, L.R., and R.W. Schafer. 1978. Digital processing of speech signals. Prentice Hall.
Google Scholar
Al-Shoshan, A.I. 2006. Speech and music classification and separation: A review. Journal of King Saud University 19(1): 95–133.
Google Scholar
Flanagan, J.L. 1972. Speech analysis: Synthesis and perception.
Google Scholar
Rabiner, L.R., and R.W. Schafer. 2009. Theory and application of digital speech processing. Preliminary Edition.
Google Scholar
Blamey, P., et al. 1987. Acoustic parameters measured by a formant-estimating speech processor for a multiple-channel cochlear implant. The Journal of the Acoustical Society of America 82(1): 38–47.
Article Google Scholar
Schroeder, M.R., B.S. Atal, and J. Hall. 1979. Optimizing digital speech coders by exploiting masking properties of the human ear. The Journal of the Acoustical Society of America 66(6): 1647–1652.
Article Google Scholar
Taal, C.H., R.C. Hendriks, and R. Heusdens. 2012. A low-complexity spectro-temporal distortion measure for audio processing applications. IEEE Transactions on Audio, Speech, and Language Processing 20(5): 1553–1564.
Article Google Scholar
Swanson, M.D., et al. 1998. Robust audio watermarking using perceptual masking. Signal Processing 66(3): 337–355.
Article MATH Google Scholar
Bassia, P., I. Pitas, and N. Nikolaidis. 2001. Robust audio watermarking in the time domain. IEEE Transactions on Multimedia 3(2): 232–241.
Article Google Scholar
Cvejic, N., A. Keskinarkaus, and T. Seppanen. 2001. Audio watermarking using m-sequences and temporal masking. In IEEE workshop on the applications of signal processing to audio and acoustics, 2001. IEEE.
Google Scholar
Kubin, G., B. Atal, and W. Kleijn. 1993. Performance of noise excitation for unvoiced speech. In Proceedings of IEEE workshop on speech coding for telecommunications, 1993. IEEE.
Google Scholar
Kim, D.-S. 2003. Perceptual phase quantization of speech. IEEE Transactions on Speech and Audio Processing 11(4): 355–364.
Article Google Scholar
Takahashi, A., R. Nishimura, and Y. Suzuki. 2005. Multiple watermarks for stereo audio signals using phase-modulation techniques. IEEE Transactions on Signal Processing 53(2): 806–815.
Article MathSciNet Google Scholar
Malvar, H.S. 1992. Signal processing with lapped transforms. Artech House.
Google Scholar
Malvar, H.S. 1992. Extended lapped transforms: Properties, applications, and fast algorithms. IEEE Transactions on Signal Processing 40(11): 2703–2714.
Article MATH Google Scholar
Shlien, S. 1997. The modulated lapped transform, its time-varying forms, and its applications to audio coding standards. IEEE Transactions on Speech and Audio Processing 5(4): 359–366.
Article Google Scholar
Cox, I.J., et al. 2002. Digital watermarking. Vol. 53. Springer.
Google Scholar
Costa, M.H. 1983. Writing on dirty paper (corresp.). IEEE Transactions on Information Theory 29(3): 439–441.
Google Scholar
Chu, W.C. 2004. Speech coding algorithms: Foundation and evolution of standardized coders. Wiley.
Google Scholar
Arora, S. and S. Emmanuel. 2003. Adaptive spread spectrum based watermarking of speech. In 9th National undergraduate research opportunities programme congress Poster 15.
Google Scholar
Cheng, Q. and J. Sorensen. 2001. Spread spectrum signaling for speech watermarking. In Proceedings (ICASSP’01) IEEE international conference on acoustics, speech, and signal processing, 2001. IEEE.
Google Scholar
Geiser, B. and P. Vary. 2008. High rate data hiding in ACELP speech codecs. In IEEE international conference on acoustics, speech and signal processing, 2008. ICASSP 2008. IEEE.
Google Scholar
Lacy, J., et al. 1998. On combining watermarking with perceptual coding. In Proceedings of the 1998 IEEE international conference on acoustics, speech and signal processing, 1998. IEEE.
Google Scholar
Liu, C.-H. and O.T.-C. Chen. 2004. Fragile speech watermarking scheme with recovering speech contents. In The 2004 47th midwest symposium on circuits and systems, 2004. MWSCAS’04. IEEE.
Google Scholar
Zhe-Ming, L., Y. Bin, and S. Sheng-He. 2005. Watermarking combined with CELP speech coding for authentication. IEICE Transactions On Information And Systems 88(2): 330–334.
Google Scholar
Yan, B., and Y.-J. Guo. 2013. Speech authentication by semi-fragile speech watermarking utilizing analysis by synthesis and spectral distortion optimization. Multimedia Tools And Applications 67(2): 383–405.
Article Google Scholar
Gurijala, A. 2007. Speech watermarking through parametric modeling. ProQuest.
Google Scholar
Chen, S. and H. Leung. 2006. Concurrent data transmission through PSTN by CDMA. In Proceedings of 2006 IEEE international symposium on circuits and systems, 2006. ISCAS 2006. IEEE.
Google Scholar
Malik, H.M., R. Ansari, and A.A. Khokhar. 2007. Robust data hiding in audio using allpass filters. IEEE Transactions on Audio, Speech, and Language Processing 15(4): 1296–1304.
Article Google Scholar
Narimannejad, M. and S.M. Ahadi. 2011. Watermarking of speech signal through phase quantization of sinusoidal model. In 19th Iranian conference on electrical engineering (ICEE), 2011. IEEE.
Google Scholar
Hatada, M., et al. 2002. Digital watermarking based on process of speech production. In ITCom 2002: the convergence of information technologies and communications. International Society for Optics and Photonics.
Google Scholar
Garcia-Hernandez, J.J., M. Nakano-Miyatake, and H. Perez-Meana. 2008. Data hiding in audio signal using rational dither modulation. IEICE Electronics Express 5(7): 217–222.
Article Google Scholar
Al-Haj, A. 2014. An imperceptible and robust audio watermarking algorithm. EURASIP Journal on Audio, Speech, and Music Processing 2014(1): 1–12.
Article Google Scholar
Bhat, V., I. Sengupta, and A. Das. 2010. An adaptive audio watermarking based on the singular value decomposition in the wavelet domain. Digital Signal Processing 20(6): 1547–1558.
Article Google Scholar
Xiang, S. 2011. Audio watermarking robust against D/A and A/D conversions. EURASIP Journal on Advances In Signal Processing 2011: 3.
Article Google Scholar
Özer, H., B. Sankur, and N. Memon. 2005. An SVD-based audio watermarking technique. In Proceedings of the 7th workshop on multimedia and security. ACM.
Google Scholar
Wang, X., W. Qi, and P. Niu. 2007. A new adaptive digital audio watermarking based on support vector regression. IEEE Transactions on Audio, Speech, and Language Processing 15(8): 2270–2277.
Article Google Scholar
Lei, B., et al. 2012. A robust audio watermarking scheme based on lifting wavelet transform and singular value decomposition. Signal Processing 92(9): 1985–2001.
Article Google Scholar
Lei, B.Y., I.Y. Soon, and Z. Li. 2011. Blind and robust audio watermarking scheme based on SVD–DCT. Signal Processing 91(8): 1973–1984.
Article MATH Google Scholar
Hu, H.-T., et al. 2014. Incorporation of perceptually adaptive QIM with singular value decomposition for blind audio watermarking. EURASIP Journal on Advances in Signal Processing 2014(1): 1–12.
Article Google Scholar

Download references

Author information

Authors and Affiliations

National Electronics and Computer Technology Center (NECTEC), Pathumthani, Thailand
Mohammad Ali Nematollahi & Chalee Vorakulpipat
Universidad Autónoma de Zacatecas, Zacatecas, Mexico
Hamurabi Gamboa Rosales

Authors

Mohammad Ali Nematollahi
View author publications
You can also search for this author in PubMed Google Scholar
Chalee Vorakulpipat
View author publications
You can also search for this author in PubMed Google Scholar
Hamurabi Gamboa Rosales
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mohammad Ali Nematollahi .

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Nematollahi, M.A., Vorakulpipat, C., Rosales, H.G. (2017). Speech Watermarking. In: Digital Watermarking . Springer Topics in Signal Processing, vol 11. Springer, Singapore. https://doi.org/10.1007/978-981-10-2095-7_3

Download citation

DOI: https://doi.org/10.1007/978-981-10-2095-7_3
Published: 09 August 2016
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-2094-0
Online ISBN: 978-981-10-2095-7
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics