Speech Watermarking Based on Coding of the Harmonic Phase

Hernaez, Inma; Saratxaga, Ibon; Ye, Jianpei; Sanchez, Jon; Erro, Daniel; Navas, Eva

doi:10.1007/978-3-319-13623-3_27

Inma Hernaez²³,
Ibon Saratxaga²³,
Jianpei Ye²³,
Jon Sanchez²³,
Daniel Erro^23,24 &
…
Eva Navas²³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8854))

819 Accesses
1 Citations

Abstract

This paper presents a new speech watermarking technique using harmonic modelling of the speech signal and coding of the harmonic phase. We use a representation of the instantaneous harmonic phase which allows straightforward manipulation of its values to embed the digital watermark. The technique converts each harmonic into a communication channel, whose performance is analysed in terms of distortion and BER. The developed tests show that with a simple coding scheme a bit rate of 300bps can be achieved with minimal perceptual distortion and almost zero BER.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Nematollahi, M., Al-Haddad, S.: An overview of digital speech watermarking. International Journal of Speech Technology 16(4), 471–488 (2013)
Article Google Scholar
Bender, W., Gruhl, D., Morimoto, N., Lu, A.: Techniques for data hiding. IBM Syst. J. 35(3-4), 313–336 (1996)
Article Google Scholar
Arnold, M.: Audio watermarking: features, applications and algorithms. In: Proc. of IEEE Int. Conf. on Multimedia and Expo, vol. 2, pp. 1013–1016 (2000)
Google Scholar
Cox, I.J., Miller, M.L., Bloom, J.A., Fridrich, J., Kalker, T.: Digital Watermarking and Steganography, 2nd edn. The Morgan Kaufmann Series in Multimedia Information and Systems. Morgan Kaufmann (2008)
Google Scholar
Bai, Y., Bai, S., Zhu, G., You, C., Liu, B.: A blind audio watermarking algorithm based on fft coeficients quantization. In: Proceedings of the Int. Conf. on Artificial Intelligence and Education (ICAIE), pp. 529–533 (2010)
Google Scholar
Chen, S., Leung, H.: Speech bandwidth extension by data hiding and phonetic classification. In: Proceedings of the IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), vol. 4, pp. IV593–IV596 (2007)
Google Scholar
Sakaguchi, S., Arai, T., Murahara, Y.: The efect of polarity inversion of speech on human perception and data hiding as an application. In: Proceedings of the IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), vol. 2, pp. II917–II920 (2000)
Google Scholar
Hsieh, C.T., Sou, P.Y.: Blind cepstrum domain audio watermarking based on time energy features. In: Proc. of the 14th Int. Conf. on Digital Signal Processing, vol. 2, pp. 705–708 (2002)
Google Scholar
Megías, D., Serra-Ruiz, J., Fallahpour, M.: Efficient self-synchronised blind audio watermarking system based on time domain and FFT amplitude modification. Signal Processing 90(12), 3078–3092 (2010)
Article MATH Google Scholar
Saratxaga, I., Hernaez, I., Pucher, M., Navas, E., Sainz, I.: Perceptual importance of the phase related information in speech. In: Proceedings of the 13th Annual Conference of the International Speech Communication Association, pp. 1448–1451 (2012)
Google Scholar
Ansari, R., Malik, H., Khokhar, A.: Data-hiding in audio using frequency selective phase alteration. In: Proc. of the IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), vol. 5, pp. V389–V392 (2004)
Google Scholar
Dong, X., Bocko, M., Ignjatovic, Z.: Data hiding via phase manipulation of audio signals. In: Proc. of the IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), vol. 5, pp. V377–V380 (2004)
Google Scholar
Liew, P., Armand, M.: Inaudible watermarking via phase manipulation of random frequencies. Multimedia Tools and Applications 35(3), 357–377 (2007)
Article Google Scholar
Kuo, S., Johnston, J.D., Turin, W., Quackenbush, S.R.: Covert audio watermarking using perceptually tuned signal independent multiband phase modulation. In: Proc. of the Int. Conf. on Acoustics, Speech and Signal Processing, vol. II, pp. 1753–1756 (2002)
Google Scholar
Hofbauer, K., Kubin, G., Kleijn, W.B.: Speech Watermarking for Analog Flat-Fading Bandpass Channels. IEEE Trans. on Audio, Speech, and Language Processing 17(8), 1624–1637 (2009)
Article Google Scholar
Chen, S.H., Yu, S.Y., Chang, C.H.: Speech watermarking based on wavelet transform and bch coding. In: Proc. of the IEEE Int. Conf. on Sensor Networks, Ubiquitous and Trustworthy Computing (SUTC), pp. 507–512 (2008)
Google Scholar
Huang, J., Wang, Y., Shi, Y.: A blind audio watermarking algorithm with self-synchronization. In: Proc. of the IEEE Int. Symposium on Circuits and Systems, vol. 3, pp. 627–630 (2002)
Google Scholar
Celik, M., Sharma, G., Tekalp, A.: Pitch and duration modification for speech watermarking. In: Proc. of the IEEE Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP), vol. 2, pp. 17–20 (2005)
Google Scholar
Akhaee, M., Kalantari, N., Marvasti, F.: Robust multiplicative audio and speech watermarking using statistical modelling. In: Proc. of the IEEE Int. Conf. on Communications (ICC), pp. 1–5 (2009)
Google Scholar
Hatada, M., Sakai, T., Komatsu, N., Yamazaki, Y.: Digital watermarking based on process of speech production. In: Proc. SPIE Multimedia Systems and Applications V, vol. 4861, pp. 258–267 (2002)
Google Scholar
Saratxaga, I., Hernaez, I., Erro, D., Navas, E., Sanchez, J.: Simple representation of signal phase for harmonic speech models. Electronics Letters 45(7), 381–383 (2009)
Article Google Scholar
Stylianou, Y.: Harmonic Plus Noise Models for Speech, Combined with Statistical Methods, for Speech and Speaker Modification. Ph.D. thesis, Ecole Nationale Superieure des Telecommunications, Paris, France (1996)
Google Scholar

Download references

Author information

Authors and Affiliations

Aholab (UPV/EHU), ETSI Bilbao, Alda. Urquijo s/n, Bilbao, Spain
Inma Hernaez, Ibon Saratxaga, Jianpei Ye, Jon Sanchez, Daniel Erro & Eva Navas
IKERBASQUE, Alda. Urquijo, 36-5, Bilbao, Spain
Daniel Erro

Authors

Inma Hernaez
View author publications
You can also search for this author in PubMed Google Scholar
Ibon Saratxaga
View author publications
You can also search for this author in PubMed Google Scholar
Jianpei Ye
View author publications
You can also search for this author in PubMed Google Scholar
Jon Sanchez
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Erro
View author publications
You can also search for this author in PubMed Google Scholar
Eva Navas
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

ETSIT, Las Palmas de Gran Canaria, Spain
Juan Luis Navarro Mesa , Eduardo Hernández Pérez , Pedro Quintana Morales , Antonio Ravelo García & Iván Guerra Moreno , , , &
University of Zaragoza, Spain
Alfonso Ortega
Dep. of Electronics, Telecommunications and Informatics Engineering, University of Aveiro, Portugal
António Teixeira
ATVS Biometric Recognition Group,, Universidad Autónoma de Madrid, Spain
Doroteo T. Toledano

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hernaez, I., Saratxaga, I., Ye, J., Sanchez, J., Erro, D., Navas, E. (2014). Speech Watermarking Based on Coding of the Harmonic Phase. In: Navarro Mesa, J.L., et al. Advances in Speech and Language Technologies for Iberian Languages. Lecture Notes in Computer Science(), vol 8854. Springer, Cham. https://doi.org/10.1007/978-3-319-13623-3_27

Download citation

DOI: https://doi.org/10.1007/978-3-319-13623-3_27
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-13622-6
Online ISBN: 978-3-319-13623-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics