High-Quality and Low-Complexity Real-Time Voice Changing with Seamless Switching for Digital Imaging Devices

  • Sung Dong Jo
  • Young Han Lee
  • Ji Hun Park
  • Hong Kook Kim
  • Ji Woon Kim
  • Myeong Bo Kim
Conference paper
Part of the Communications in Computer and Information Science book series (CCIS, volume 151)

Abstract

In this paper, we propose a voice changing method for digital imaging devices that supports seamless on/off switching at a low computational complexity. The proposed method combines a waveform similarity overlap-and-add (WSOLA) algorithm with a sampling rate changing technique that operates in the time domain. In addition, the proposed method includes a noise technique in the region where the voice changing mode switches from on to off, and vice versa. Finally, we compare the performance of the proposed method with that of a conventional one in terms of processing time and speech quality. The experiments show that the proposed voice changing method gives a relative complexity reduction of 84.5% on a resource-constrained device with an ARM processor and is preferred over the conventional method by 76%.
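
The combination described above can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`wsola`, `resample_linear`, `change_pitch`), the frame and search sizes, the Hann window, and the linear-interpolation resampler are all assumptions chosen for brevity. The idea shown is only the general one: stretch the signal in time with WSOLA by the desired pitch ratio, then change the sampling rate by the inverse ratio so that the duration returns to roughly the original while the pitch is scaled.

```python
import math

def wsola(x, alpha, frame=256, search=64):
    """Stretch x to about alpha times its length while keeping its pitch."""
    hop = frame // 2                      # synthesis hop: 50% overlap
    last = len(x) - frame                 # last valid segment start in x
    if last < 0:
        raise ValueError("input shorter than one frame")
    # Periodic Hann window: copies shifted by hop sum to 1.
    win = [0.5 - 0.5 * math.cos(2 * math.pi * n / frame) for n in range(frame)]
    n_frames = max(1, (int(len(x) * alpha) - frame) // hop + 1)
    out = [0.0] * ((n_frames - 1) * hop + frame)
    pos = 0                               # start of the segment copied last
    for m in range(n_frames):
        if m > 0:
            target = min(round(m * hop / alpha), last)  # nominal read position
            natural = min(pos + hop, last)              # natural continuation
            best, best_score = target, -math.inf
            lo, hi = max(0, target - search), min(last, target + search)
            for cand in range(lo, hi + 1):
                # Waveform similarity: cross-correlation with the continuation.
                score = sum(x[cand + n] * x[natural + n] for n in range(frame))
                if score > best_score:
                    best, best_score = cand, score
            pos = best
        for n in range(frame):            # windowed overlap-add
            out[m * hop + n] += win[n] * x[pos + n]
    return out

def resample_linear(x, ratio):
    """Resample x by linear interpolation to about len(x) * ratio samples.

    Simplification: no anti-aliasing filter is applied before downsampling.
    """
    out = []
    for i in range(int(len(x) * ratio)):
        p = i / ratio
        k = min(int(p), len(x) - 1)
        frac = p - k
        nxt = x[min(k + 1, len(x) - 1)]
        out.append((1.0 - frac) * x[k] + frac * nxt)
    return out

def change_pitch(x, pitch_ratio, frame=256, search=64):
    """Scale pitch by pitch_ratio while keeping the duration roughly constant."""
    stretched = wsola(x, pitch_ratio, frame, search)      # longer, same pitch
    return resample_linear(stretched, 1.0 / pitch_ratio)  # original length, new pitch
```

For example, `change_pitch(samples, 1.25)` would raise the pitch of a list of float samples by a factor of 1.25. Both stages work directly on the waveform in the time domain, which is the property the abstract relies on for low complexity; a production version would presumably use fixed-point arithmetic and a filtered resampler.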

Keywords

Voice changing, time-scale modification, waveform similarity overlap-and-add (WSOLA), sampling rate change, digital imaging device

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Sung Dong Jo (1)
  • Young Han Lee (1)
  • Ji Hun Park (1)
  • Hong Kook Kim (1)
  • Ji Woon Kim (2)
  • Myeong Bo Kim (2)

  1. School of Information and Communications, Gwangju Institute of Science and Technology (GIST), Gwangju, Korea
  2. Digital Imaging Business, Samsung Electronics, Gyenggi-do, Korea
