Advertisement

An AFK-SVD Sparse Representation Approach for Speech Signal Processing

  • Fenglian LiEmail author
  • Xueying Zhang
  • Hongle Zhang
  • Yu-Chu Tian
Conference paper
Part of the Smart Innovation, Systems and Technologies book series (SIST, volume 82)

Abstract

Sparse representation is a common issue in many signal processing problems. In speech signal processing, how to sparsely represent a speech signal by dictionary learning for improving transmission efficiency has attracted considerable attention in recent years. K-SVD algorithm for dictionary learning is a typical method. But it requires to know the dictionary size prior to dictionary training. A suitable dictionary size can effectively avoid the problem of under-representation or over-representation, which affects the quality of reconstruction speech significantly. To tackle this problem, an Adaptive dictionary size Feedback filtering K-SVD (AFK-SVD) approach is presented in this paper for dictionary leaning. The proposed method first selects the dictionary size adaptively based on the speech signal feasure prior to dictionary learning, and then filters out the noise caused by over-representation. The approach has two unique features: (1) a learning model is constructed based on the training set specifically for adaptive determination of a range of the dictionary size; and (2) a two-level feedback filter measure is developed for removal of speech distortion caused by over-representation. The speech signals from TIMIT speech data sets are used to demonstrate the presented AFK-SVD approach. Experimental results showed that, in comparison with K-SVD, the proposed AFK-SVD method can improve the quality of the reconstructed speech signal in PESQ by 0.8 and SNR by 3 - 7 dB in average.

Keywords

AFK-SVD Speech signal sparse representation Dictionary learning Dictionary size Feedback filter 

References

  1. 1.
    Aharon, M., Elad, M., Bruckstein, A.: \( rm k \)-SVD: an algorithm for designing overcomplete dictionaries for sparse representation. IEEE Trans. Signal Process. 54(11), 4311–4322 (2006)CrossRefGoogle Scholar
  2. 2.
    Zhou, J., Wang, J.: Fabric defect detection using adaptive dictionaries. Text. Res. J. 83(17), 1846–1859 (2013)CrossRefGoogle Scholar
  3. 3.
    Bierman, R., Singh, R.: Influence of dictionary size on the lossless compression of microarray images Twentieth IEEE International Symposium on Computer-Based Medical Systems: CBMS 2007. IEEE (2007)Google Scholar
  4. 4.
    Sun, Y., Gomez, F., Schmidhuber, J.: On the size of the online kernel sparsification dictionary. arXiv preprint arXiv: 1206.4623 (2012)
  5. 5.
    Zhou, Y., et al.: Immune K-SVD algorithm for dictionary learning in speech denoising. Neurocomputing 137, 223–233 (2014)CrossRefGoogle Scholar
  6. 6.
    Zhou, Y., Zhao, H., Lie, P.X.: Detection from speech analysis based on K–SVD deep belief network model. In: International Conference on Intelligent Computing, pp. 189–196. Springer (2015)Google Scholar
  7. 7.
    Tjoa, S.K., et al.: Harmonic variable-size dictionary learning for music source separation. In: 2010 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP). IEEE (2010)Google Scholar

Copyright information

© Springer International Publishing AG 2018

Authors and Affiliations

  • Fenglian Li
    • 1
    Email author
  • Xueying Zhang
    • 1
  • Hongle Zhang
    • 1
  • Yu-Chu Tian
    • 1
    • 2
  1. 1.College of Information EngineeringTaiyuan University of TechnologyTaiyuanChina
  2. 2.School of Electrical Engineering and Computer ScienceQueensland University of TechnologyBrisbaneAustralia

Personalised recommendations