An AFK-SVD Sparse Representation Approach for Speech Signal Processing

  • Conference paper

Part of the book series: Smart Innovation, Systems and Technologies (SIST, volume 82)

Abstract

Sparse representation is a common issue in many signal processing problems. In speech signal processing, how to represent a speech signal sparsely through dictionary learning, so as to improve transmission efficiency, has attracted considerable attention in recent years. The K-SVD algorithm is a typical dictionary learning method, but it requires the dictionary size to be known before dictionary training. A suitable dictionary size effectively avoids under-representation and over-representation, both of which significantly affect the quality of the reconstructed speech. To tackle this problem, this paper presents an Adaptive dictionary size Feedback filtering K-SVD (AFK-SVD) approach for dictionary learning. The proposed method first selects the dictionary size adaptively, based on speech signal features, prior to dictionary learning, and then filters out the noise caused by over-representation. The approach has two unique features: (1) a learning model is constructed from the training set specifically to determine a range for the dictionary size adaptively; and (2) a two-level feedback filtering measure is developed to remove the speech distortion caused by over-representation. Speech signals from the TIMIT speech data set are used to demonstrate the AFK-SVD approach. Experimental results show that, compared with K-SVD, the proposed AFK-SVD method improves the quality of the reconstructed speech signal by 0.8 in PESQ and by 3 to 7 dB in SNR on average.
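To make the role of the dictionary size concrete, the following is a minimal sketch of plain K-SVD with a fixed, user-chosen number of atoms. It is not the AFK-SVD method of the paper: the adaptive size selection and the two-level feedback filter are not reproduced here. The function names (`omp`, `ksvd`, `snr_db`), the synthetic two-tone signal, the frame length of 32 samples, and the sparsity level of 3 are all assumptions chosen for illustration; the paper itself uses TIMIT speech.

```python
import numpy as np

def omp(D, x, n_nonzero):
    """Greedy Orthogonal Matching Pursuit: code x with at most n_nonzero atoms of D."""
    residual = x.copy()
    support = []
    sol = np.zeros(0)
    for _ in range(n_nonzero):
        k = int(np.argmax(np.abs(D.T @ residual)))  # atom most correlated with residual
        if k in support:
            break
        support.append(k)
        # Least-squares refit of x on all atoms selected so far
        sol, *_ = np.linalg.lstsq(D[:, support], x, rcond=None)
        residual = x - D[:, support] @ sol
    coef = np.zeros(D.shape[1])
    coef[support] = sol
    return coef

def ksvd(X, n_atoms, n_nonzero, n_iter=10, seed=0):
    """Plain K-SVD with a fixed dictionary size n_atoms (columns of X are frames)."""
    rng = np.random.default_rng(seed)
    D = rng.standard_normal((X.shape[0], n_atoms))
    D /= np.linalg.norm(D, axis=0)  # unit-norm atoms
    for _ in range(n_iter):
        # Sparse coding stage
        A = np.column_stack([omp(D, x, n_nonzero) for x in X.T])
        # Dictionary update stage: refit one atom at a time by a rank-1 SVD
        for k in range(n_atoms):
            users = np.flatnonzero(A[k])  # frames that currently use atom k
            if users.size == 0:
                continue
            A[k, users] = 0.0
            E = X[:, users] - D @ A[:, users]  # error without atom k
            U, s, Vt = np.linalg.svd(E, full_matrices=False)
            D[:, k] = U[:, 0]
            A[k, users] = s[0] * Vt[0]
    A = np.column_stack([omp(D, x, n_nonzero) for x in X.T])  # final coding
    return D, A

def snr_db(X, X_hat):
    """Reconstruction SNR in dB: 10*log10(signal energy / error energy)."""
    return 10.0 * np.log10(np.sum(X ** 2) / np.sum((X - X_hat) ** 2))

# Synthetic stand-in for speech frames (assumption: two tones plus light noise).
rng = np.random.default_rng(0)
t = np.arange(32 * 200) / 16000.0
signal = (np.sin(2 * np.pi * 440 * t) + 0.5 * np.sin(2 * np.pi * 1200 * t)
          + 0.05 * rng.standard_normal(t.size))
X = signal.reshape(200, 32).T  # 32-sample frames as columns

for n_atoms in (16, 64):  # the dictionary size must be fixed before training
    D, A = ksvd(X, n_atoms=n_atoms, n_nonzero=3)
    print(n_atoms, "atoms: SNR =", round(snr_db(X, D @ A), 2), "dB")
```

Running the loop with different values of `n_atoms` shows how the reconstruction SNR depends on a size that plain K-SVD cannot choose for itself, which is the gap the abstract says AFK-SVD is meant to address.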

F. Li: The authors would like to acknowledge the National Natural Science Foundation of China (NSFC) (No. 61371193) for its financial support of this research. Financial support from the Science and Technology Department of Shanxi Provincial Government under the international collaboration grant scheme (No. 2015081007) and the Special Talents Projects Grant Scheme (No. 201605D211021) is also appreciated.

Author information

Correspondence to Fenglian Li.

Copyright information

© 2018 Springer International Publishing AG

About this paper

Cite this paper

Li, F., Zhang, X., Zhang, H., Tian, YC. (2018). An AFK-SVD Sparse Representation Approach for Speech Signal Processing. In: Pan, JS., Tsai, PW., Watada, J., Jain, L. (eds) Advances in Intelligent Information Hiding and Multimedia Signal Processing. IIH-MSP 2017. Smart Innovation, Systems and Technologies, vol 82. Springer, Cham. https://doi.org/10.1007/978-3-319-63859-1_23

  • DOI: https://doi.org/10.1007/978-3-319-63859-1_23

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-63858-4

  • Online ISBN: 978-3-319-63859-1

  • eBook Packages: Engineering, Engineering (R0)
