An AFK-SVD Sparse Representation Approach for Speech Signal Processing

Li, Fenglian; Zhang, Xueying; Zhang, Hongle; Tian, Yu-Chu

doi:10.1007/978-3-319-63859-1_23

Conference paper
First Online: 18 July 2017

Part of the book series: Smart Innovation, Systems and Technologies (SIST, volume 82)

Abstract

Sparse representation is a common issue in many signal processing problems. In speech signal processing, how to sparsely represent a speech signal by dictionary learning to improve transmission efficiency has attracted considerable attention in recent years. The K-SVD algorithm is a typical dictionary learning method, but it requires the dictionary size to be known before dictionary training. A suitable dictionary size can effectively avoid under-representation or over-representation, either of which significantly affects the quality of the reconstructed speech. To tackle this problem, an Adaptive dictionary size Feedback filtering K-SVD (AFK-SVD) approach is presented in this paper for dictionary learning. The proposed method first selects the dictionary size adaptively based on the speech signal feature prior to dictionary learning, and then filters out the noise caused by over-representation. The approach has two unique features: (1) a learning model is constructed from the training set specifically for adaptive determination of a range of the dictionary size; and (2) a two-level feedback filter measure is developed to remove speech distortion caused by over-representation. Speech signals from the TIMIT speech data set are used to demonstrate the presented AFK-SVD approach. Experimental results show that, in comparison with K-SVD, the proposed AFK-SVD method improves the quality of the reconstructed speech signal by 0.8 in PESQ and by 3 to 7 dB in SNR on average.
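The abstract names two ingredients without giving details: sparse coding of a signal over a dictionary, and SNR as a measure of reconstruction quality. The following is a minimal sketch of both, assuming a fixed random dictionary and plain orthogonal matching pursuit (OMP). It is not the AFK-SVD method and does no dictionary learning. The function names `omp` and `snr_db`, the dictionary size, and all numeric values are hypothetical choices made for illustration.

```python
import numpy as np


def omp(D, x, n_nonzero):
    """Orthogonal matching pursuit (illustrative, not AFK-SVD).

    Approximates x as D @ coef using at most n_nonzero atoms (columns of D).
    D is assumed to have unit-norm columns.
    """
    coef = np.zeros(D.shape[1])
    residual = x.astype(float).copy()
    support = []
    for _ in range(n_nonzero):
        # Select the atom most correlated with the current residual.
        idx = int(np.argmax(np.abs(D.T @ residual)))
        if idx not in support:
            support.append(idx)
        # Refit all selected atoms jointly by least squares.
        sol, *_ = np.linalg.lstsq(D[:, support], x, rcond=None)
        residual = x - D[:, support] @ sol
        coef[:] = 0.0
        coef[support] = sol
    return coef


def snr_db(x, x_hat):
    """Signal-to-noise ratio of a reconstruction x_hat against x, in dB."""
    noise = x - x_hat
    return 10.0 * np.log10(np.sum(x ** 2) / np.sum(noise ** 2))


# Hypothetical demo: a noisy 2-sparse signal over a random 20 x 40 dictionary.
rng = np.random.default_rng(0)
D = rng.standard_normal((20, 40))
D /= np.linalg.norm(D, axis=0)          # normalise atoms to unit length
x = D[:, [3, 17]] @ np.array([1.5, -0.8]) + 0.01 * rng.standard_normal(20)

coef = omp(D, x, n_nonzero=2)
print("non-zero coefficients:", np.count_nonzero(coef))
print("reconstruction SNR (dB):", snr_db(x, D @ coef))
```

In this sketch the number of atoms in `D` is fixed by hand, which is exactly the prior choice that the abstract says K-SVD requires and AFK-SVD aims to make adaptively.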

F. Li: The authors would like to acknowledge the National Natural Science Foundation of China (NSFC) (No. 61371193) for its financial support of this research. Financial support from the Science and Technology Department of Shanxi Provincial Government under the international collaboration grant scheme (No. 2015081007) and the Special Talents Projects Grant Scheme (No. 201605D211021) is also appreciated.

Author information

Authors and Affiliations

College of Information Engineering, Taiyuan University of Technology, No.79, West Yingze Street, Taiyuan, 030024, Shanxi, China
Fenglian Li, Xueying Zhang, Hongle Zhang & Yu-Chu Tian
School of Electrical Engineering and Computer Science, Queensland University of Technology, GPO Box 2434, Brisbane, QLD, 4001, Australia
Yu-Chu Tian

Corresponding author

Correspondence to Fenglian Li.

Editor information

Editors and Affiliations

Fujian Provincial Key Lab of Big Data Mining and Applications, Fujian University of Technology, Fuzhou, Fujian, China
Jeng-Shyang Pan
Swinburne University of Technology, Hawthorn, Victoria, Australia
Pei-Wei Tsai
Universiti Teknologi Petronas, Teronoh, Malaysia
Junzo Watada
University of Canberra, Bruce, Australian Capital Territory, Australia
Lakhmi C. Jain

About this paper

Cite this paper

Li, F., Zhang, X., Zhang, H., Tian, YC. (2018). An AFK-SVD Sparse Representation Approach for Speech Signal Processing. In: Pan, JS., Tsai, PW., Watada, J., Jain, L. (eds) Advances in Intelligent Information Hiding and Multimedia Signal Processing. IIH-MSP 2017. Smart Innovation, Systems and Technologies, vol 82. Springer, Cham. https://doi.org/10.1007/978-3-319-63859-1_23

DOI: https://doi.org/10.1007/978-3-319-63859-1_23
Published: 18 July 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-63858-4
Online ISBN: 978-3-319-63859-1
eBook Packages: Engineering, Engineering (R0)
