Does k Matter? k-NN Hubness Analysis for Kernel Additive Modelling Vocal Separation

Fano Yela, Delia; Stowell, Dan; Sandler, Mark

doi:10.1007/978-3-319-93764-9_27

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 10891)

Included in the following conference series:

International Conference on Latent Variable Analysis and Signal Separation

Abstract

Kernel Additive Modelling (KAM) is a framework for source separation that explicitly models inherent properties of sound sources to help identify and separate them. KAM separates a given source by applying robust statistics to a selection of time-frequency bins obtained through a source-specific kernel, typically the k-NN function. Even though the parameter k appears to be key to a successful separation, the literature contains little discussion of its influence or optimisation. Here we propose a novel method, based on graph theory statistics, to automatically optimise k in a vocal separation task. We introduce the k-NN hubness as an indicator for finding a tailored k at a low computational cost. We then evaluate our method against the common approach to choosing k, and further discuss the influence and importance of this parameter.
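
The two ingredients named in the abstract, a k-NN kernel followed by a robust statistic, and a hubness measure on the resulting k-NN graph, can be illustrated with a minimal Python sketch. This is not the authors' implementation. It assumes Euclidean distance between spectrogram frames, excludes each frame from its own neighbour list, uses the median as the robust statistic, and measures hubness as the skewness of the k-occurrence (in-degree) distribution, a definition that is common in the hubness literature. The function names are hypothetical, and how the paper turns hubness into a choice of k is not stated in the abstract.

```python
import numpy as np


def knn_indices(frames, k):
    """For each frame (row), return the indices of its k nearest frames.

    Assumption: squared Euclidean distance, self excluded, k < number of frames.
    """
    frames = np.asarray(frames, dtype=float)
    sq = np.sum(frames ** 2, axis=1)
    # Pairwise squared distances via ||a||^2 + ||b||^2 - 2 a.b
    d2 = sq[:, None] + sq[None, :] - 2.0 * frames @ frames.T
    np.fill_diagonal(d2, np.inf)  # a frame is not its own neighbour
    return np.argsort(d2, axis=1)[:, :k]


def knn_median_background(mag, k):
    """Estimate the repeating background of a magnitude spectrogram.

    mag has shape (n_frames, n_bins). Each frame is replaced by the
    bin-wise median over its k nearest frames, then clipped so the
    estimate never exceeds the mixture.
    """
    mag = np.asarray(mag, dtype=float)
    nn = knn_indices(mag, k)
    bg = np.median(mag[nn], axis=1)  # mag[nn] has shape (n_frames, k, n_bins)
    return np.minimum(bg, mag)


def k_occurrence_skewness(nn, n_frames):
    """Skewness of the k-occurrence distribution of a k-NN graph.

    The k-occurrence of a frame is the number of other frames that list
    it among their k nearest neighbours. A strongly positive skewness
    indicates that a few frames act as hubs.
    """
    counts = np.bincount(np.asarray(nn).ravel(), minlength=n_frames)
    centred = counts - counts.mean()
    std = counts.std()
    if std == 0:
        return 0.0  # every frame is chosen equally often: no hubs
    return float(np.mean(centred ** 3) / std ** 3)
```

Under these assumptions, a vocal estimate would be the residual `mag - knn_median_background(mag, k)`, and the hubness of the same graph is `k_occurrence_skewness(knn_indices(mag, k), len(mag))`, so both quantities can be computed from one neighbour search per candidate k.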

D. Fano Yela—This work was funded by EPSRC grant EP/L019981/1.

Author information

Authors and Affiliations

Queen Mary University of London, Mile End Road, London, E1 4NS, UK
Delia Fano Yela, Dan Stowell & Mark Sandler

Corresponding author

Correspondence to Delia Fano Yela.

Editor information

Editors and Affiliations

Paul Sabatier University, Toulouse, France
Yannick Deville
Bar-Ilan University, Ramat Gan, Israel
Sharon Gannot
University of Surrey, Guildford, United Kingdom
Russell Mason
University of Surrey, Guildford, United Kingdom
Mark D. Plumbley
University of Surrey, Guildford, United Kingdom
Dominic Ward

About this paper

Cite this paper

Fano Yela, D., Stowell, D., Sandler, M. (2018). Does k Matter? k-NN Hubness Analysis for Kernel Additive Modelling Vocal Separation. In: Deville, Y., Gannot, S., Mason, R., Plumbley, M., Ward, D. (eds) Latent Variable Analysis and Signal Separation. LVA/ICA 2018. Lecture Notes in Computer Science, vol 10891. Springer, Cham. https://doi.org/10.1007/978-3-319-93764-9_27

DOI: https://doi.org/10.1007/978-3-319-93764-9_27
Published: 06 June 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-93763-2
Online ISBN: 978-3-319-93764-9
eBook Packages: Computer Science, Computer Science (R0)
