Voice Identification Using Nonparametric Density Matching

Higgins, A.; Bahler, L.; Porter, J.

doi:10.1007/978-1-4613-1367-0_9

Voice Identification Using Nonparametric Density Matching

A. Higgins³,
L. Bahler³ &
J. Porter³

Chapter

431 Accesses
1 Citations

Part of the book series: The Kluwer International Series in Engineering and Computer Science ((SECS,volume 355))

Abstract

Text-independent speaker recognition is often based on the premise that acoustic measurements derived from the speech utterances of an individual are characterized by stable, speaker-unique probability density functions (PDFs). This chapter describes a method of comparing speech utterances to determine whether or not the underlying PDFs are the same, hence likely to have been spoken by the same person. The method is independent of assumptions about the form of the PDFs. Based on a conjecture regarding the local relationship between probability density and nearest-neighbor distance, the algorithm is shown to measure global differences between the speakers’ underlying feature distributions. Experimental results are presented for the King telephone database.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

T. Matsui and S. Furui, “Concatenated phoneme models for text-variable speaker recognition,” Proc. ICASSP-93, volume II, pp. 391–394, Minneapolis, April 1993.
Google Scholar
Y. H. Kao, P. K. Rajasekaran, and J. S. Baras, “Robust free-text speaker identification over long distance telephone channels,” Proc. ICASSP-93, Minneapolis, April 1993.
Google Scholar
A. E. Rosenberg, C. H. Lee, and F. K. Soong, “Sub-word unit talker verification using hidden Markov models,” Proc. ICASSP-90, pp. 269–272, Albuquerque, New Mexico, April 1990.
Google Scholar
L. Gillick, J. Baker, J. Baker, J. Bridle, M. Hunt, Y. Ito, S. Lowe, J. Orloff, B. Peskin, R. Roth, and F. Scattone, “Application of large vocabulary continuous speech recognition to topic and speaker identification using telephone speech,” Proc. ICASSP-93, volume II, pp. 471–474, Minneapolis, April 1993.
Google Scholar
C. Olano, “An investigation of spectral match statistics using a phonetically marked data base,” Proc. ICASSP-83, 1983.
Google Scholar
J. D. Markel, B. T. Oshika, and A. H. Gray Jr., “Long-term feature averaging for speaker recognition,” IEEE Trans, on Acoustics, Speech, and Signal Processing, volume ASSP-25, pp. 330–337, 1977.
Article Google Scholar
H. Gish, K. Karnofsky, M, Krasner, S. Roucos, R. Schwartz, and J. Wolf, “Investigation of text-independent speaker identification over telephone channels,” Proc. ICASSP-85, volume 1, pp. 379–382, Tampa, FL, 1985.
Google Scholar
H. Gish, “Robust discrimination in automatic speaker identification,” Proc. ICASSP-90, pp. 289–292, 1990.
Google Scholar
R. Rose and D. Reynolds, “Text independent speaker identification using automatic acoustic segmentation,” Proc. ICASSP-90, pp. 293–296, 1990.
Google Scholar
F. Soong, A. Rosenberg, L. Rabiner, and B. Juang, “A vector quantization approach to speaker recognition,” Proc. ICASSP-85, volume 1, pp. 387–390, Tampa, FL, 1985.
Google Scholar

Download references

Author information

Authors and Affiliations

ITT Aerospace/Communications Division, San Diego, California, 92131, USA
A. Higgins, L. Bahler & J. Porter

Authors

A. Higgins
View author publications
You can also search for this author in PubMed Google Scholar
L. Bahler
View author publications
You can also search for this author in PubMed Google Scholar
J. Porter
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

AT&T Bell Laboratories, Murray Hill, NJ, 07974, USA
Chin-Hui Lee & Frank K. Soong &
School of Microelectronic Engineering, Griffith University, Australia
Kuldip K. Paliwal

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Higgins, A., Bahler, L., Porter, J. (1996). Voice Identification Using Nonparametric Density Matching. In: Lee, CH., Soong, F.K., Paliwal, K.K. (eds) Automatic Speech and Speaker Recognition. The Kluwer International Series in Engineering and Computer Science, vol 355. Springer, Boston, MA. https://doi.org/10.1007/978-1-4613-1367-0_9

Download citation

DOI: https://doi.org/10.1007/978-1-4613-1367-0_9
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4612-8590-8
Online ISBN: 978-1-4613-1367-0
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics