Abstract
The speaker recognition scene changes radically when attempts are made to apply modern technology to the problem. Indeed, with the seeming limitless power of electronic hardware and computers, it appears that solutions are but a step away. Yet such may not be the case. For example, many years have passed since the earliest efforts were made to develop machines that would (1) type letters dictated by voice, (2) automatically translate the speech of one language into another, (3) understand spoken speech and (4) identify a person from voice analysis alone. Authors such as Hecker (40) insist that there are no machines which are both as sensitive and as powerful (for these purposes) as the human ear. What Hecker means by “ear” is, of course, the entire auditory sensory system coupled to the brain, with all its sophisticated memory and cognitive functions. He may be correct in his assumptions, but I do not think so. Hence, the issue I will address in this chapter is: can machines/computers be made to operate at least as efficiently as the auditory system for speaker identification purposes? That is, can they be made to mimic these processes or, if not mimic them, at least parallel the recognition task by some other method? Probably so, but the task is not an easy one.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Atal, B. S. (1972) Automatic Speaker Recognition Based on Pitch Contours, J. Acoust. Soc. Amer. 52:1687–1697.
Atal, B. S. (1974) Effectiveness of Linear Prediction Characteristics of the Speech Wave for Automatic Speaker Identification and Verification, J. Acoust. Soc. Amer. 55:1304–1312.
Atal, B. S. (1976) Automatic Recognition of Speakers from Their Voices, Proceed. IEEE 64:460–475.
Bakis, R. and Dixon, N. R. (1982) Toward Speaker-Independent Recognition-by-Synthesis, IEEE Proceed. ICASSP, 566-569.
Basztura, C. S. and Majewski, W. (1978) The Application of Long-Term Analysis of the Zero-Crossing of a Speech Signal in Automatic Speaker Identification, Arch. Acoust. 3:3–15.
Becker, R. W, Clarke, F. R., Poza, F. and Young, J. R. (1973) A Semi-Automatic Speaker Recognition System, Research, LEAA, U.S. Dept of Justice, Washington, DC, 1–37.
Bobrow, D. C. and Klatt, D. H. (1968) A Limited Speech Recognition System, AFIPS Conf. Proceed. Thompson Book Co., Washington, DC, 33:305–318.
Bogner, R. E. (1981) On Talker Verification via Orthogonal Parameters, IEEE Trans. Acoust. Speech Signal Process. ASSP 29:1–12.
Bricker, P. D., Gnanadesikan, R., Mathews, M. V, Pruzansky, S., Tukey P. A., Wachter, K. W. and Warner, J. L. (1971) Statistical Techniques for Talker Identification, Bell System Tech. J. 50:1427–1450.
Bricker, P. D. and Pruzanski, S. (1966) Effects of Stimulus Content and Duration on Talker Identification, J. Acoust. Soc. Amer. 40:1441–1450.
Bunge, E. (1975) Automatic Speaker Recognition by Computers, Proceed., 8th Internat. Cong. Phonetic Sci., Leeds, UK.
Bunge, E. (1977) Automatic Speaker Recognition System Auros for Security Systems and Forensic Voice Identification, Proceed., Internat. Conf. Crime Countermeas., Oxford, UK, 1-8.
Calinski, T., Jassem, W. and Kaczmarck, Z. (1970) Investigation of Vowel Formant Frequencies as Personal Voice Characteristics by Means of Multivariate Analysis of Variance, in Speech Analysis and Synthesis (W. Jassem, Ed.), Warsaw, Poland, 2:7–40.
Carbonell, J. R., Stevens, K. N., Williams, C. E. and Woods, B. (1965) Speaker Identification by a Matching-From-Samples Technique, J. Acoust. Soc. Amer. 40:1205–1206.
Cheun, R. S. (1978) Feature Selection Using Adaptive Learning Network for Text-Independent Speaker Verification, J. Acoust. Soc. Amer. 64:S182.
Clarke, F. R. and Becker, R. W. (1969) Comparison of Techniques for Discriminating Among Talkers, J. Speech Hear. Res. 12:747–761.
Compton, A. J. (1963) Effects of Filtering and Vocal Duration Upon the Identification of Speakers Aurally, J. Acoust. Soc. Amer. 35:1748–1752.
Das, S. K. and Mohn, W. S. (1969) Pattern Recognition in Speaker Verification, Proceed. Joint Comput. Conf., AFIPS Conf., Mondale, NY, 35:721-732.
Das, S. K. and Mohn, W. S. (1971) A Scheme for Speech Processing in Automatic Speaker Verification, IEEE Trans. Audio Electroacoust. AU-19:32–43.
Doddington, G. R. (1970) A Method of Speaker Verification, Unpublished Ph.D. Dissertation, University of Wisconsin.
Doddington, G. R. (1980) Whither Speech Recognition? in Trends in Speech Recognition (W. Lea, Ed.), NY, Prentice-Hall, 556–561.
Doddington, G. R., Hyrick, B. and Beek, B. (1974) Some Results on Speaker Verification Using Amplitude Spectra, J. Acoust. Soc. Amer. 55:S463.
Doherty, E. T. (1976) An Evaluation of Selected Acoustic Parameters for Use in Speaker Identification, J. Phonetics 4:321–326.
Doherty, E. T. and Hollien, H. (1978) Multiple Factor Speaker Identification of Normal and Distorted Speech, J. Phonetics 6:1–8.
Edie, J. and Sebestyen, G. S. (1972) Voice Identification General Criteria, Report RADCTDR-62-278, Rome Air Develp. Ctr., Air Force Systems Command, Griffis AFB, NY.
Endres, W., Bambach, W. and Flosser, G. (1971) Voice Spectrograms as a Function of Age, Voice Disguise and Voice Imitation, J. Acoust. Soc. Amer. 49:1842–1848.
Everett, S. S. (1985) Automatic Speaker Recognition Using Vocoded Speech, IEEE ICASSP CH 2118:383–386.
Feiz, W and DeGeorge, M. (1985) A Speaker Verification System for Access Control, IEEE ICASSP, CH 2118:399–402.
Floyd, W (1964) Voice Identification Techniques, Report RADC-TDR-64-312, Rome Air Develp. Ctr., Air Force Systems Command, Griffis AFB, NY.
Foodman, M. J. (1981) Experiments in Automatic Speaker Verification, Proceed., Carnahan Conf. Crime Countermeasures, Lexington, KY, May.
Furui, S. (1974) An Analysis of Long-Term Variation of Feature Parameters of Speech and Its Application to Talker Recognition, Electronic Comm. Japan A57:880–887.
Furui, S. (1978) Effects of Long-Term Spectral Variability on Speaker Recognition, J. Acoust. Soc. Amer. 64:S183.
Goldstein, U. G. (1976) Speaker-Identifying Features Based on Formant Tracks, J. Acoust. Soc. Amer. 59:176–182.
Gubrynowicz, R. (1973) Application of a Statistical Spectrum Analysis to Automatic Voice Identification, in Speech Analysis and Synthesis (W. Jassem, Ed.), Warsaw, Poland, 3:171–180.
Hair, G. D. and Rekieta, T. W. (1973) Speaker Identification Research Final Report, Research, U.S. Dept. of Justice, LEAA, Washington, DC, 38–74.
Hall, M. (1975) Spectrographic Analysis of Interspeaker and Intraspeaker Variabilities of Professional Mimicry, Unpublished MA Thesis, Michigan State University.
Hargreaves, W. A. and Starkweather, J. A. (1963) Recognition of Speaker Identity, Lang., Speech 6:63–67.
Hazen, B. M. (1972) Speaker Identification Using Spectrograms Made on Different Sound Spectrographs, Unpublished MA Thesis, State University of New York, Buffalo.
Hazen, B. M. (1973) Effects of Differing Phonetic Contexts on Spectrographic Speaker Identification, J. Acoust. Soc. Amer. 54:650–660.
Hecker, M. H. L., Stevens, K. N., von Bismarck, G. and Williams, C. E. (1968) Manifestations of Task-Induced Stress in the Acoustic Speech Signal, J. Acoust. Soc. Amer. 44:993–1001.
Hennessey, J. J. (1970) An Analysis of Voiceprint Identification, Unpublished MA Thesis, Michigan State University.
Hollien, H. (1974) The Peculiar Case of “Voiceprints,” J. Acoust. Soc. Amer. 56:210–213.
Hollien H. (1980) Vocal Indicators of Psychological Stress, in Forensic Psychology and Psychiatry (F. Wright, C. Bahn and R. Reiber, Eds.), New York Academy of Sciences, 47–72.
Hollien, H. (1985) Natural Speech Vectors in Speaker Identification, Proceed., Speech Tech’ 85, New York, Media Dimensions Inc., 331–334.
Hollien, H., Childers, D. G. and Doherty, E. T. (1977) Semi-Automatic Speaker Identification System (SAUSI), Proceed., IEEE, ICASSP 26:768–771.
Hollien, H., Geifer, M. P. and Huntley, R. (1990) The Natural Speech Vector Concept in Speaker Identification, Neue Tend. Amgerwandten, Phonetik III, Hamburg, Helmut Buske, Verlag, 62:71–87.
Hollien, H., Hicks, J. W., Jr. and Oliver, L. H. (1990) A Semiautomatic System for Speaker Identification, Neue Tend. Amgerwandten, Phonetik III, Hamburg, Helmut Buske, Verlag, 62:88–106.
Hollien, H. and McGlone, R. E. (1976) An Evaluation of the “Voiceprint” Technique of Speaker Recognition, Proceed., Carnahan Conf. Crime Counter-measures., 30-45; reprinted in Nat. J. Crim. Def. 2:117-130, 1976 and in Course Handbook, Institute Contin. Legal Ed., Ann Arbor, Michigan, 391-404.
Hollien, H. and Majewski, W. (1977) Speaker Identification by Long-Term Spectra Under Normal and Distorted Speech Conditions, J. Acoust. Soc. Amer. 62:975–980.
Hollien, H., Majewski, W. and Hollien, P. A. (1975) Analysis of F0 as a Speaker Identification Technique, Eighth Internat. Cong. Phonetic Sci., Abstract of Papers, 337.
Hunt, M. (1983) Further Experiments in Text-Independent Speaker Recognition Over Communications Channels, Proceed. ICASSP, Boston, 563-566.
Ichikawa, A., Nakajima, A. and Nakata, K. (1979) Speaker Verification from Actual Telephone Voice, J. Acoust. Soc. Japan 35:63–69.
Iles, M. (1972) Speaker Identification as a Function of Fundamental Frequency and Resonant Frequencies, Unpublished Ph.D. Dissertation, University of Florida.
Jassem, W. (1968) Formant Frequencies as Cues to Speaker Discrimination, in Speech Analysis and Synthesis (W. Jassem, Ed.), Warsaw, Poland, 1:9-41.
Jassem, W., Steffen-Batog, M. and Czajka, S. (1973) Statistical Characteristics of Short-Term Average of Distribution as Personal Voice Features, in Speech Analysis and Synthesis (W. Jassem, Ed.), Warsaw, Poland, 3:209-228.
Jesorsky, P. (1977) Principles of Automatic Speaker Recognition in Natural Lang. Comm. with Computers (L. Bolc, Ed.), 1-15.
Johnson, C. C., Hollien, H. and Hicks, J. W, Jr. (1984) Speaker Identification Utilizing Selected Temporal Speech Features, J. Phonetics 12:319–327.
Kashyap, R. L. (1975) Speaker Recognition from an Unknown Utterance and Speaker Speech Interaction, IEEE Trans. Acoust. Speech Sig. Process. ASSP-24:481–488.
Kersta, L. G. (1962) Voiceprint Identification, Nature 196:1253–1257.
Kosiel, U. (1973) Statistical Analysis of Speaker-Dependent Differences in the Long-Term Average Spectrum of Polish Speech, in Speech Analysis and Synthesis (W. Jassem, Ed.), Warsaw, Poland, 3:180-208.
Ladefoged, P. and Broadbent, D. E. (1957) Information Conveyed by Vowels, J. Acoust. Soc. Amer. 29:98–104.
LaRiviere, C. L. (1971) Some Acoustic and Perceptual Correlates of Speaker Identification, Unpublished Ph.D. Dissertation, University of Florida.
LaRiviere, C. L. (1974) Speaker Identification for Turbulent Portions of Fricatives, Phonetica 29:98–104.
LaRiviere, C. L. (1975) Contributions of Fundamental Frequency and Formant Frequencies to Speaker Identification, Phonetica 31:185–197.
Li, K. P., Dammann, J. E. and Chapman, W. D. (1966) Experimental Studies in Speaker Verification Using an Adaptive System, J. Acoust. Soc. Amer. 40:966–978.
Li, K. P. and Wrench, E. H., Jr. (1983) An Approach to Text-Independent Speaker Recognition with Short Utterances, Proceed. ICASSP, Boston, MA, 555-558.
Luck, J. E. (1969) Automatic Speaker Verification Using Cepstral Measurements, J. Acoust. Soc. Amer. 46:1026–1032.
Lummis, R. C. (1972a) Implementation of an On-Line Speaker Verification Scheme, J. Acoust. Soc. Amer. 52:S181.
Lummis, R. C. (1972b) Speaker Verification: A Step Toward the ‘Checkless’ Society, Bell Laboratories Record 50:254–259.
Lummis, R. C. (1973) Speaker Verification by Computer Using Speech Intensity for Temporal Registration, IEEE Trans. Audio. Electroacoust. AU-21:50–59.
Majewski, W and Hollien, H. (1974) Euclidean Distance Between Long-Term Speech Spectra and a Criterion for Speaker Identification Proceed. Speech Comm. Seminar-74, Stockholm, Sweden, 3:303-310.
Makhoul, J. and Wolf, J. (1973) The Use of a Two-Pole Linear Prediction Model in Speech Recognition, Bolt, Beranek and Newman Report No. 2537, 1-21.
Meeker, W. F. (1967) Speaker Authentication Techniques, Tech. Report ECOM-02526-F, U.S. Army Electronics Command, Ft. Monmouth, NJ.
Meltzer, D. and Lehiste, I. (1972) Vowel and Speaker Identification in Natural and Synthetic Speech, J. Acoust. Soc. Amer. 51:S131.
Ney, H. and Giercoff, R. (1982) Speaker Recognition Using a Feature Welding Technique, Proceed. ICASSP, Paris, 1645-1648.
Obrecht, D. H. (1975) Fingerprints and Voiceprint Identification, Proceed., Eighth Internal. Cong. Phonetic Sci., Leeds, UK.
Preusse, J. W. (1971) Word Recognition and Speaker Authentication Using Amplitude Independent and Time Independent Word Features, Tech. Report, ECOM-3439, U.S. Army Electronics Command, Ft. Monmouth, NJ.
Pruzanski, S. (1963) Pattern Matching Procedure for Automatic Talker Recognition, J. Acoust. Soc. Amer. 35:354–358.
Pruzanski, S. and Mathews, M. W. (1964) Talker-Recognition Procedure Based on Analysis of Variance, J. Acoust. Soc. Amer. 36:2041–2047.
Ramishvili, G. S. (1965) Automatic Recognition of Speaking Persons, Report FTG-TT-65-1079, Air Force Systems Command, Wright-Patterson AFB.
Ramishvilli, G. S. (1966) Automatic Voice Recognition, Engng. Cybernetics, 5:84–90.
Ramishvili, G. S. (1974) Experiments on Automatic Verification of Speakers, Proceed., Second Internal. Conf. Pattern Recognition, Copenhagen, 389-393.
Rosenberg, A. E. (1973) Listener Performance in Speaker Verification Tasks, IEEE Trans. Audio, Electroacoust. AU-21:221–225.
Rosenberg, A. E. (1974) A Practical Implementation of an Automatic Speaker Verification System, Proceed., Eighth Internal. Cong. Acoustics, London, 1:268.
Rosenberg, A. E. (1975) Evaluation of an Automatic Speaker Verification System Over Telephone Lines, J. Acoust. Soc. Amer. 57:S23.
Rosenberg, A. E. (1976) Automatic Speech Verification: A Review, Proceed., IEEE 64:475–487.
Rothman, H. B. (1975) Perceptual (Aural) and Spectrographic Investigation of Speaker Homogeneity, J. Acoust. Soc. Amer. 58:S107.
Sambur, M. R. (1973) Speaker Recognition and Verification Using Linear Prediction Analysis, QPR No. 108, Massachusetts Institute of Technology, 261-268.
Sambur, M. R. (1975) Selection of Acoustic Features for Speaker Identification, IEEE Trans. on Acoustics, Speech and Signal Process. ASSP-23:176–192.
Sambur, M. R. (1976a) Speaker Recognition Using Orthogonal Linear Prediction, IEEE Trans. Acoust. Speech, Signal Process. ASSP-24:283–287.
Sambur, M. R. (1976b) Text-Independent Speaker Recognition Using Orthogonal Linear Prediction, Proceed., IEEE ICASSP, Philadelphia, PA, 727-729.
Scarr, R. W. A. (1971) Speech Recognition by Machine—Art or Science? Electronics and Power, 302-307.
Schwartz, R., Roncos, S. and Berouti, M. (1982) The Application of Probability Density Estimation to Text-Independent Speaker Identification, Proceed., ICASSP, 1649-1652.
Smith, J. E. (1962) Decision-Theoretic Speaker Recognizer, J. Acoust. Soc. Amer. 34:1988.
Steffen-Batog. M., Jassem, W. and Gruszka-Koscielak, H. (1970) Statistical Distribution of Short-Term f0 Values as a Personal Voice Characteristic, in Speech Analysis and Synthesis (W. Jassem, Ed.), Warsaw, Poland, 2:197-208.
Stevens, K. N. (1971) Sources of Inter-and Intra-Speaker Variability in the Acoustic Properties of Speech Sounds, Proceed., Seventh Inter. Cong, of Phonetic Sci., Montreal, 206-232.
Stevens, K. N., Williams, C. E., Carbonell, J. R. and Woods, D. (1968) Speaker Authentication and Identification: A Comparison of Spectrographic and Auditory Presentation of Speech Materials, J. Acoust. Soc. Amer. 44:1596–1607.
Tarnoczy, T. (1961) Uber Das Individuelle Sprach Spectrum, Proceed, Fourth Inter. Cong. Phonetic Sciences, 259-264.
Tosi, O., Oyer, H., Lashbrook, W., Pedrey, C., Nichol, J. and Nash, W. (1972) Experiment on Voice Identification, J. Acoust. Soc. Amer. 51:2030–2043.
Voiers, W (1964) Perceptual Basis of Speaker Identity, J. Acoust. Soc. Amer. 36:1065–1073.
Waldrop, M. M. (1988) A Landmark in Speech Recognition, Science 240:1615.
Wolf, J. J. (1970) Simulation of the Measurement Phase of an Automatic Speaker Recognition System, J. Acoust. Soc. Amer. 47:S83.
Wolf, J. J. (1972) Efficient Acoustic Parameters for Speaker Recognition, J. Acoust. Soc. Amer. 51:2044–2055.
Wolf, J., Krasner, M., Karnofsky, K., Schwartz, R. and Roucos, S. (1983) Further Investigation of Probabilistic Methods For Text-Independent Speaker Identification, Proceed. ICASSP, 551-554.
Young, M. A. and Campbell, R. A. (1967) Effects of Context on Talker Identification, J. Acoust. Soc. Amer. 42:1250–1254.
Yang, M. C. K., Hollien, H. and Huntley, R. (1986) A Speaker Identification System for Field use, Speech Tech’ 86, New York, Media Dimensions, 277–280.
Zalewski, J., Majewski, W. and Hollien, H. (1975) Cross-Correlation Between Long-Term Speech Spectra as a Criterion for Speaker Identification, Acustica 34:20–24.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 1990 Springer Science+Business Media New York
About this chapter
Cite this chapter
Hollien, H. (1990). Machine/Computer Approaches. In: The Acoustics of Crime. Applied Psycholinguistics and Communication Disorders. Springer, Boston, MA. https://doi.org/10.1007/978-1-4899-0673-1_11
Download citation
DOI: https://doi.org/10.1007/978-1-4899-0673-1_11
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4899-0675-5
Online ISBN: 978-1-4899-0673-1
eBook Packages: Springer Book Archive