Detection of Vocal Fold Paralysis and Edema Using Linear Discriminant Classifiers

Ziogas, Euthymius; Kotropoulos, Constantine

doi:10.1007/11752912_45

Euthymius Ziogas²² &
Constantine Kotropoulos²²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3955))

Included in the following conference series:

Hellenic Conference on Artificial Intelligence

1747 Accesses

Abstract

In this paper, a two-class pattern recognition problem is studied, namely the automatic detection of speech disorders such as vocal fold paralysis and edema by processing the speech signal recorded from patients affected by the aforementioned pathologies as well as speakers unaffected by these pathologies. The data used were extracted from the Massachusetts Eye and Ear Infirmary database of disordered speech. The linear prediction coefficients are used as input to the pattern recognition problem. Two techniques are developed. The first technique is an optimal linear classifier design, while the second one is based on the dual-space linear discriminant analysis. Two experiments were conducted in order to assess the performance of the techniques developed namely the detection of vocal fold paralysis for male speakers and the detection of vocal fold edema for female speakers. Receiver operating characteristic curves are presented. Long-term mean feature vectors are proven very efficient in detecting the voice disorders yielding a probability of detection that may approach 100% for a probability of false alarm equal to 9.52%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Quek, F., Harper, M., Haciahmetoglou, Y., Chen, L., Ramig, L.O.: Speech pauses and gestural holds in Parkinson ’s Disease. In: Proc. 2002 Int. Conf. Spoken Language Processing, pp. 2485–2488 (2002)
Google Scholar
Will, L., Ramig, L.O., Spielman, J.L.: Application of Lee Silverman Voice Treatment (LSVT) to individuals with multiple sclerosis, ataxic dysarthria, and stroke. In: Proc. 2002 Int. Conf. Spoken Language Processing, pp. 2497–2500 (2002)
Google Scholar
Spielman, J.L., Ramig, L.O., Borod, J.C.: Oro-facial changes in Parkinson’s Disease following intensive voice therapy (LSVT). In: Proc. 2002 Int. Conf. Spoken Language Processing, pp. 2489–2492 (2002)
Google Scholar
Parsa, V., Jamieson, D.G.: Interactions between speech coders and disordered speech. Speech Communication 40(7), 365–385 (2003)
Article Google Scholar
www.emedicine.com/ent/byname/vocal-fold-paralysis-unilateral.htm
Gavidia-Ceballos, L., Hansen, J.H.L.: Direct speech feature estimation using an iterative EM algorithm for vocal fold pathology detection. IEEE Trans. Biomedical Engineering 43, 373–383 (1996)
Article Google Scholar
Dibazar, A.A., Narayanan, S., Berger, T.W.: Feature analysis for automatic detection of pathological speech. In: Proc. Engineering Medicine and Biology Symposium 2002, vol. 1, pp. 182–183 (2002)
Google Scholar
Rosa, M.O., Pereira, J.C., Grellet, M.: Adaptive estimation of residue signal for voice pathology diagnosis. IEEE Trans. Biomedical Engineering 47, 96–104 (2000)
Article Google Scholar
Marinaki, M., Kotropoulos, C., Pitas, I., Maglaveras, N.: Automatic detection of vocal fold paralysis and edema. In: Proc. 2004 Int. Conf. Spoken Language Processing (2004)
Google Scholar
Nayak, J., Bhat, P.S.: Identification of voice disorders using speech samples. In: Proc. IEEE TenCon 2003, vol. 395 (2003)
Google Scholar
Gómez, P., Godino, J.I., Rodríguez, F., Díaz, F., Nieto, V., Álvarez, A., Rodellar, V.: Evidence of vocal cord pathology from the mucosal wave cepstral contents. In: Proc. 2004 IEEE Int. Conf. Acoustics, Speech, and Signal Processing, vol. 5, pp. 437–440 (2004)
Google Scholar
Fukunaga, K.: Introduction in Statistical Pattern Recognition, 2nd edn. Academic Press, San Diego CA (1990)
MATH Google Scholar
Tang, X., Wang, W.: Dual space linear discriminant analysis for face recognition. In: Proc. 2004 IEEE Computer Society Conf. Computer Vision and Pattern Recognition, pp. 1064–1068 (2004)
Google Scholar
Voice and Speech Laboratory, Massachusetts Eye and Ear Infirmary, Boston MA, Voice Disorders Database, 1.03 edition, Kay Elemetrics Corp. (1994)
Google Scholar
Deller, J.R., Proakis, J.G., Hansen, J.H.L.: Discrete Time Processing of Speech Signals. MacMillan Publishing Company, NY (1993)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Informatics, Aristotle University of Thessaloniki, Thessaloniki, 54124, Greece
Euthymius Ziogas & Constantine Kotropoulos

Authors

Euthymius Ziogas
View author publications
You can also search for this author in PubMed Google Scholar
Constantine Kotropoulos
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Computer Science Department of University of Crete, Greece
Grigoris Antoniou
Institute of Computer Science, Foundation for Research & Technology – Hellas (FORTH), Vassilika Vouton, P.O. Box 1385, 71110, Heraklion, Greece
George Potamias
Institute of Informatics and Telecommunications, NCSR "Demokritos", 15310 A., Paraskevi Attikis, Greece
Costas Spyropoulos
Institute of Computer Science, FO.R.T.H., Vassilika Vouton, P.O. Box 1385, GR 71110, Heraklion, Greece
Dimitris Plexousakis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ziogas, E., Kotropoulos, C. (2006). Detection of Vocal Fold Paralysis and Edema Using Linear Discriminant Classifiers. In: Antoniou, G., Potamias, G., Spyropoulos, C., Plexousakis, D. (eds) Advances in Artificial Intelligence. SETN 2006. Lecture Notes in Computer Science(), vol 3955. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11752912_45

Download citation

DOI: https://doi.org/10.1007/11752912_45
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-34117-8
Online ISBN: 978-3-540-34118-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics