Skip to main content

Detection of Vocal Fold Paralysis and Edema Using Linear Discriminant Classifiers

  • Conference paper
Advances in Artificial Intelligence (SETN 2006)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3955))

Included in the following conference series:

  • 1747 Accesses

Abstract

In this paper, a two-class pattern recognition problem is studied, namely the automatic detection of speech disorders such as vocal fold paralysis and edema by processing the speech signal recorded from patients affected by the aforementioned pathologies as well as speakers unaffected by these pathologies. The data used were extracted from the Massachusetts Eye and Ear Infirmary database of disordered speech. The linear prediction coefficients are used as input to the pattern recognition problem. Two techniques are developed. The first technique is an optimal linear classifier design, while the second one is based on the dual-space linear discriminant analysis. Two experiments were conducted in order to assess the performance of the techniques developed namely the detection of vocal fold paralysis for male speakers and the detection of vocal fold edema for female speakers. Receiver operating characteristic curves are presented. Long-term mean feature vectors are proven very efficient in detecting the voice disorders yielding a probability of detection that may approach 100% for a probability of false alarm equal to 9.52%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Quek, F., Harper, M., Haciahmetoglou, Y., Chen, L., Ramig, L.O.: Speech pauses and gestural holds in Parkinson ’s Disease. In: Proc. 2002 Int. Conf. Spoken Language Processing, pp. 2485–2488 (2002)

    Google Scholar 

  2. Will, L., Ramig, L.O., Spielman, J.L.: Application of Lee Silverman Voice Treatment (LSVT) to individuals with multiple sclerosis, ataxic dysarthria, and stroke. In: Proc. 2002 Int. Conf. Spoken Language Processing, pp. 2497–2500 (2002)

    Google Scholar 

  3. Spielman, J.L., Ramig, L.O., Borod, J.C.: Oro-facial changes in Parkinson’s Disease following intensive voice therapy (LSVT). In: Proc. 2002 Int. Conf. Spoken Language Processing, pp. 2489–2492 (2002)

    Google Scholar 

  4. Parsa, V., Jamieson, D.G.: Interactions between speech coders and disordered speech. Speech Communication 40(7), 365–385 (2003)

    Article  Google Scholar 

  5. www.emedicine.com/ent/byname/vocal-fold-paralysis-unilateral.htm

  6. Gavidia-Ceballos, L., Hansen, J.H.L.: Direct speech feature estimation using an iterative EM algorithm for vocal fold pathology detection. IEEE Trans. Biomedical Engineering 43, 373–383 (1996)

    Article  Google Scholar 

  7. Dibazar, A.A., Narayanan, S., Berger, T.W.: Feature analysis for automatic detection of pathological speech. In: Proc. Engineering Medicine and Biology Symposium 2002, vol. 1, pp. 182–183 (2002)

    Google Scholar 

  8. Rosa, M.O., Pereira, J.C., Grellet, M.: Adaptive estimation of residue signal for voice pathology diagnosis. IEEE Trans. Biomedical Engineering 47, 96–104 (2000)

    Article  Google Scholar 

  9. Marinaki, M., Kotropoulos, C., Pitas, I., Maglaveras, N.: Automatic detection of vocal fold paralysis and edema. In: Proc. 2004 Int. Conf. Spoken Language Processing (2004)

    Google Scholar 

  10. Nayak, J., Bhat, P.S.: Identification of voice disorders using speech samples. In: Proc. IEEE TenCon 2003, vol. 395 (2003)

    Google Scholar 

  11. Gómez, P., Godino, J.I., Rodríguez, F., Díaz, F., Nieto, V., Álvarez, A., Rodellar, V.: Evidence of vocal cord pathology from the mucosal wave cepstral contents. In: Proc. 2004 IEEE Int. Conf. Acoustics, Speech, and Signal Processing, vol. 5, pp. 437–440 (2004)

    Google Scholar 

  12. Fukunaga, K.: Introduction in Statistical Pattern Recognition, 2nd edn. Academic Press, San Diego CA (1990)

    MATH  Google Scholar 

  13. Tang, X., Wang, W.: Dual space linear discriminant analysis for face recognition. In: Proc. 2004 IEEE Computer Society Conf. Computer Vision and Pattern Recognition, pp. 1064–1068 (2004)

    Google Scholar 

  14. Voice and Speech Laboratory, Massachusetts Eye and Ear Infirmary, Boston MA, Voice Disorders Database, 1.03 edition, Kay Elemetrics Corp. (1994)

    Google Scholar 

  15. Deller, J.R., Proakis, J.G., Hansen, J.H.L.: Discrete Time Processing of Speech Signals. MacMillan Publishing Company, NY (1993)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ziogas, E., Kotropoulos, C. (2006). Detection of Vocal Fold Paralysis and Edema Using Linear Discriminant Classifiers. In: Antoniou, G., Potamias, G., Spyropoulos, C., Plexousakis, D. (eds) Advances in Artificial Intelligence. SETN 2006. Lecture Notes in Computer Science(), vol 3955. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11752912_45

Download citation

  • DOI: https://doi.org/10.1007/11752912_45

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-34117-8

  • Online ISBN: 978-3-540-34118-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics