C2SI corpus: a database of speech disorder productions to assess intelligibility and quality of life in head and neck cancers

Abstract

Within the framework of the Carcinologic Speech Severity Index (C2SI) INCa Project, we collected a large database of French speech recordings aiming at validating Disorder Severity Indexes. Such a database will be useful for measuring the impact of oral and pharyngeal cavity cancer on speech production. It will permit to assess patients’ quality of life after treatment. The database is composed of audio recordings from 134 sessions and associated metadata. Several intelligibility and comprehensibility levels of speech functions have been evaluated. Acoustics and prosody have been assessed. Perceptual evaluation rates from both naive and expert juries are being produced. Automatic analyzes are being carried out. It is intended to provide speech therapists and physicians with objective tools, which take into account the intelligibility and comprehensibility of patients which received cancer treatment (surgery and/or radiotherapy and/or chemotherapy). The aim of this paper is to justify the necessity of such a corpus and to present its data collection. This C2SI corpus will be available to the scientific community through the Scientific Interest Group Parolothèque.

This is a preview of subscription content, log in to check access.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Notes

  1. 1.

    http://www.lpl-aix.fr/~lpldev/perceval/.

  2. 2.

    http://voxygen.fr/.

  3. 3.

    https://www.univ-toulouse.fr/actualites/comite-d-ethique-de-recherche-cer.

  4. 4.

    http://www.lpl-aix.fr/~lpldev/perceval/.

  5. 5.

    https://www.parolotheque.fr.

  6. 6.

    http://voxygen.fr/.

References

  1. Astésano, C., Bard, E. G., & Turk, A. (2007). Structural influences on initial accent placement in french. Language and Speech, 50(3), 423–446.

    Article  Google Scholar 

  2. Aura, K. (2012). Protocole d’évaluation du langage fondé sur le traitement de fonctions prosodiques : étude exploratoire de deux patients atteints de gliomes de bas grade en contexte péri-opératoire. Ph.D. thesis, Université Toulouse 2. http://www.theses.fr/2012TOU20110/document.

  3. Balaguer, M., Boisguerin, A., Galtier, A., Gaillard, N., Puech, M., & Woisard, V. (2019). Assessment of impairment of intelligibility and of speech signal after oral cavity and oropharynx cancer. Annales fran caises d’oto-rhino-laryngologie et de pathologie cervico-faciale, 136(5), 355–359. https://doi.org/10.1016/j.anorl.2019.05.012.

    Article  Google Scholar 

  4. Borggreven, P. A., Aaronson, N. K., Verdonck-de Leeuw, I. M., Muller, M. J., Heiligers, M. L., & de Bree, R., et al. (2007). Quality of life after surgical treatment for oral and oropharyngeal cancer: a prospective longitudinal assessment of patients reconstructed by a microvascular flap. Oral oncology, 43(10), 1034–1042.

    Article  Google Scholar 

  5. Brierley, J. D., Gospodarowicz, M. K., & Wittekind, C. (2016). TNM classification of malignant tumours. Hoboken: Wiley.

    Google Scholar 

  6. Cardol, M., Brandsma, J., De Groot, I., van den BOSOE, G., De Haan, R., & De Jong, B. (1999). Handicap questionnaires: what do they assess? Disability and rehabilitation, 21(3), 97–105.

    Article  Google Scholar 

  7. Dwivedi, R. C., St Rose, Rose, Roe, J. W., Chisholm, E., Elmiyeh, B., & Nutting, C. M., et al. (2011). First report on the reliability and validity of speech handicap index in native english-speaking patients with head and neck cancer. Head & neck, 33(3), 341–348.

    Google Scholar 

  8. Enderby, P.M. (1983). Frenchay dysarthria assessment. Pro-ed

  9. Enderby, P.M., & Palmer, R. (2008) FDA-2: Frenchay Dysarthria Assessment: Examiner’s Manual. Pro-ed

  10. Fichaux-Bourin, P., Woisard, V., Grand, S., Puech, M., & Bodin, S. (2009). Validation of a self assessment for speech disorders (phonation handicap index). Revue de laryngologie-otologie-rhinologie, 130(1), 45–51.

    Google Scholar 

  11. Fredouille, C., Ghio, A., Laaridh, I., Lalain, M., & Woisard, V. (2019). Acoustic-phonetic decoding for speech intelligibility evaluation in the context of head and neck cancers. In: Proceedings of Intl Congress of Phonetic Sciences (ICPhS’19). Melbourne, Australia

  12. Galliano, S., Geoffrois, E., Mostefa, D., Choukri, K., Bonastre, J.F., & Gravier, G. (2005). The ester phase ii evaluation campaign for the rich transcription of french broadcast news. In: Ninth European Conference on Speech Communication and Technology

  13. Ghio, A., Lalain, M., Giusti, L., Pouchoulin, G., Robert, D., Rebourg, M., Fredouille, C., Laaridh, I., & Woisard, V. (2018).Une mesure d’intelligibilité par décodage acoustico-phonétique de pseudo-mots dans le cas de parole atypique. In: XXXII éme Journées d’Etudes sur la Parole 10.21437/JEP.2018-33. https://hal.archives-ouvertes.fr/hal-01770161/file/190996.pdf.

  14. Ghio, A., Pouchoulin, G., Teston, B., Pinto, S., Fredouille, C., & De Looze, C., et al. (2012). How to manage sound, physiological and clinical data of 2500 dysphonic and dysarthric speakers? Speech Communication, 54(5), 664–679.

    Article  Google Scholar 

  15. Laaridh, I., Fredouille, C., Ghio, A., Lalain, M., & Woisard, V. (2018). Automatic Evaluation of Speech Intelligibility Based on i-vectors in the Context of Head and Neck Cancers. In: Interspeech, pp. 2943–2947. ISCA, Hyderabad, India. 10.21437/interspeech.2018-1266. https://hal.archives-ouvertes.fr/hal-01962170.

  16. Laaridh, I., Kheder, W.B., Fredouille, C., & Meunier, C. (2017). Automatic prediction of speech evaluation metrics for dysarthric speech. In: Proc. Interspeech, pp. 1834–1838

  17. Lindblom, B. (1990). Explaining phonetic variation: A sketch of the h&h theory. In: Speech production and speech modelling, vol. 55, pp. 403–439. Springer, Dordrecht https://doi.org/10.1007/978-94-009-2037-8_16

  18. Magne, C., Astésano, C., Lacheret-Dujour, A., Morel, M., Alter, K., & Besson, M. (2005). On-line processing of “pop-out” words in spoken french dialogues. Journal of cognitive neuroscience, 17(5), 740–756.

    Article  Google Scholar 

  19. Middag, C., Clapham, R., Van Son, R., & Martens, J. P. (2014). Robust automatic intelligibility assessment techniques evaluated on speakers treated for head and neck cancer. Computer speech & language, 28(2), 467–482.

    Article  Google Scholar 

  20. Mlynarek, A. M., Rieger, J. M., Harris, J. R., O’Connell, D. A., Al-Qahtani, K. H., & Ansari, K., et al. (2008). Methods of functional outcomes assessment following treatment of oral and oropharyngeal cancer: Review of the literature. Journal of otolaryngology - head and neck surgery, 37(1), 2–10.

    Google Scholar 

  21. Nocaudie, O., Astésano, C., Ghio, A., Lalain, M., & Woisard, V. (2018) .Evaluation de la compréhensibilité et conservation des fonctions prosodiques en perception de la parole de patients post traitement de cancers de la cavité buccale et du pharynx. In: XXXIIe Journées d’Etudes sur la Parole, pp. 196–204

  22. Pisoni, D.B., & Dedina, M.J. (1986). Comprehension of digitally encoded natural speech using a sentence verification task: a first report. Tech. Rep. Progress report 12, Indiana University

  23. Pisoni, D. B., Manous, L. M., & Dedina, M. J. (1987). Comprehension of natural and synthetic speech: Effects of predictability on the verification of sentences controlled for intelligibility. Computer speech & language, 2(3–4), 303–320.

    Article  Google Scholar 

  24. Porcaro, C., Evitts, P., King, N., Hood, C., Campbell, E., & White, L., et al. (2019). Effect of dysphonia and cognitive-perceptual listener strategies on speech intelligibility. Journal of Voice in Press,. https://doi.org/10.1016/j.jvoice.2019.03.013.

    Article  Google Scholar 

  25. Pyo HY, S.H.S. (2007). A study of speech intelligibility affected by voice quality degradation. Communication Sciences & Disorders, 12(2), 256–278 http://www.e-csd.org/journal/view.php?number=326.

  26. Rinkel, R. N., Leeuw, I. M. V., van Reij, E. J., Aaronson, N. K., & Leemans, C. R. (2008). Speech handicap index in patients with oral and pharyngeal cancer: Better understanding of patients’ complaints. Journal for the Sciences and Specialties of the Head and Neck, 30(7), 868–874.

    Article  Google Scholar 

  27. Samuel, A. G. (1981). Phonemic restoration: Insights from a new methodology. Journal of Experimental Psychology: General, 110(4), 474.

    Article  Google Scholar 

  28. Sicard, E., Mauclair, J., & Woisard, V. (2017). Etude de paramètres acoustiques des voix de patients traités pour un cancer orl dans le cadre du projet c2si. In: 7èmes Journées de Phonétique Clinique

  29. Thomas, L., Jones, T. M., Tandon, S., Carding, P., Lowe, D., & Rogers, S. (2009). Speech and voice outcomes in oropharyngeal cancer and evaluation of the university of washington quality of life speech domain. Clinical Otolaryngology, 34(1), 34–42.

    Article  Google Scholar 

  30. Ware, J. E, Jr., & Sherbourne, C. D. (1992). The mos 36-item short-form health survey (sf-36): I. conceptual framework and item selection. Medical Care, 30(6), 473–483.

    Article  Google Scholar 

  31. Warren, R. M., & Warren, R. P. (1970). Auditory illusions and confusions. Scientific American, 223(6), 30–37.

    Article  Google Scholar 

  32. Woisard, V., Espesser, R., Ghio, A., & Duez, D. (2013). De l’intelligibilité à la compréhensibilité de la parole, quelles mesures en pratique clinique? Revue de Laryngologie Otologie Rhinologie, 1(134), 27–33.

    Google Scholar 

  33. Zumbiehl, O. (2010). Evaluation perceptive des dysphonies par la sentence verification task. Master’s thesis, Université Aix-Marseille . Mémoire d’Orthophonie (dir. : Cavé, C. and Ghio, Alain)

Download references

Acknowledgements

Grant 2014-135 from Institut National pour le CAncer (INCa) in 2014, “Sciences Humaines et Sociales, épidémiologie et Santé Publique” call. Lead by Pr Virginie Woisard at University Hospital of Toulouse and Grant ANR-18-CE45-0008 from The French National Research Agency in 2018 RUGBI project “Improving the measurement of intelligibility of pathological production disorders impaired speech” lead by Jérôme Farinas at IRIT. We thank the company Voxygen\(^{1}\) for providing us with their speech synthesis platform necessary for the realization of the corpus DAP.

Author information

Affiliations

Authors

Corresponding author

Correspondence to Jérôme Farinas.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Woisard, V., Astésano, C., Balaguer, M. et al. C2SI corpus: a database of speech disorder productions to assess intelligibility and quality of life in head and neck cancers. Lang Resources & Evaluation (2020). https://doi.org/10.1007/s10579-020-09496-3

Download citation

Keywords

  • Speech intelligibility and comprehensibility
  • Quality of life assessment
  • Speech corpus
  • Pathological speech