Skip to main content

Analysis and Automatic Classification of Some Discourse Particles on a Large Set of French Spoken Corpora

  • Conference paper
  • First Online:
Book cover Statistical Language and Speech Processing (SLSP 2017)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10583))

Included in the following conference series:

Abstract

In French, quite a number of words and expressions are frequently used as discourse particles in spoken language, especially in spontaneous speech. The semantic load of these words or expressions differ whether they are used as discourse particles or not. Therefore, the correct identification of their discourse function remains of great importance. In this paper the distribution of the discourse function (or not discourse function), and of the detailed discourse functions of some of these words, is studied on a large set of French corpora ranging from prepared speech (e.g. storytelling and broadcast news) to spontaneous speech (e.g. interviews and interactions between people). The paper is focused on a subset of discourse particles that are recurrent in the considered corpora. The discourse function of a few thousand occurrences of these words have been manually annotated. A statistical analysis of the functions of the words is presented and discussed with respect to the types of spoken corpora. Finally, some statistics with respect to a few prosodic correlates of the discourse particles are presented, as well as some results of automatic classification and detection of the word function (discourse particle or not) using prosodic features.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Aijmer, K.: Understanding Pragmatic Markers. A Variational Pragmatic Approach. Edinburgh UP, Edinburgh (2006)

    Google Scholar 

  2. Bartkova, K., Bastien, A., Dargnat, M.: How to be a discourse particle? In: Speech Prosody 2016, Boston, USA, pp. 859–863 (2016)

    Google Scholar 

  3. Degand, L., Fagard, B.: Alors between discourse and grammar: the role of syntactic position. Funct. Lang. 18, 19–56 (2011)

    Google Scholar 

  4. Hansen, M.B.M.: Particles at the Semantics-Pragmatics Interface: Synchronic and Diachronic Issues. Elsevier, Amsterdam (2008)

    Google Scholar 

  5. Wichmann, A., Simon-Vandenbergen, A.-A., Aijmer, K.: How prosody reflects semantic change: a synchronic case study of of course. In: Davidse, K., Vandelanotte, L., Cuyckens, H. (eds.) Subjectification, Intersubjectification and Grammaticalization, pp. 103–154. Mouton de Gruyter, Berlin (2010)

    Chapter  Google Scholar 

  6. Brinton, L.J.: Pragmatic Markers in English. Grammaticalization and Discourse Functions. De Gruyter, Berlin (1996)

    Book  Google Scholar 

  7. Degand, L., Cornillie, B., Pietrandrea, P. (eds.): Discourse Markers and Modal Particles: Categorization and Description. John Benjamins, Amsterdam (2013)

    Google Scholar 

  8. Dostie, G.: Pragmaticalisation et marqueurs discursifs. De Boeck/Duculot, Liège (2004)

    Book  Google Scholar 

  9. Hansen, M.B.M.: The Function of Discourse Particles. Benjamins, Amsterdam (1998)

    Book  Google Scholar 

  10. Ducrot, O.: Le Dire et le dit. Editions de Minuit, Paris (1984)

    Google Scholar 

  11. Kleiber, G.: Sémiotique de l’interjection. Langue française 161, 10–23 (2006)

    Google Scholar 

  12. Sperber, D., Wilson, D.: Relevance: Communication and Cognition. Blackwell, Oxford (1986)

    Google Scholar 

  13. Blakemore, D.: Semantic Constraints on Relevance. Blackwell, Oxford (1987)

    Google Scholar 

  14. Denturck, E.: Ètude des marqueurs discursifs - L’exemple de “quoi”. Master Diss., Gent University (2008)

    Google Scholar 

  15. Fernandez-Vest, J.: Les particules énonciatives dans la construction du discours. Presses Universitaires de France, Paris (1994)

    Google Scholar 

  16. Galliano, S., Gravier, G., Chaubard, L.: The ESTER 2 evaluation campaign for rich transcription of French broadcasts. In: INTERSPEECH 2009, 10th Annual Conference of the International Speech Communication Association, Brighton, UK, pp. 2583–2586 (2009)

    Google Scholar 

  17. ORFEO project: http://www.projet-orfeo.fr/

  18. French oral narrative: http://frenchoralnarrative.qub.ac.uk

  19. CFPP2000: http://cfpp2000.univ-paris3.fr/

  20. Branca-Rosoff, S., Fleury, S., Lefeuvre, F., Pires, M.: Discours sur la ville. Présentation du Corpus de Français Parlé Parisien des années 2000 (CFPP 2000)

    Google Scholar 

  21. C-ORAL-ROM: http://lablita.dit.unifi.it/corpora/descriptions/coralrom/

  22. Cresti, E., do Nascimento, F. B., Moreno-Sandoval, A., Veronis, J., Martin, P., Choukri, K.: The C-ORAL-ROM CORPUS. A multilingual resource of spontaneous speech for romance languages. In: LREC 2004, 4th International Conference on Language Resources and Evaluation, Lisbon, Portugal (2004)

    Google Scholar 

  23. CRFP: http://www.up.univ-mrs.fr/delic/corpus/index.html

  24. Delic team: Autour du Corpus de référence du français parlé. Recherches sur le français parlé, no. 18, Publications de l’université de Provence, 265 p. (2004)

    Google Scholar 

  25. TUFS: http://www.tufs.ac.jp/ts/personal/ykawa/art/2014_Waseda_Corpus_TUFS.pdf

  26. Valibel: http://www.uclouvain.be/81834.html

  27. CLAPI: http://clapi.ish-lyon.cnrs.fr/

  28. FLEURON: https://apps.atilf.fr/fleuron2/

  29. TCOF: http://www.cnrtl.fr/corpus/tcof/

  30. OFROM: http://www.unine.ch/ofrom

  31. Avanzi, M., Béguelin, M.-J., Diémoz, F.: Présentation du corpus OFROM - corpus oral de français de Suisse romande. Université de Neuchâtel, Switzerland (2012–2015)

    Google Scholar 

  32. Bechet, F., Maza, B., Bigouroux, N., Bazillon, T., El-Beze, M., De Mori, R., Arbillot, E.: DECODA: a call-centre human-human spoken conversation corpus. In: LREC 2012, 8th International Conference on Language Resources and Evaluation, Istanbul, Turkey (2012)

    Google Scholar 

  33. Stede, M., Schmitz, B.: Discourse particles and discourse functions. Mach. Transl. 15(1–2), 125–147 (2000)

    Article  MATH  Google Scholar 

  34. Dargnat, M., Bartkova, K., Jouvet, D.: Discourse particles in French: prosodic parameters extraction and analysis. In: SLSP 2015, International Conference on Statistical Language and Speech Processing, Budapest, Hungary (2015)

    Google Scholar 

  35. Bartkova, K., Jouvet, D.: Automatic detection of the prosodic structures of speech utterances. In: Železný, M., Habernal, I., Ronzhin, A. (eds.) SPECOM 2013. LNCS, vol. 8113, pp. 1–8. Springer, Cham (2013). doi:10.1007/978-3-319-01931-4_1

    Chapter  Google Scholar 

  36. Martin, P.: Prosodic and rhythmic structures in French. Linguistics 25, 925–949 (1987)

    Article  Google Scholar 

  37. Keras: https://keras.io/

  38. Talkin, D.: A robust algorithm for pitch tracking (RAPT). In: Kleijn, W.B., Paliwal, K.K. (eds.) Speech Coding and Synthesis, pp. 495–518. Elsevier, Amsterdam (1995)

    Google Scholar 

  39. SPTK: http://sp-tk.sourceforge.net/

Download references

Acknowledgments

This work has been carried out in the framework of the ProsodCorpus operation supported by the CPER LCHN (Contrat Plan Etat Région “Langues, Connaissances et Humanités Numériques”). Some experiments presented in this paper have been carried out using the Grid’5000 testbed, supported by a scientific interest group hosted by Inria and including CNRS, RENATER and several Universities as well as other organizations (see https://www.grid5000.fr).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Denis Jouvet .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Jouvet, D., Bartkova, K., Dargnat, M., Lee, L. (2017). Analysis and Automatic Classification of Some Discourse Particles on a Large Set of French Spoken Corpora. In: Camelin, N., Estève, Y., Martín-Vide, C. (eds) Statistical Language and Speech Processing. SLSP 2017. Lecture Notes in Computer Science(), vol 10583. Springer, Cham. https://doi.org/10.1007/978-3-319-68456-7_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-68456-7_3

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-68455-0

  • Online ISBN: 978-3-319-68456-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics