Skip to main content

Forced Alignment of the Phonologie du Français Contemporain Corpus

  • Conference paper
  • First Online:
Statistical Language and Speech Processing (SLSP 2018)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11171))

Included in the following conference series:

  • 583 Accesses

Abstract

The Phonologie du Français Contemporain project is an international, collaborative research effort to create resources for the study of contemporary French phonology. It has produced a large, partially transcribed and annotated corpus of spoken French, consisting of approximately 300 h of recordings, and covering 48 geographical regions (including Metropolitan France, Belgium, Switzerland, Canada, and French-speaking countries of Africa). Following a detailed protocol, speakers read aloud a word list and a short text and engage in guided and spontaneous conversation with an interviewer. The corpus presents several challenges: significant regional accent variation; variable recording quality and different types of environment noise; variation in speaker characteristics (age, sex); and interspersed segments of overlapping speech. In this article, we describe the procedure followed to address these challenges and produce an automatic forced alignment of the corpus at the phone, syllable and token level, starting from the initial transcriptions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Avanzi, M.: A corpus-based approach to French regional prosodic variation. Nouveaux Cahiers de Linguistique Française 31, 309–332 (2014). (Proceedings of the 3rd SWIP)

    Google Scholar 

  2. Bigi, B., Hirst, D.: Speech phonetization alignment and syllabification (SPPAS): a tool for the automatic analysis of speech prosody. In: Proceedings of the 6th Speech Prosody Conference, 22–25 May, Shanghai, China (2012)

    Google Scholar 

  3. Boersma, P., Weenink, D.: Praat: doing phonetics by computer, ver. 6.0.37 (2018). http://www.praat.org

  4. Brognaux, S., Roekhaut, S., Drugman, T., Beaufort, R.: Train & Align: a new online tool for automatic phonetic alignment. In: 2012 IEEE Spoken Language Technology Workshop (SLT), pp. 416–421, December 2012

    Google Scholar 

  5. Christodoulides, G.: Praaline: integrating tools for speech corpus research. In: LREC 2014—Proceedings of the 9th International Conference on Language Resources and Evaluation, 26–31 May, Reykjavik, Iceland, pp. 31–34 (2014). http://www.praaline.org

  6. Christodoulides, G., Barreca, G.: Expériences sur l’analyse morphosyntaxique des corpus oraux avec l’annotateur multi-niveaux DisMo. Corela: Cognition, Représentation, Langage HS-21 (2017). https://journals.openedition.org/corela/4867

  7. Durand, J., Laks, B., Lyche, C.: Phonologie, variation et accents du français. Hermes, Paris (2009)

    Google Scholar 

  8. Durand, J., Lyche, C.: French liaison in the light of corpus data. J. Fr. Lang. Stud. 18(1), 33–66 (2008)

    Google Scholar 

  9. Goldman, J.P.: EasyAlign: an automatic phonetic alignment tool under Praat. In: INTERSPEECH 2011—Proceedings of the 12th Annual Conference of the International Speech Communication Association, 27–31 August , Florence, Italy, pp. 3233–3236 (2011)

    Google Scholar 

  10. Hathout, N., Sajous, F., Calderone, B.: GLÀFF, a large versatile French lexicon. In: LREC 2014—Proceedings of the 9th International Conference on Language Resources and Evaluation, 26–31 May, Reykjavik, Iceland (2014)

    Google Scholar 

  11. Huggins-Daines, D., Kumar, M., Chan, A., Black, A.W., Ravishankar, M., Rudnicky, A.I.: PocketSphinx: a free, real-time continuous speech recognition system for hand-held devices. In: 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, vol. 1, pp. I-I, May 2006

    Google Scholar 

  12. Katsamanis, A., Black, M.P., Georgiou, P.G., Goldstein, L., Narayanan, S.S.: SailAlign: robust long speech-text alignment. In: Proceedings of the Workshop on New Tools and Methods for Very-Large Scale Phonetics Research (2011)

    Google Scholar 

  13. McAuliffe, M., Socolof, M., Mihuc, S., Wagner, M., Sonderegger, M.: Montreal forced aligner: trainable text-speech alignment using Kaldi. In: Proceedings of the 18th Conference of the International Speech Communication Association (2017)

    Google Scholar 

  14. Povey, D., et al.: The Kaldi speech recognition toolkit. In: IEEE 2011 Workshop on Automatic Speech Recognition and Understanding. IEEE Signal Processing Society, December 2011. IEEE Catalog No. CFP11SRW-USB

    Google Scholar 

  15. Walker, W., et al.: Sphinx-4: a flexible open source framework for speech recognition. Technical report, Sun Microsystems Inc., Mountain View, CA, USA (2004)

    Google Scholar 

  16. Young, S.J., Kershaw, D., Odell, J., Ollason, D., Valtchev, V., Woodland, P.: The HTK Book Version 3.4. Cambridge University Press, Cambridge (2006)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to George Christodoulides .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Christodoulides, G. (2018). Forced Alignment of the Phonologie du Français Contemporain Corpus. In: Dutoit, T., Martín-Vide, C., Pironkov, G. (eds) Statistical Language and Speech Processing. SLSP 2018. Lecture Notes in Computer Science(), vol 11171. Springer, Cham. https://doi.org/10.1007/978-3-030-00810-9_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-00810-9_5

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-00809-3

  • Online ISBN: 978-3-030-00810-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics