Abstract
The Phonologie du Français Contemporain project is an international, collaborative research effort to create resources for the study of contemporary French phonology. It has produced a large, partially transcribed and annotated corpus of spoken French, consisting of approximately 300 h of recordings, and covering 48 geographical regions (including Metropolitan France, Belgium, Switzerland, Canada, and French-speaking countries of Africa). Following a detailed protocol, speakers read aloud a word list and a short text and engage in guided and spontaneous conversation with an interviewer. The corpus presents several challenges: significant regional accent variation; variable recording quality and different types of environment noise; variation in speaker characteristics (age, sex); and interspersed segments of overlapping speech. In this article, we describe the procedure followed to address these challenges and produce an automatic forced alignment of the corpus at the phone, syllable and token level, starting from the initial transcriptions.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Avanzi, M.: A corpus-based approach to French regional prosodic variation. Nouveaux Cahiers de Linguistique Française 31, 309–332 (2014). (Proceedings of the 3rd SWIP)
Bigi, B., Hirst, D.: Speech phonetization alignment and syllabification (SPPAS): a tool for the automatic analysis of speech prosody. In: Proceedings of the 6th Speech Prosody Conference, 22–25 May, Shanghai, China (2012)
Boersma, P., Weenink, D.: Praat: doing phonetics by computer, ver. 6.0.37 (2018). http://www.praat.org
Brognaux, S., Roekhaut, S., Drugman, T., Beaufort, R.: Train & Align: a new online tool for automatic phonetic alignment. In: 2012 IEEE Spoken Language Technology Workshop (SLT), pp. 416–421, December 2012
Christodoulides, G.: Praaline: integrating tools for speech corpus research. In: LREC 2014—Proceedings of the 9th International Conference on Language Resources and Evaluation, 26–31 May, Reykjavik, Iceland, pp. 31–34 (2014). http://www.praaline.org
Christodoulides, G., Barreca, G.: Expériences sur l’analyse morphosyntaxique des corpus oraux avec l’annotateur multi-niveaux DisMo. Corela: Cognition, Représentation, Langage HS-21 (2017). https://journals.openedition.org/corela/4867
Durand, J., Laks, B., Lyche, C.: Phonologie, variation et accents du français. Hermes, Paris (2009)
Durand, J., Lyche, C.: French liaison in the light of corpus data. J. Fr. Lang. Stud. 18(1), 33–66 (2008)
Goldman, J.P.: EasyAlign: an automatic phonetic alignment tool under Praat. In: INTERSPEECH 2011—Proceedings of the 12th Annual Conference of the International Speech Communication Association, 27–31 August , Florence, Italy, pp. 3233–3236 (2011)
Hathout, N., Sajous, F., Calderone, B.: GLÀFF, a large versatile French lexicon. In: LREC 2014—Proceedings of the 9th International Conference on Language Resources and Evaluation, 26–31 May, Reykjavik, Iceland (2014)
Huggins-Daines, D., Kumar, M., Chan, A., Black, A.W., Ravishankar, M., Rudnicky, A.I.: PocketSphinx: a free, real-time continuous speech recognition system for hand-held devices. In: 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, vol. 1, pp. I-I, May 2006
Katsamanis, A., Black, M.P., Georgiou, P.G., Goldstein, L., Narayanan, S.S.: SailAlign: robust long speech-text alignment. In: Proceedings of the Workshop on New Tools and Methods for Very-Large Scale Phonetics Research (2011)
McAuliffe, M., Socolof, M., Mihuc, S., Wagner, M., Sonderegger, M.: Montreal forced aligner: trainable text-speech alignment using Kaldi. In: Proceedings of the 18th Conference of the International Speech Communication Association (2017)
Povey, D., et al.: The Kaldi speech recognition toolkit. In: IEEE 2011 Workshop on Automatic Speech Recognition and Understanding. IEEE Signal Processing Society, December 2011. IEEE Catalog No. CFP11SRW-USB
Walker, W., et al.: Sphinx-4: a flexible open source framework for speech recognition. Technical report, Sun Microsystems Inc., Mountain View, CA, USA (2004)
Young, S.J., Kershaw, D., Odell, J., Ollason, D., Valtchev, V., Woodland, P.: The HTK Book Version 3.4. Cambridge University Press, Cambridge (2006)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Christodoulides, G. (2018). Forced Alignment of the Phonologie du Français Contemporain Corpus. In: Dutoit, T., Martín-Vide, C., Pironkov, G. (eds) Statistical Language and Speech Processing. SLSP 2018. Lecture Notes in Computer Science(), vol 11171. Springer, Cham. https://doi.org/10.1007/978-3-030-00810-9_5
Download citation
DOI: https://doi.org/10.1007/978-3-030-00810-9_5
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-00809-3
Online ISBN: 978-3-030-00810-9
eBook Packages: Computer ScienceComputer Science (R0)