Abstract
This study was made on the base of the ORD corpus of everyday spoken Russian, containing the rich collection of audio recordings made in real-life settings. Speech transcripts of the ORD corpus imply mandatory indication of word and phrase breaks, self-correction, hesitations, fillers and other irregularities of spoken discourse. The paper deals with speech breaks in oral discourse (word breaks, phrase breaks, intraphrasal pauses, etc.). Quantitative analysis performed on the subcorpus of 187 600 tokens has shown that 7,56% of all phrases in everyday communication are not finished. If word breaks can be referred to word search/choice or self-correction, phrase breaks affect the text level and result in ragged, rough, and poorly structured syntactic sequence. Sociolinguistic analysis has revealed that phrase breaks are more frequent in men’s speech than in the women’s (8.16 vs. 7,12%). Seniors have significantly more speech breaks (10,76%) than children (6,78%), youth (6,08%) and middle-aged people (7,37%). As for status groups of speakers, the highest share of breaks is found in speech of unemployed and retired people (10,75%), whereas the lowest percentage of breaks is observed in speech of managers (4,50%) who care, apparently, more about their speech quality than others.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Asinovsky, A., Bogdanova, N., Rusakova, M., Ryko, A., Stepanova, S., Sherstinova, T.: The ORD speech corpus of Russian everyday communication “One Speaker’s Day”: creation principles and annotation. In: Matoušek, V., Mautner, P. (eds.) TSD 2009. LNCS (LNAI), vol. 5729, pp. 250–257. Springer, Heidelberg (2009). https://doi.org/10.1007/978-3-642-04208-9_36
Bogdanova-Beglarian, N., Martynenko, G., Sherstinova, T.: The “One Day of Speech” corpus: phonetic and syntactic studies of everyday spoken Russian. In: Ronzhin, A., Potapova, R., Fakotakis, N. (eds.) SPECOM 2015. LNCS (LNAI), vol. 9319, pp. 429–437. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-23132-7_53
Bogdanova-Beglarian, N., Sherstinova, T., Blinova, O., Ermolova, O., Baeva, E., Martynenko, G., Ryko, A.: Sociolinguistic extension of the ORD corpus of Russian everyday speech. In: Ronzhin, A., Potapova, R., Németh, G. (eds.) SPECOM 2016. LNCS (LNAI), vol. 9811, pp. 659–666. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-43958-7_80
Bogdanova-Beglarian, N., Sherstinova, T., Blinova, O., Martynenko, G.: An exploratory study on sociolinguistic variation of Russian everyday speech. In: Ronzhin, A., Potapova, R., Németh, G. (eds.) SPECOM 2016. LNCS (LNAI), vol. 9811, pp. 100–107. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-43958-7_11
Bogdanova-Beglarian, N., Sherstinova, T., Blinova, O., Martynenko, G.: Corpus “One Speaker’s Day” for studies of sociolinguistic variation of russian colloquial speech, analysis of conversational Russian speech (AC3-2017). In: Skrelin, P., Kocharov, D. (eds.) Proceedings of the 7th Interdisciplinary Seminar, pp. pp. 14–20. Polytechnica-Print, Sankt Petersburg (2017)
Bogdanova-Beglarian, N. (ed.): Everyday Russian Language in Different Social Groups, Collective Monograph. LAIKA, 244p, Sankt Petersburg (2016)
Furui, S., Kikuchi, T., Shinnaka, Y., Hori, Ch.: Speech-to-text and speech-to-speech summarization of spontaneous speech. IEEE Trans. Speech Audio Process. 12(4), 401–408 (2004)
Bogdanova-Beglarian, N. (ed.): Speech Corpus as a Base for Analysis. Collective Monograph. Part 1. Reading. Retelling. Description. Philology Department Publ., Saint-Petersburg (2013)
Bogdanova-Beglarian, N.: Word break in the “disfluency spot” and ways of reacting to a communicative difficulty (2017). http://conference-spbu.ru/backend/36/reports/5145/
Acknowledgements
The research was conducted within the framework of the project, supported by the Russian Scientific Foundation, # 14-18-02070 “Everyday Russian Language in Different Social Groups”.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Bogdanova-Beglarian, N. (2019). Phrase Breaks in Everyday Conversations from Sociolinguistic Perspective. In: Eismont, P., Mitrenina, O., Pereltsvaig, A. (eds) Language, Music and Computing. LMAC 2017. Communications in Computer and Information Science, vol 943. Springer, Cham. https://doi.org/10.1007/978-3-030-05594-3_10
Download citation
DOI: https://doi.org/10.1007/978-3-030-05594-3_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-05593-6
Online ISBN: 978-3-030-05594-3
eBook Packages: Computer ScienceComputer Science (R0)