Abstract
In this study, we investigate which factors influence the linguistic distance of Catalan dialectal pronunciations from standard Catalan. We use pronunciations from three regions where the northwestern variety of the Catalan language is spoken (Catalonia, Aragon and Andorra). In contrast to Aragon, Catalan has an official status in both Catalonia and Andorra, which likely influences standardization. Because we are interested in the potentially large range of differences that standardization might promote, we examine 357 words in Catalan varieties and in particular their pronunciation distances with respect to the standard. In order to be sensitive to differences among the words, we fit a generalized additive mixed-effects regression model to this data. This allows us to examine simultaneously the general (i.e. aggregate) patterns in pronunciation distance and to detect those words that diverge substantially from the general pattern. The results reveal higher pronunciation distances from standard Catalan in Aragon than in the other regions. Furthermore, speakers in Catalonia and Andorra, but not in Aragon, show a clear standardization pattern, with younger speakers having dialectal pronunciations closer to the standard than older speakers. This clearly indicates the presence of a border effect within a single country with respect to word pronunciation distances. Since a great deal of scholarship focuses on single segment changes, we compare our analysis to the analysis of three segment changes that have been discussed in the literature on Catalan. This comparison shows that the pattern observed at the word pronunciation level is supported by two of the three cases examined. As not all individual cases conform to the general pattern, the aggregate approach is necessary to detect global standardization patterns.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
In Andorra, Catalan is the only official language. In Catalonia, where Spanish and Aranese (a variety of Occitan) are also official, Catalan was the vehicular language of education during the 1920s and the 1930s and achieved this status again after Franco’s dictatorship in the early 1980s [1]. That means that all subjects except second and third languages are taught in Catalan in the public schools of Catalonia and Andorra. In Aragon, Catalan has only been a voluntary subject in schools in the eastern counties (where Catalan is spoken) since 1984 [2]. The standard variety used at all schools in these areas is the one sanctioned by the Institut d’Estudis Catalans [3].
- 2.
- 3.
It might be argued that this pattern is due to the fact that the Catalan standard language is mainly based on the eastern dialects of Catalonia. Although it is true that the northwestern varieties of Catalonia and Andorra have historically converged towards the (closer and more prestigious) eastern varieties during the twentieth century, Valls et al. [9] have shown that the standardization process has been much more effective in the diffusion of the prestigious features westwards.
- 4.
While the precise effect of speaker’s year of birth is different for both regions (Aragon, and Catalonia and Andorra) across all three variables, the difference in the effect of this predictor on Aragon as opposed to Catalonia and Andorra was never significant (all p’s > 0.07) due to the small number of locations in Aragon (i.e. eight) and the limited number of words. Therefore, strictly speaking, none of the variables completely adheres to the aggregate pattern (where this difference was significant).
References
Woolard K, Gahng T-J (2008) Changing language policies and attitudes in autonomous Catalonia. Lang Soc 19:311–330
Huguet A, Vila I, Llurda E (2000) Minority language education in unbalanced bilingual situations: A case for the linguistic interdependence hypothesis. J Psycholinguist Res 3:313–333
Fabra P (1918) Gramàtica catalana. Institut d’Estudis Catalans, Barcelona
Woolhiser C (2005) Political borders and dialect divergence/convergence in Europe. In: Auer P, Hinskens F, Kerswill P (eds) Dialect change. Convergence and divergence in European languages. Cambridge University Press, New York, pp 236–262
Goebl H (2000) Langues standards et dialectes locaux dans la France du Sud-Est et l’Italie septentrionale sous le coup de l’effet-frontière: une approche dialectomètrique. Int J Sociol Lang 145:181–215
Pradilla M-À (2008) Sociolingüística de la variació i llengua catalana. Institut d’Estudis Catalans, Barcelona
Pradilla M-À (2008) La tribu valenciana. Reflexions sobre la desestructuració de la comunitat lingüística. Onada, Benicarló
Bibiloni G (2002) Un estàndard nacional o tres estàndards regionals? In: Joan B (ed) Perspectives sociolingüístiques a les Illes Balears. Res Publica, Eivissa
Valls E, Wieling M, Nerbonne J (2013) Linguistic advergence and divergence in north-western Catalan: A dialectometric investigation of dialect leveling and border effects. LLC J Digit Scholarsh Humanit 28(1):119–146
Bailey G, Wikle T, Tillery J, Sand L (1991) The apparent time construct. Lang Var Chang 3:241–264
Wieling M (2012) A quantitative approach to social and geographical dialect variation. PhD thesis, Rijksuniversiteit Groningen
Nerbonne J (2009) Data-driven dialectology. Lang Ling Compass 3(1):175–198
Wieling M, Nerbonne J (2011) Bipartite spectral graph partitioning for clustering dialect varieties and detecting their linguistic features. Comput Speech Lang 25(3):700–715
Wieling M, Nerbonne J (2015) Advances in dialectometry. Ann Rev Linguist 1:243–264
Institut d’Estudis Catalans (1999) Proposta per a un estàndard oral de la llengua catalana I. Fonètica. Institut d’Estudis Catalans, Barcelona
Institut d’Estudis Catalans (1999) Proposta per a un estàndard oral de la llengua catalana II. Morfologia. Institut d’Estudis Catalans, Barcelona
Institut d’Estadística de Catalunya (2008, 2010) Territori. http://www.idescat.cat. Accessed 28 Feb 2011
Instituto Aragonés de Estadística (2007, 2009, 2010) Población y Territorio. http://www.aragon.es. Accessed 28 Feb 2011
Departament d’Estadística del Govern d’Andorra (2010) Societat i població: http://www.estadistica.ad. Accessed 28 Feb 2011
Cambra de Comerç – Indústria i Serveis d’Andorra (2008) Informe econòmic
Levenshtein V (1965) Binary codes capable of correcting deletions, insertions and reversals (in Russian). Dokl Akad Nauk SSSR 163:845–848
Heeringa W (2004) Measuring dialect pronunciation distances using Levenshtein distance. PhD thesis, Rijksuniversiteit Groningen
Heeringa W, Kleiweg P, Gooskens C, Nerbonne J (2006) Evaluation of String Distance Algorithms for Dialectology. In: Nerbonne J, Hinrichs E (eds) Linguistic distances workshop at the joint conference of International Committee on Computational Linguistics and the Association for Computational Linguistics, Sydney, pp 51–62
Wieling M, Prokić J, Nerbonne J (2009) Evaluating the pairwise string alignment of pronunciations. In: Borin L, Lendvai P (eds) Language technology and resources for cultural heritage, social sciences, humanities, and education, Workshop at the 12th Meeting of the European Chapter of the Association for Computational Linguistics. Athens, 30 March 2009, pp 26–34
Church K, Hanks P (1990) Word association norms, mutual information, and lexicography. Comput Linguist 16(1):22–29
Wieling M, Margaretha E, Nerbonne J (2012) Inducing a measure of phonetic similarity from pronunciation variation. J Phon 40(2):307–314
Wieling M, Nerbonne J, Baayen RH (2011) Quantitative social dialectology: explaining linguistic variation socially and geographically. PLoS One 6(9):e23613
Wieling M, Bloem J, Mignella K, Timmermeister M, Nerbonne J (2014) Measuring foreign accent strength in English: Validating Levenshtein distance as a measure. Lang Dyn Change 4(2):253–269
Tagliamonte S, Baayen RH (2012) Models, forests and trees of York English: was/were variation as a case study for statistical practice. Lang Var Chang 24(2):135–178
Baayen RH, Davidson DJ, Bates DM (2008) Mixed-effects modeling with crossed random effects for subjects and items. J Mem Lang 59(4):390–412
Barr DJ, Levy R, Scheepers C, Tily HJ (2013) Random effects structure for confirmatory hypothesis testing: Keep it maximal. J Mem Lang 68(3):255–278
Bates D, Kliegl R, Vasishth S, Baayen RH (2015) Parsimonious mixed models. http://arxiv.org/abs/1506.04967
Jaeger F (2008) Categorical data analysis: away from ANOVAs (transformation or not) and towards logit mixed models. J Mem Lang 59(4):434–446
Baayen RH (2008) Analyzing linguistic data. A practical introduction to statistics using R. Cambridge University Press, Cambridge
Hastie TJ, Tibshirani RJ (1990) Generalized additive models, vol 43. CRC Press, Boca Raton
Wood S (2006) Generalized additive models: An introduction with R. Chapman & Hall/CRC, Boca Raton
Wood S (2003) Thin plate regression splines. J R Stat Soc Ser B (Stat Methodol) 65(1):95–114
Wood S (2011) Fast stable restricted maximum likelihood and marginal likelihood estimation of semiparametric generalized linear models. J R Stat Soc Ser B 73(1):3–36
Nerbonne J (2010) Measuring the diffusion of linguistic change. Philos Trans R Soc B Biol Sci 365:3821–3828
Nerbonne J, Heeringa W (2007) Geographic distributions of linguistic variation reflect dynamics of differentiation. In: Featherston S, Sternefeld W (eds) Roots: Linguistics in search of its evidential base. Mouton De Gruyter, Berlin, pp 267–297
Wurm LH, Fisicaro SA (2014) What residualizing predictors in regression analysis does (and what it does not do). J Mem Lang 72:37–48
Cheshire J (2002) Sex and gender in variationist research. In: Chambers JK, Trudgill P, Schilling-Estes N (eds) The handbook of language variation and change. Blackwell, Oxford, pp 423–443
Labov W (2001) Principles of linguistic change, volume 2. Social factors. Blackwell, Oxford
Gorman K (2010) The consequences of multicollinearity among socioeconomic predictors of negative concord in Philadelphia. In: Lerner M (ed) University of Pennsylvania Working Papers in Linguistics, vol 16(2), pp 66–75
Keating P, Lindblom B, Lubker J, Kreiman J (1994) Variability in jaw height for segments in English and Swedish VCVs. J Phon 22:407–422
Chambers J (2009) Sociolinguistic theory: Linguistic variation and its social significance, 3rd edn. Wiley-Blackwell, Oxford
Recasens D (1996) Fonètica descriptiva del català (assaig de caracterització de la pronúncia del vocalisme i consonantisme del català al segle XX). Institut d’Estudis Catalans, Barcelona
Massanell M (2001) Morfologia flexiva actual de la Seu d’Urgell i Coll de Nargó: Estadis en el procés d’orientalització del català nord-occidental. Z Katalan 14:128–150
Romero S (2001) Canvi lingüístic en morfologia nominal a la Conca de Tremp. Universitat de Barcelona. PhD thesis. Available at http://hdl.handle.net/10803/2082
Acknowledgements
We thank the two anonymous reviewers for their extensive comments which have helped to improve this manuscript. This research was partly funded by the project Descripción e interpretación de la variación dialectal: aspectos fonológicos y morfológicos del catalán (FFI2010-22181-C03-02), financed by MICINN and FEDER.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this chapter
Cite this chapter
Wieling, M., Valls, E., Baayen, R.H., Nerbonne, J. (2018). Border Effects Among Catalan Dialects. In: Speelman, D., Heylen, K., Geeraerts, D. (eds) Mixed-Effects Regression Models in Linguistics. Quantitative Methods in the Humanities and Social Sciences. Springer, Cham. https://doi.org/10.1007/978-3-319-69830-4_5
Download citation
DOI: https://doi.org/10.1007/978-3-319-69830-4_5
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-69828-1
Online ISBN: 978-3-319-69830-4
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)