Abstract
The definitions in dictionaries are a source of information to support the results obtained by the automatic extraction of collocations from a text corpus. Measures of association, which are generally used in this task, are useful tools to extract candidate combinations. However, they do not offer information about other features of the collocations. They do not distinguish whether a combination is categorized as a collocation because of its frequency properties or because of its structural properties. Moreover, they cannot distinguish between lexical collocations and functional collocations with delexicalized elements. In this paper, we use a graph database for representing collocations and relations between words retrieved from dictionaries. We consider relations between lemmas and definiens in dictionary entries as well as relations between two words used to define the same sense of another one. This allows us to use a clustering algorithm and measures of centrality and influence in networks to identify semantic characteristics of combinations. The aim is to enrich the information on the combinatorial restrictions of words based on frequencies obtained by means of corpus linguistic techniques.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Bibliography
Alonso Ramos, M. (1994). Hacia una definición del concepto de colocación: De J. R. Firth a I. A. Mel’čuk. Revista de Lexicografía, 1, 9–28.
Alonso Ramos, M. (2002a). Colocaciones y contorno de la definición lexicográfica. Lingüística Española Actual, 24(1), 63–96.
Alonso Ramos, M. (2002b). Diccionario de Colocaciones del Español. http://www.dicesp.com/paginas. Accessed 20 October 2017.
Alonso, A., Millon, C., & Williams, G. (2011). Collocational networks and their application to an E-advanced Learner’s dictionary of verbs in science (DicSci). In I. Kosem & K. Kosem (Eds.), Electronic lexicography in the 21st century: New applications for new users: Proceedings of eLex 2011 (pp. 12–122). Liubliana: Trojina, Institute for Applied Slovene Studies.
Alvar Ezquerra, M. (1998). Diccionario Ideológico de la Lengua Española. Madrid: Vox.
Anthony, L. (2018). AntConc (version 3.5.6) [computer software]. Tokyo: Waseda University.
Baker, P. (2005). Public discourses of gay men. London: Routledge.
Baker, P. (2014). Using corpora to analyse gender. London: Bloomsbury.
Baker, P. (2016). The shapes of collocation. International Journal of Corpus Linguistics, 21(2), 139–164.
Bartsch, S. (2004). Structural and functional properties of collocations in English. A corpus study of lexical and pragmatic constrains on lexical co-occurrence. Tübingen: Gunter Narr Verlag.
Bosque, I. (2001). Sobre el concepto de colocación y sus límites. Lingüística Española Actual, 23(1), 9–40.
Bosque, I. (2004a). REDES Diccionario combinatorio del español contemporáneo. Madrid: Ediciones SM.
Bosque, I. (2004b). La direccionalidad en los diccionarios combinatorios y el problema de la selección léxica. In T. Cabré Monné (Ed.), Lingüística teòrica: anàlisi i perspectives I (pp. 13–58). Bellaterra: Universitat Autónoma de Barcelona (Catalan Journal of Linguistics. Monografies; 2).
Bosque, I. (2006). Diccionario Combinatorio Práctico del Español Contemporáneo. Madrid: SM.
Brezina, V., McEnery, T., & Wattam, S. (2015). Collocations in context: A new perspective on collocation networks. International Journal of Corpus Linguistics, 20(2), 139–173.
Cantos, P., & Sánchez, A. (2001). Lexical constellations: What collocates fail to tell. International Journal of Corpus Linguistics, 6(2), 199–228.
Casares, J. (1999). Diccionario Ideológico de la Lengua Española. Barcelona: Editorial Gustavo Gili, SA.
Castillo Carballo, M. (1997). El concepto de Unidad Fraseológica. Revista de Lexicografía, IV, 67–79.
Castillo Carballo, M. A. (1998). El término ‘colocación’ en la lingüística actual. Lingüística Española Actual, 20(1), 41–54.
Castillo Carballo, M. A. (2001). Colocaciones léxicas y variación lingüística: implicaciones didácticas. Lingüística Española Actual, 23(1), 133–143.
Church, K. W., & Hanks, P. (1990). Word association norms, mutual information, and lexicography. Computational Linguistics, 16(1), 22–29.
Corpas Pastor, G. (1996). Manual de Fraseología española. Madrid: Gredos.
Corpas Pastor, G. (2001). Apuntes para el estudio de la colocación. Lingüística Española Actual, 23(1), 41–56.
Corpas Pastor, G. (2003). Diez años de investigación en fraseología: análisis sintáctico-semánticos, contrastivos y traductológicos. Madrid: Iberoamericana.
Diccionario General de la Lengua Española. 2008. Barcelona: Vox.
Evert, S. (2005). The statistics of word coocurrences. Word pairs and collocations. (Unpublished PhD Thesis) Universität Stuttgart.
García Platero, J. M. (2002). Aspectos semánticos de las colocaciones. Lingüística Española Actual, 24(1), 25–34.
García Page, M. (2001). Adverbios restringidos y adverbios colocacionales. Revista de lexicografía, 23(1), 89–106.
Girvan, M., & Newman, M. E. (2002). Community structure in social and biological networks. Proceedings of the National Academy of Sciences, 99(12), 7821–7826.
Gries, S. T. (2013). John Benjamins Publishing Company. International Journal of Corpus Linguistics, 18(1), 137–165.
Hausmann, F. J. (1979). Un dictionnaire des collocations est-il possible? Travaux de linguistique et de litterature, 17(1), 187–195.
Hausmann, F. J., & Blumenthal, P. (2006). Présentation: collocation, corpus, dictionnaires. Langue Française, 150, 3–13.
Jackson, A. F., & Bolger, D. J. (2014). Using a high-dimensional graph of semantic space to model relationships among words. Frontiers in Psychology, 5, 385.
Jiménez Martínez, M. I. (2016). Colocaciones y verbos soporte en latín: semántica y sintaxis del verbo pono. (Unpublished PhD Thesis). Universidad Complutense de Madrid. http://eprints.ucm.es/44254/1/T39088.pdf. Accessed 24 March 2018.
Kolakzyc, E. D., & Csárdi, G. (2014). Statistical analysis of network data with R (Vol. 65). New York: Springer.
Koike, K. (1993). Caracterización y estructuras del verbo compuesto. Hispanic, 37, 14–29.
Koike, K. (2001). Colocaciones léxicas en español actual. Madrid: Universidad de Alcalá.
Leavitt, N. (2010). Will NoSQL databases live up to their promise? Computer, 43(2), 12–14.
McEnery, T. (2006). Swearing in English: Bad language, purity and power from 1586 to the present. Abington: Routledge.
Mel’čuk, I. (1998). Collocations and lexical functions. In A. P. Cowie (Ed.), Phraseology. Theory, analysis and applications (pp. 3–53). Oxford: Clarendon Press.
Molina-Plaza, S., and Sancho-Guinda, C. (2007). Collocational trends of de-lexicalised English verbs (have, make, take) and their Spanish homologues: A preliminary contrastive study. In Proceedins of the 26th international conference on Lexis and grammar, eds. C. Camugli, M. Constant and A. Dister. http://infolingu.univ-mlv.fr/Colloques/Bonifacio/proceedings/molina.pdf. Accessed 24 March 2018.
Penadés Martínez, I. (2001). ¿Colocaciones o locuciones verbales. Lingüística Española Actual, 23(1), 57–88.
Phillips, M. K. (1983). Lexical macrostructure in science text (Unpublished PhD Thesis). University of Birmingham, Birmingham, UK.
Phillips, M. (1985). Aspects of text structure: An investigation of the lexical organisation of text. Amsterdam: North-Holland.
Phillips, M. (1989). Lexical Structure of Text. Birmingham: ELR. In University of Birmingham.
Real Academia Española. (2017). Diccionario de la Lengua Española. Madrid: Espasa Calpe.
Robinson, I., Webber, J., & Eifrem, E. (2013). Graph databases. Sebastopol, CA: O'Reilly Media, Inc..
Santana, O., Pérez, J., Hernández, Z., Carreras, F., & Rodríguez, G. (1997). FLAVER: Flexionador y lematizador automático de formas verbales. Lingüística Española Actual, 19(2), 229–282.
Santana, O., Pérez, J., Carreras, F., Duque, J., Hernández, Z., & Rodríguez, G. (1999). FLANOM: Flexionador y lematizador automático de formas nominales. Lingüística Española Actual, 21(2), 253–297.
Santana-Suárez, S., Aguiar, J. R. P., Riudavets, F. J. C., Figueroa, Z. J. H., del Pino, J. C. R., Roca, M. D., & Rodríguez, G. R. (2007) Development of Support Services for Linguistic Research over the Internet TIN2004-03988.
Santana-Suárez, O., Pérez-Aguiar, J., Sánchez-Berriel, I., & Gutiérrez-Rodríguez, V. (2014). COLEXWEB, herramienta de consulta de las capacidades combinatorias de las palabras del español. Lingüística Española Actual, 36(2), 273–295.
Siepmann, D. (2005). Collocation, colligation and encoding dictionaries. Part I: Lexicological aspects. International Journal of Lexicography, 18(4), 409–443.
Scott, M. (2016). WordSmith Tools version 7 [computer software]. Stroud: Lexical analysis software.
Seco, M. (1978). Problemas formales de la definición. In J. L. García-Arias, M. V. Conde, & J. Martínez-Álvarez (Eds.), Estudios ofrecidos a E. Alarcos Llorach (Vol. II, pp. 217–239). Oviedo: Universidad.
Seco, M. (1979). El contorno de la definición. In G. Suárez-Blanco (Ed.), Homenaje a Samuel Gili Gaya (in memoriam) (pp. 189–191). Barcelona: Vox.
Seco, M., Ramos, G., & Andrés, O. (1992). El diccionario del español actual. Madrid: Aguilar.
Vincze, O., & Alonso Ramos, M. (2013). Incorporating frequency information in a collocation dictionary: Establishing a methodology. In C. Vargas (Ed.), Corpus resources for descriptive and applied studies. Current challenges and future directions: Selected papers from the 5th international conference on Corpus linguistics (CILC2013) (pp. 241–248). Amsterdam: Elsevier.
Vincze, O., Mosqueira, E., & Alonso Ramos, M. (2011). An online collocation dictionary of Spanish. In I. Boguslavsky & L. Wanner (Eds.), Proceedings of the 5thinternational conference on meaning-text theory (pp. 275–286). Barcelona: Árbol Académico.
Wanner, L., Ferraro, G., & Moreno, P. (2016). Towards distributional semantics-based classification of collocations for collocation dictionaries. International Journal of Lexicography, 30(2), 167–186.
Williams, G. (1998). Collocational networks: Interlocking patterns of lexis in a corpus of plant biology research articles. International Journal of Corpus Linguistics, 3(1), 151–171.
Williams, G., Decesaris, J., & Alonso, A. (2017). Studying lexical meaning in context: From collocation to collocational networks and resonance. In S. Torner & E. Bernal (Eds.), Collocations and other lexical combinations in Spanish. Theoretical, lexicographical and applied perspectives (pp. 41–72). London: Routledge.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this chapter
Cite this chapter
Sánchez-Berriel, I., Suárez, O.S., Rodríguez, V.G., Aguiar, J.P. (2018). Network Analysis Techniques Applied to Dictionaries for Identifying Semantics in Lexical Spanish Collocations. In: Cantos-Gómez, P., Almela-Sánchez, M. (eds) Lexical Collocation Analysis. Quantitative Methods in the Humanities and Social Sciences. Springer, Cham. https://doi.org/10.1007/978-3-319-92582-0_3
Download citation
DOI: https://doi.org/10.1007/978-3-319-92582-0_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-92581-3
Online ISBN: 978-3-319-92582-0
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)