Advertisement

Network Analysis Techniques Applied to Dictionaries for Identifying Semantics in Lexical Spanish Collocations

  • Isabel Sánchez-Berriel
  • Octavio Santana Suárez
  • Virginia Gutiérrez Rodríguez
  • José Pérez Aguiar
Chapter
Part of the Quantitative Methods in the Humanities and Social Sciences book series (QMHSS)

Abstract

The definitions in dictionaries are a source of information to support the results obtained by the automatic extraction of collocations from a text corpus. Measures of association, which are generally used in this task, are useful tools to extract candidate combinations. However, they do not offer information about other features of the collocations. They do not distinguish whether a combination is categorized as a collocation because of its frequency properties or because of its structural properties. Moreover, they cannot distinguish between lexical collocations and functional collocations with delexicalized elements. In this paper, we use a graph database for representing collocations and relations between words retrieved from dictionaries. We consider relations between lemmas and definiens in dictionary entries as well as relations between two words used to define the same sense of another one. This allows us to use a clustering algorithm and measures of centrality and influence in networks to identify semantic characteristics of combinations. The aim is to enrich the information on the combinatorial restrictions of words based on frequencies obtained by means of corpus linguistic techniques.

Bibliography

  1. Alonso Ramos, M. (1994). Hacia una definición del concepto de colocación: De J. R. Firth a I. A. Mel’čuk. Revista de Lexicografía, 1, 9–28.Google Scholar
  2. Alonso Ramos, M. (2002a). Colocaciones y contorno de la definición lexicográfica. Lingüística Española Actual, 24(1), 63–96.Google Scholar
  3. Alonso Ramos, M. (2002b). Diccionario de Colocaciones del Español. http://www.dicesp.com/paginas. Accessed 20 October 2017.
  4. Alonso, A., Millon, C., & Williams, G. (2011). Collocational networks and their application to an E-advanced Learner’s dictionary of verbs in science (DicSci). In I. Kosem & K. Kosem (Eds.), Electronic lexicography in the 21st century: New applications for new users: Proceedings of eLex 2011 (pp. 12–122). Liubliana: Trojina, Institute for Applied Slovene Studies.Google Scholar
  5. Alvar Ezquerra, M. (1998). Diccionario Ideológico de la Lengua Española. Madrid: Vox.Google Scholar
  6. Anthony, L. (2018). AntConc (version 3.5.6) [computer software]. Tokyo: Waseda University.Google Scholar
  7. Baker, P. (2005). Public discourses of gay men. London: Routledge.Google Scholar
  8. Baker, P. (2014). Using corpora to analyse gender. London: Bloomsbury.Google Scholar
  9. Baker, P. (2016). The shapes of collocation. International Journal of Corpus Linguistics, 21(2), 139–164.CrossRefGoogle Scholar
  10. Bartsch, S. (2004). Structural and functional properties of collocations in English. A corpus study of lexical and pragmatic constrains on lexical co-occurrence. Tübingen: Gunter Narr Verlag.Google Scholar
  11. Bosque, I. (2001). Sobre el concepto de colocación y sus límites. Lingüística Española Actual, 23(1), 9–40.Google Scholar
  12. Bosque, I. (2004a). REDES Diccionario combinatorio del español contemporáneo. Madrid: Ediciones SM.Google Scholar
  13. Bosque, I. (2004b). La direccionalidad en los diccionarios combinatorios y el problema de la selección léxica. In T. Cabré Monné (Ed.), Lingüística teòrica: anàlisi i perspectives I (pp. 13–58). Bellaterra: Universitat Autónoma de Barcelona (Catalan Journal of Linguistics. Monografies; 2).Google Scholar
  14. Bosque, I. (2006). Diccionario Combinatorio Práctico del Español Contemporáneo. Madrid: SM.Google Scholar
  15. Brezina, V., McEnery, T., & Wattam, S. (2015). Collocations in context: A new perspective on collocation networks. International Journal of Corpus Linguistics, 20(2), 139–173.CrossRefGoogle Scholar
  16. Cantos, P., & Sánchez, A. (2001). Lexical constellations: What collocates fail to tell. International Journal of Corpus Linguistics, 6(2), 199–228.CrossRefGoogle Scholar
  17. Casares, J. (1999). Diccionario Ideológico de la Lengua Española. Barcelona: Editorial Gustavo Gili, SA.Google Scholar
  18. Castillo Carballo, M. (1997). El concepto de Unidad Fraseológica. Revista de Lexicografía, IV, 67–79.Google Scholar
  19. Castillo Carballo, M. A. (1998). El término ‘colocación’ en la lingüística actual. Lingüística Española Actual, 20(1), 41–54.Google Scholar
  20. Castillo Carballo, M. A. (2001). Colocaciones léxicas y variación lingüística: implicaciones didácticas. Lingüística Española Actual, 23(1), 133–143.Google Scholar
  21. Church, K. W., & Hanks, P. (1990). Word association norms, mutual information, and lexicography. Computational Linguistics, 16(1), 22–29.Google Scholar
  22. Corpas Pastor, G. (1996). Manual de Fraseología española. Madrid: Gredos.Google Scholar
  23. Corpas Pastor, G. (2001). Apuntes para el estudio de la colocación. Lingüística Española Actual, 23(1), 41–56.Google Scholar
  24. Corpas Pastor, G. (2003). Diez años de investigación en fraseología: análisis sintáctico-semánticos, contrastivos y traductológicos. Madrid: Iberoamericana.Google Scholar
  25. Diccionario General de la Lengua Española. 2008. Barcelona: Vox.Google Scholar
  26. Evert, S. (2005). The statistics of word coocurrences. Word pairs and collocations. (Unpublished PhD Thesis) Universität Stuttgart.Google Scholar
  27. García Platero, J. M. (2002). Aspectos semánticos de las colocaciones. Lingüística Española Actual, 24(1), 25–34.Google Scholar
  28. García Page, M. (2001). Adverbios restringidos y adverbios colocacionales. Revista de lexicografía, 23(1), 89–106.Google Scholar
  29. Girvan, M., & Newman, M. E. (2002). Community structure in social and biological networks. Proceedings of the National Academy of Sciences, 99(12), 7821–7826.MathSciNetCrossRefGoogle Scholar
  30. Gries, S. T. (2013). John Benjamins Publishing Company. International Journal of Corpus Linguistics, 18(1), 137–165.CrossRefGoogle Scholar
  31. Hausmann, F. J. (1979). Un dictionnaire des collocations est-il possible? Travaux de linguistique et de litterature, 17(1), 187–195.Google Scholar
  32. Hausmann, F. J., & Blumenthal, P. (2006). Présentation: collocation, corpus, dictionnaires. Langue Française, 150, 3–13.CrossRefGoogle Scholar
  33. Jackson, A. F., & Bolger, D. J. (2014). Using a high-dimensional graph of semantic space to model relationships among words. Frontiers in Psychology, 5, 385.CrossRefGoogle Scholar
  34. Jiménez Martínez, M. I. (2016). Colocaciones y verbos soporte en latín: semántica y sintaxis del verbo pono. (Unpublished PhD Thesis). Universidad Complutense de Madrid. http://eprints.ucm.es/44254/1/T39088.pdf. Accessed 24 March 2018.
  35. Kolakzyc, E. D., & Csárdi, G. (2014). Statistical analysis of network data with R (Vol. 65). New York: Springer.zbMATHGoogle Scholar
  36. Koike, K. (1993). Caracterización y estructuras del verbo compuesto. Hispanic, 37, 14–29.CrossRefGoogle Scholar
  37. Koike, K. (2001). Colocaciones léxicas en español actual. Madrid: Universidad de Alcalá.Google Scholar
  38. Leavitt, N. (2010). Will NoSQL databases live up to their promise? Computer, 43(2), 12–14.CrossRefGoogle Scholar
  39. McEnery, T. (2006). Swearing in English: Bad language, purity and power from 1586 to the present. Abington: Routledge.Google Scholar
  40. Mel’čuk, I. (1998). Collocations and lexical functions. In A. P. Cowie (Ed.), Phraseology. Theory, analysis and applications (pp. 3–53). Oxford: Clarendon Press.Google Scholar
  41. Molina-Plaza, S., and Sancho-Guinda, C. (2007). Collocational trends of de-lexicalised English verbs (have, make, take) and their Spanish homologues: A preliminary contrastive study. In Proceedins of the 26th international conference on Lexis and grammar, eds. C. Camugli, M. Constant and A. Dister. http://infolingu.univ-mlv.fr/Colloques/Bonifacio/proceedings/molina.pdf. Accessed 24 March 2018.
  42. Penadés Martínez, I. (2001). ¿Colocaciones o locuciones verbales. Lingüística Española Actual, 23(1), 57–88.Google Scholar
  43. Phillips, M. K. (1983). Lexical macrostructure in science text (Unpublished PhD Thesis). University of Birmingham, Birmingham, UK.Google Scholar
  44. Phillips, M. (1985). Aspects of text structure: An investigation of the lexical organisation of text. Amsterdam: North-Holland.zbMATHGoogle Scholar
  45. Phillips, M. (1989). Lexical Structure of Text. Birmingham: ELR. In University of Birmingham.Google Scholar
  46. Real Academia Española. (2017). Diccionario de la Lengua Española. Madrid: Espasa Calpe.Google Scholar
  47. Robinson, I., Webber, J., & Eifrem, E. (2013). Graph databases. Sebastopol, CA: O'Reilly Media, Inc..Google Scholar
  48. Santana, O., Pérez, J., Hernández, Z., Carreras, F., & Rodríguez, G. (1997). FLAVER: Flexionador y lematizador automático de formas verbales. Lingüística Española Actual, 19(2), 229–282.Google Scholar
  49. Santana, O., Pérez, J., Carreras, F., Duque, J., Hernández, Z., & Rodríguez, G. (1999). FLANOM: Flexionador y lematizador automático de formas nominales. Lingüística Española Actual, 21(2), 253–297.Google Scholar
  50. Santana-Suárez, S., Aguiar, J. R. P., Riudavets, F. J. C., Figueroa, Z. J. H., del Pino, J. C. R., Roca, M. D., & Rodríguez, G. R. (2007) Development of Support Services for Linguistic Research over the Internet TIN2004-03988.Google Scholar
  51. Santana-Suárez, O., Pérez-Aguiar, J., Sánchez-Berriel, I., & Gutiérrez-Rodríguez, V. (2014). COLEXWEB, herramienta de consulta de las capacidades combinatorias de las palabras del español. Lingüística Española Actual, 36(2), 273–295.Google Scholar
  52. Siepmann, D. (2005). Collocation, colligation and encoding dictionaries. Part I: Lexicological aspects. International Journal of Lexicography, 18(4), 409–443.CrossRefGoogle Scholar
  53. Scott, M. (2016). WordSmith Tools version 7 [computer software]. Stroud: Lexical analysis software.Google Scholar
  54. Seco, M. (1978). Problemas formales de la definición. In J. L. García-Arias, M. V. Conde, & J. Martínez-Álvarez (Eds.), Estudios ofrecidos a E. Alarcos Llorach (Vol. II, pp. 217–239). Oviedo: Universidad.Google Scholar
  55. Seco, M. (1979). El contorno de la definición. In G. Suárez-Blanco (Ed.), Homenaje a Samuel Gili Gaya (in memoriam) (pp. 189–191). Barcelona: Vox.Google Scholar
  56. Seco, M., Ramos, G., & Andrés, O. (1992). El diccionario del español actual. Madrid: Aguilar.Google Scholar
  57. Vincze, O., & Alonso Ramos, M. (2013). Incorporating frequency information in a collocation dictionary: Establishing a methodology. In C. Vargas (Ed.), Corpus resources for descriptive and applied studies. Current challenges and future directions: Selected papers from the 5th international conference on Corpus linguistics (CILC2013) (pp. 241–248). Amsterdam: Elsevier.Google Scholar
  58. Vincze, O., Mosqueira, E., & Alonso Ramos, M. (2011). An online collocation dictionary of Spanish. In I. Boguslavsky & L. Wanner (Eds.), Proceedings of the 5thinternational conference on meaning-text theory (pp. 275–286). Barcelona: Árbol Académico.Google Scholar
  59. Wanner, L., Ferraro, G., & Moreno, P. (2016). Towards distributional semantics-based classification of collocations for collocation dictionaries. International Journal of Lexicography, 30(2), 167–186.Google Scholar
  60. Williams, G. (1998). Collocational networks: Interlocking patterns of lexis in a corpus of plant biology research articles. International Journal of Corpus Linguistics, 3(1), 151–171.CrossRefGoogle Scholar
  61. Williams, G., Decesaris, J., & Alonso, A. (2017). Studying lexical meaning in context: From collocation to collocational networks and resonance. In S. Torner & E. Bernal (Eds.), Collocations and other lexical combinations in Spanish. Theoretical, lexicographical and applied perspectives (pp. 41–72). London: Routledge.Google Scholar

Copyright information

© Springer International Publishing AG, part of Springer Nature 2018

Authors and Affiliations

  • Isabel Sánchez-Berriel
    • 1
  • Octavio Santana Suárez
    • 2
  • Virginia Gutiérrez Rodríguez
    • 1
  • José Pérez Aguiar
    • 2
  1. 1.Universidad de La LagunaTenerifeSpain
  2. 2.Universidad de Las Palmas de Gran CanariaCanariaSpain

Personalised recommendations