Abstract
Senso Comune is an open knowledge base for the Italian language, available through a Web-based collaborative platform, whose construction is in progress. The resource integrates dictionary data coming from both users and legacy resources with an ontological backbone, which provides foundations for a formal characterization of lexical semantic structures (frames). A nucleus of basic Italian lemmas, which have been semantically analyzed and classified, is available for both online access and download. A restricted community of contributors is currently working on increasing the lexical coverage of the resource.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
Besides the authors of this paper, the group, lead by Tullio De Mauro, includes Nicola Guarino, Maurizio Lenzerini and Laure Vieu.
- 3.
IBM Italia Foundation’s symposium La dimensione semantica dell’Information Technology (The semantic dimension of Information Technology), Rome, November 27, 2006.
- 4.
- 5.
See for example: http://multiwordnet.itc.it/english/home
- 6.
- 7.
Lexicons often omit any reference to ontological categories that are not lexicalised in a language, although it sometimes happens, as with EuroWordNet’s ILIs or FrameNet’s non-lexicalised frames.
- 8.
In this respect, our approach is essentially different from OntoNotes [32], where multi-lingual corpora have been annotated with shallow semantic features based on the Omega ontology. Omega contains“no formal concept definitions and only relatively few interconnections” [33] while Senso Comune, conversely, is explicitly grounded on an ontological model (see Sect. 2.2).
- 9.
Hyponymy, antonymy, troponymy, causality, similarity, etc.
- 10.
- 11.
- 12.
For an updated list of wordnet projects see: http://www.globalwordnet.org/
- 13.
- 14.
- 15.
- 16.
There is yet another WordNet for Italian developed by a company, but it is not freely accessible.
- 17.
Grande dizionario italiano dell’uso (Gradit), Torino, UTET
- 18.
Two teams of five linguists each, based in Rome and Bologna, under the supervision of Isabella Chiari and Malvina Nissim, were dedicated to this task.
- 19.
- 20.
- 21.
For the sake of readability, we don’t go through the basic senses of the verb ‘to walk’, also assuming that s1 is adequately selected by the user.
- 22.
Note that we may wish to distinguish descriptions of functions from actual ones, namely those functions which are performed at a certain time by a given object. In the above example we simplify this distinction only focusing on the latter case.
- 23.
Uncertainty will be included only in future releases of the TMEO system.
- 24.
- 25.
See for example: http://msw2.deri.ie/
References
Assem Mv, Gangemi A, Schreiber G (2006) Conversion of WordNet to a standard RDF/OWL representation. In: Proceedings of the international conference on language resources and evaluation (LREC), Genoa, Italy
Baader F, Calvanese D, Mcguinness DL, Nardi D, Patel-Schneider PF (eds) (2003) The description logic handbook: theory, implementation and applications. Cambridge University Press, New York
Basile V, Bos J, Evang K, Venhuizen N (2012) Developing a large semantically annotated corpus. In: To appear in proceedings of LREC, Istanbul, Turkey
Bonial C, Corvey W, Palmer M, Petukhova V, Bunt H (2011) A hierarchical unification of lirics and verbnet semantic roles. In: 2011 fifth IEEE international conference on semantic computing (ICSC), Palo Alto. IEEE, pp 483–489
Bortolini U, Tagliavini C, Zampolli A (1972) Lessico di frequenza della lingua italiana contemporanea. Garzanti, Milano
Buitelaar P, Cimiano P, Haase P, Sintek M (2009) Towards linguistically grounded ontologies. In: Aroyo L, Traverso P, Ciravegna F, Cimiano P, Heath T, Hyvönen E, Mizoguchi R, Oren E, Sabou M, Simperl E (eds) The semantic web: research and applications. Lecture notes in computer science, vol 5554, Springer, Berlin/Heidelberg, pp 111–125
Burchardt A, Erk K, Frank A, Kowalski A, Pado S, Pinkal M (2006) The SALSA corpus: a German corpus resource for lexical semantics. In: Proceedings of LREC 2006, Genoa
Chiarcos C, Nordhoff S, Hellman S (eds) (2012) Integrating WordNet and Wiktionary with lemon. Springer, Frankfurt
Chiari I, Alessandro Oltramari GV (2010) Di cosa parliamo quando parliamo fondamentale? In: Atti del Convegno della Società di linguistica italiana (SLI 2010), Viterbo, pp 221–236
De Mauro T (1965) Introduzione alla semantica. Laterza, Bari
De Mauro T (1980) Guida all’uso delle Parole. Editori riuniti, Roma
De Mauro T, Mancini F, Vedovelli M, Voghera M (1993) Lessico di frequenza dell’italiano parlato (LIP). Etaslibri, Milano
Fellbaum C (1998) WordNet - An Electronic Lexical Database. MIT Press, Cambridge, Massachusetts
Fillmore CJ (1968) The case for case. In: Bach E, Harms T (eds) Universals in linguistic theory. Rinehart and Wiston, New York
Gangemi A (2010) What’s in a schema? A formal metamodel for ECG and FrameNet. Studies in Natural Language Processing. Cambridge University Press, Cambridge
Gildea D, Jurafsky D (2002) Automatic labeling of semantic roles. Comput Linguist 28(3):245–288
Giuglea AM, Moschitti A (2006) Semantic role labeling via FrameNet, VerbNet and PropBank. In: Proceedings of the COLING-ACL, association for computational linguistics, Stroudsburg, PA, USA, ACL-44, pp 929–936. DOI 10.3115/1220175.1220292
Kipper K, Korhonen A, Ryant N, Palmer M (2008) A large-scale classification of English verbs. Lang Resour Eval 42(1):21–40
Lenci A, Bel N, Busa F, Calzolari N, Gola E, Monachini M, Ogonowski A, Peters I, Peters W, Ruimy N, Villegas M, Zampolli A (2000) Simple: a general framework for the development of multilingual lexicons. Int J Lexicogr 13(4):249–263
Lenci A, Calzolari N, Zampolli A (2002) From text to content: computational lexicons and the semantic web. In: Eighteenth national conference on artificial intelligence; AAAI workshop, “Semantic web meets language resources”, Edmonton, Alberta, Canada
Lenci A, Johnson M, Lapesa G (2010) Building an Italian framenet through semi-automatic corpus analysis. In: Proceedings of LREC 2010, Valletta, pp 12–19
Levin B (2003) English verb classes and alternation: a preliminary investigation. University of Chicago Press, Chicago
Màrquez L, Carreras X, Litkowski KC, Stevenson S (2008) Semantic role labeling: an introduction to the special issue. Comput Linguist 34(2):145–159
Masolo C, Gangemi A, Guarino N, Oltramari A, Schneider L (2002) WonderWeb deliverable D17: the WonderWeb library of foundational ontologies. Technical report
Minsky M (1997) A framework for representing knowledge. In: Winston P (ed) Mind design. MIT Press, Cambridge, pp 111–142
Nuzzolese AG, Gangemi A, Presutti V (2011) Gathering lexical linked data and knowledge patterns from FrameNet. In: Proceedings of international conference on knowledge capture (K-CAP2011), Banff
Oltramari A, Mehler A, Kühnberger KU, Lobin H, Lüngen H, Storrer A, Witt A (2012) An introduction to hybrid semantics: the role of cognition in semantic resources, vol 370. Springer, Berlin/Heidelberg, pp 97–109
Pantel P, Bhagat R, Coppola B, Chklovski T, Hovy EH (2007) ISP: learning inferential selectional preferences. In: Proceedings of HLT-NAACL, Rochester, pp 564–571
Patwardhan S, Pedersen T (2006) Using WordNet-based context vectors to estimate the semantic relatedness of concepts. In: Proceedings of the EACL 2006 workshop on making sense of sense: bringing computational linguistics and psycholinguistics together, Trento, Italy, pp 1–8
Pianta E, Bentivogli L, Girardi C (2002) MultiWordNet: developing an aligned multilingual database. In: Proceedings of the first international conference on global WordNet. Central Institute of Indian Languages, Mysore
Poesio M (2005) Domain modelling and NLP: formal ontologies? Lexica? Or a bit of both? Appl Ontol (1):27–33
Pradhan S, Hovy E, Marcys M, Palmer M, Ramshaw L, Weischede R (2002) Ontonotes: a unified relational semantic representation. In: Proceedings of the first IEEE international conference on semantic computing (ICSC-07), Irvine, CA
Prev́ot L, Huang CR, Calzolari N, Gangemi A, Lenci A, Oltramari A (eds) (2010) Ontology and the lexicon. Cambridge University Press, Cambridge/New York
Pustejovsky J (1995) The generative lexicon. MIT Press, Cambridge
Pustejovsky J, Jezek E (2008) Semantic coercion in language: beyond distributional analysis. Ital J Linguist 20(1):175–208
Reale G (1990) A history of ancient philosophy: Plato and Aristotle. SUNY Press, Albany
Resnik P (1996) Selectional constraints: an information-theoretic model and its computational realization. Cognition 61:127–159
Roventini A, Alonge A, Calzolari N, Magnini B, Bertagna F (2000) ItalWordNet: a large semantic database for italian. In: Proceedings LREC 2000, Athens, pp 783–790
Ruppenhofer J, Ellsworth M, Petruck M, Johnson C (2005) FrameNet: Theory and Practice. Available at: http://framenet.icsi.berkeley.edu/
Searle J (1995) The construction of social reality. Paperback. Free Press, New York
Settles B, Small K, Tomanek K (eds) (2010) Proceedings of the NAACL HLT 2010 workshop on active learning for natural language processing. Association for Computational Linguistics, Los Angeles
Shi L, Mihalcea R (2005) Putting pieces together: combining FrameNet, VerbNet and WordNet for robust semantic parsing. In: Proceedings of the 6th international conference on computational linguistics and intelligent text processing, CICLing 2005. Springer, Berlin/Heidelberg/New York, pp 100–111
Simons P (ed) (1987) Parts: a study in ontology. Clarendon Press, Oxford
Stevenson M, Greenwood M (2006) Learning information extraction patterns using wordnet. In: Third international global WordNet conference (GWNC-2006), Jeju Island, Korea, pp 52–60
Suárez-Figueroa MC, Gómez-Pérez A, Motta E, Gangemi A (eds) (2012) Ontology engineering in a networked world. Springer, Berlin
Tonelli S (2010) Semi-automatic techniques for extending the framenet lexical database to new languages. Ph.D. thesis, Department of Language Sciences, Università Ca’ Foscari Venezia, Venezia
Tonelli S, Pighin D, Giuliano C, Pianta E (2009) Semi-automatic development of framenet for italian. In: Proceedings of the FrameNet workshop and masterclass, Milano
Vetere G, Oltramari A, Chiari I, Jezek E, Vieu L, Zanzotto FM (2012) Senso comune, an open knowledge base for Italian. TAL – Traitement Automatique des Langues, Special Issue of the on “Free language resources”. 52(3):217–243
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Oltramari, A. et al. (2013). Senso Comune: A Collaborative Knowledge Resource for Italian. In: Gurevych, I., Kim, J. (eds) The People’s Web Meets NLP. Theory and Applications of Natural Language Processing. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35085-6_2
Download citation
DOI: https://doi.org/10.1007/978-3-642-35085-6_2
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35084-9
Online ISBN: 978-3-642-35085-6
eBook Packages: Computer ScienceComputer Science (R0)