Semantic similarity aggregators for very short textual expressions: a case study on landmarks and points of interest

Martinez-Gil, Jorge

doi:10.1007/s10844-019-00561-0

Semantic similarity aggregators for very short textual expressions: a case study on landmarks and points of interest

Published: 22 May 2019

Volume 53, pages 361–380, (2019)
Cite this article

Journal of Intelligent Information Systems Aims and scope Submit manuscript

Jorge Martinez-Gil ORCID: orcid.org/0000-0002-5730-7965¹

530 Accesses
8 Citations
6 Altmetric
Explore all metrics

Abstract

Semantic similarity measurement aims to automatically compute the degree of similarity between two textual expressions that use different representations for naming the same concepts. However, very short textual expressions cannot always follow the syntax of a written language and, in general, do not provide enough information to support proper analysis. This means that in some fields, such as the processing of landmarks and points of interest, results are not entirely satisfactory. In order to overcome this situation, we explore the idea of aggregating existing methods by means of two novel aggregation operators aiming to model an appropriate interaction between the similarity measures. As a result, we have been able to improve the results of existing techniques when solving the GeReSiD and the SDTS, two of the most popular benchmark datasets for dealing with geographical information.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A survey of density based clustering algorithms

Article 29 September 2020

The Sketch Engine: ten years on

Article 10 July 2014

Dataset search: a survey

Article Open access 24 August 2019

References

Ahlgren, P., Jarneving, B., Rousseau, R. (2003). Requirements for a cocitation similarity measure, with special reference to pearson’s correlation coefficient. JASIST, 54(6), 550–560.
Article Google Scholar
Amir, S., Tanasescu, A., Zighed, D.A. (2017). Sentence similarity based on semantic kernels for intelligent text retrieval. Journal of Intelligent Information System, 48(3), 675–689.
Article Google Scholar
Aouicha, M.B., & Taieb, M.A.H. (2015). G2WS: gloss-based wordnet and wiktionary semantic similarity measure. In 12Th IEEE/ACS international conference of computer systems and applications, AICCSA 2015, november 17-20, 2015 (pp. 1–7). Marrakech.
Ballatore, A., Bertolotto, M., Wilson, D.C. (2013). Geographic knowledge extraction and semantic similarity in openstreetmap. Knowledge and Information Systems, 37(1), 61–81.
Article Google Scholar
Ballatore, A., Bertolotto, M., Wilson, D.C. (2014). An evaluative baseline for geo-semantic relatedness and similarity. GeoInformatica, 18(4), 747–767.
Article Google Scholar
Ballatore, A., Wilson, D.C., Bertolotto, M. (2013). Computing the semantic similarity of geographic terms using volunteered lexical definitions. International Journal of Geographical Information Science, 27(10), 2099–2118.
Article Google Scholar
Buscaldi, D., Roux, J.L., Flores, J.J.G., Popescu, A. (2013). LIPN-CORE: Semantic text similarity using n-grams, wordnet, syntactic analysis, ESA and information retrieval based features. In Proceedings of the Second Joint Conference on Lexical and Computational Semantics, *SEM 2013, June 13-14, 2013 (pp. 162–168). Atlanta.
Chaves-González, J.M., & Martinez-Gil, J. (2013). Evolutionary algorithm based on different semantic similarity functions for synonym recognition in the biomedical domain. Knowledge-Based Systems, 37, 62–69.
Article Google Scholar
Feng, C., & Flewelling, D.M. (2004). Assessment of semantic similarity between land use/land cover classification systems. Computers, Environment and Urban Systems, 28(3), 229–246.
Article Google Scholar
Grabisch, M., Marichal, J., Mesiar, R., Pap, E. (2011). Aggregation functions: Construction methods, conjunctive, disjunctive and mixed classes. Information Sciences, 181(1), 23–43.
Article MathSciNet MATH Google Scholar
Han, L., Finin, T., McNamee, P., Joshi, A., Yesha Y. (2013). Improving word similarity by augmenting PMI with estimates of word polysemy. IEEE Transactions on Knowledge and Data Engineering, 25(6), 1307–1322.
Article Google Scholar
Hobel, H., Fogliaroni, P., Frank, A.U. (2016). Deriving the geographic footprint of cognitive regions. in Geospatial Data in a Changing World - Selected Papers of the 19th AGILE Conference on Geographic Information Science, 14-17 june 2016 (pp 67–84). Helsinki.
Hsu, H., & Chen, C. (1996). Aggregation of fuzzy opinions under group decision making. Fuzzy Sets and Systems, 79(3), 279–285.
Article MathSciNet Google Scholar
Janowicz, K., Raubal, M., Kuhn, W. (2011). The semantics of similarity in geographic information retrieval. Journal Spatial Information Science, 2(1), 29–57.
Google Scholar
Janowicz, K., Raubal, M., Schwering, A., Kuhn, W. (2008). Semantic similarity measurement and geospatial applications. Transactions in GIS, 12(6), 651–659.
Article Google Scholar
Ji, Q., Haase, P., Qi, G. (2011). Combination of similarity measures in ontology matching using the OWA operator. In Recent Developments in the Ordered Weighted Averaging Operators: Theory and Practice (pp. 281–295).
Chapter Google Scholar
Kuncheva, L. (2001). Using measures of similarity and inclusion for multiple classifier fusion by decision templates. Fuzzy Sets and Systems, 122(3), 401–407.
Article MathSciNet MATH Google Scholar
Landauer, T.K., & Psotka, J. (2000). Simulating text understanding for educational applications with latent semantic analysis: Introduction to LSA. Interactive Learning Environments, 8(2), 73–86.
Article Google Scholar
Lastra-Díaz, J.J., & García-Serrano, A. (2015a). A new family of information content models with an experimental survey on wordnet. Knowledge-Based Systems, 89, 509–526.
Lastra-Díaz, J.J., & García-Serrano, A. (2015b). A novel family of ic-based similarity measures with a detailed experimental survey on wordnet. Engineering Applications of AI, 46, 140–153.
Li, X., Cong, G., Li, X., Pham, T.N., Krishnaswamy, S. (2015). Rank-geofm: A ranking based geographical factorization method for point of interest recommendation. In Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, August 9-13, 2015 (pp. 433–442). Santiago.
Li, Y., McLean, D., Bandar, Z., O’Shea, J., Crockett, K.A. (2006). Sentence similarity based on semantic nets and corpus statistics. IEEE Transactions on Knowledge and Data Engineering, 18(8), 1138–1150.
Article Google Scholar
Lim, K.H., Chan, J., Leckie, C., Karunasekera, S. (2015). Personalized tour recommendation based on user interests and points of interest visit durations. In Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, IJCAI 2015, July 25-31, 2015 (pp. 1778–1784). Buenos Aires.
Lin, D. (1998). An information-theoretic definition of similarity. In Proceedings of the Fifteenth International Conference on Machine Learning (ICML 1998), July 24-27, 1998 (pp. 296–304). Madison.
Martinez-Gil, J. (2014). An overview of textual semantic similarity measures based on web intelligence. Artificial Intelligence Review, 42(4), 935–943.
Article Google Scholar
Martinez-Gil, J. (2016a). Accurate semantic similarity measurement of biomedical nomenclature by means of fuzzy logic. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, 24(2), 291–306.
Article Google Scholar
Martinez-Gil, J. (2016b). Coto: a novel approach for fuzzy aggregation of semantic similarity measures. Cognitive Systems Research, 40, 8–17.
Martinez-Gil, J., & Chaves-Gonzalez, J.M. (2019). Automatic design of semantic similarity controllers based on fuzzy logics. Expert Systems with Applications, 131, 45–59.
Article Google Scholar
Medina-Hernández, J.A., Gomez-castañeda, F., Moreno-Cadenas, J.A. (2009). An evolving fuzzy neural network based on the mapping of similarities. IEEE Transactions on Fuzzy Systems, 17(6), 1379–1396.
Article Google Scholar
Miller, G., & Charles, W. (1991). Contextual correlates of semantic similarity. Language and Cognitive Processes, 6(1), 1–28.
Article Google Scholar
Musavi, M.T., Kalantri, K., Ahmed, W., Chan, K.H. (1993). A minimum error neural network (MNN). Neural Networks, 6(3), 397–407.
Article Google Scholar
Pilehvar, M.T., & Navigli, R. (2015). From senses to texts: An all-in-one graph-based approach for measuring semantic similarity. Artificial Intelligence, 228, 95–128.
Article MathSciNet MATH Google Scholar
Pirró, G. (2009). A semantic similarity metric combining features and intrinsic information content. Data and Knowledge Engineering, 68(11), 1289–1308.
Article Google Scholar
Ranjbar, N., Mashhadirajab, F., Shamsfard, M., pour, R.H., Pour, A.V. (2017). Mahtab at semeval-2017 task 2 Combination of corpus-based and knowledge-based methods to measure semantic word similarity. In Proceedings of the 11th International Workshop on Semantic Evaluation, SemEval@ACL 2017, August 3-4, 2017 (pp. 256–260). Vancouver.
Resnik, P. (1995). Using information content to evaluate semantic similarity in a taxonomy. In Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, IJCAI 95, Montréal Québec, Canada, August 20-25 1995, (Vol. 2 pp. 448–453).
Rodríguez, M.A., & Egenhofer, M.J. (2003). Determining semantic similarity among entity classes from different ontologies. IEEE Transactions on Knowledge and Data Engineering, 15(2), 442–456.
Article Google Scholar
Rus, V., Lintean, M.C., Banjade, R., Niraula, N.B., Stefanescu, D. (2013). SEMILAR: The semantic similarity toolkit. In 51st Annual Meeting of the Association for Computational Linguistics, ACL 2013, Proceedings of the Conference System Demonstrations, 4-9 August 2013 (pp. 163–168 ). Sofia.
Rybinski, M., & Aldana-Montes, J.F. (2017). Domesa: a novel approach for extending domain-oriented lexical relatedness calculations with domain-specific semantics. Journal of Intelligent Information System, 49(3), 315–331.
Article Google Scholar
Setiono, R. (2001). Generating linear regression rules from neural networks using local least squares approximation. In Connectionist Models of Neurons, Learning Processes and Artificial Intelligence, 6th International Work-conference on Artificial and Natural Neural Networks, IWANN 2001 granada, spain, june 13-15, 2001, proceedings, Part I (pp. 277–284).
Chapter Google Scholar
Turney, P.D. (2013). Distributional semantics beyond words: Supervised learning of analogy and paraphrase. TACL, 1, 353–366.
Google Scholar
Webb, G.I., & Zheng, Z. (2004). Multistrategy ensemble learning: Reducing error by combining ensemble learning techniques. IEEE Transactions on Knowledge and Data Engineering, 16(8), 980–991.
Article Google Scholar
Zhang, P., Zhang, Z., Zhang, W., Wu, C. (2014). Semantic similarity computation based on multi-feature combination using hownet. JSW, 9(9), 2461–2466.
MathSciNet Google Scholar

Download references

Acknowledgments

We would like to thank the anonymous reviewers for their helpful and constructive comments that greatly contributed to improve this work. The research reported in this paper has been supported by the Austrian Ministry for Transport, Innovation and Technology, the Federal Ministry of Science, Research and Economy, and the Province of Upper Austria under the frame of the COMET Center SCCH [FFG: 844597].

Author information

Authors and Affiliations

Software Competence Center Hagenberg GmbH, Softwarepark 21, 4232, Hagenberg, Austria
Jorge Martinez-Gil

Authors

Jorge Martinez-Gil
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jorge Martinez-Gil.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Martinez-Gil, J. Semantic similarity aggregators for very short textual expressions: a case study on landmarks and points of interest. J Intell Inf Syst 53, 361–380 (2019). https://doi.org/10.1007/s10844-019-00561-0

Download citation

Received: 14 May 2018
Revised: 10 May 2019
Accepted: 14 May 2019
Published: 22 May 2019
Issue Date: October 2019
DOI: https://doi.org/10.1007/s10844-019-00561-0

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Semantic similarity aggregators for very short textual expressions: a case study on landmarks and points of interest

Abstract

Access this article

Similar content being viewed by others

A survey of density based clustering algorithms

The Sketch Engine: ten years on

Dataset search: a survey

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Semantic similarity aggregators for very short textual expressions: a case study on landmarks and points of interest

Abstract

Access this article

Similar content being viewed by others

A survey of density based clustering algorithms

The Sketch Engine: ten years on

Dataset search: a survey

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation