Document Similarity Approach Using Grammatical Linkages with Graph Databases

  • Conference paper
EAI International Conference on Big Data Innovation for Sustainable Cognitive Computing

Part of the book series: EAI/Springer Innovations in Communication and Computing ((EAISICC))

Abstract

Document similarity has become essential in many applications, such as document retrieval, recommendation systems, and plagiarism checking. Many similarity evaluation approaches rely on word-based document representations because they are very fast. However, these approaches are not accurate when the documents differ in language and vocabulary. Approaches that represent documents as graphs draw on relational knowledge, which is not feasible in many applications because graph operations are expensive. This work develops a novel approach to document similarity computation that utilizes verbal intent. The approach improves the similarity computation, and graph databases are used for faster performance. The system is evaluated on various datasets, and the verbal intent-based approach registers promising results.
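
To make the limitation of word-based representations concrete, the following is a minimal sketch of a bag-of-words cosine similarity in Python. It is an illustration of the general word-based baseline only, not the verbal intent method of the paper; the function names `bag_of_words` and `cosine_similarity` and the two sample sentences are assumptions introduced here.

```python
import math
import re
from collections import Counter


def bag_of_words(text):
    """Lowercase the text and count occurrences of each alphabetic word."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine_similarity(a, b):
    """Cosine similarity between two word-count vectors (Counters)."""
    # Only words present in both documents contribute to the dot product.
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical sentences: related in meaning, but with no words in common.
doc1 = "A physician treated the patient"
doc2 = "Doctors cure sick people"

score = cosine_similarity(bag_of_words(doc1), bag_of_words(doc2))
print(score)  # 0.0, because the two vocabularies do not overlap
```

The score is zero even though the sentences describe the same kind of event, which is the vocabulary mismatch the abstract refers to.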

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Cite this paper

Priya, V., Umamaheswari, K. (2020). Document Similarity Approach Using Grammatical Linkages with Graph Databases. In: Haldorai, A., Ramu, A., Mohanram, S., Onn, C. (eds) EAI International Conference on Big Data Innovation for Sustainable Cognitive Computing. EAI/Springer Innovations in Communication and Computing. Springer, Cham. https://doi.org/10.1007/978-3-030-19562-5_13

  • DOI: https://doi.org/10.1007/978-3-030-19562-5_13

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-19561-8

  • Online ISBN: 978-3-030-19562-5

  • eBook Packages: Engineering, Engineering (R0)
