Sentence Level Paraphrase Recognition Based on Different Characteristics Combination

Zhang, Maoyuan; Zhang, Hong; Wu, Deyu; Pan, Xiaohang

doi:10.1007/978-3-319-12277-9_25

Maoyuan Zhang²¹,
Hong Zhang²¹,
Deyu Wu²¹ &
…
Xiaohang Pan²¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8801))

Included in the following conference series:

1576 Accesses
1 Citations

Abstract

This paper has proposed a novel method based on different characteristics combination to do paraphrase recognition. We employ different measurements to weigh the lexical part and syntactic part due to that the different part of sentence makes distinguishing contribution to the sentence semantic during the task of paraphrase recognition. Our experiment is conducted by parsing the pair sentences of MSRPC first, then followed by adopting differentiated weights to calculate the power of different parts of the sentence.Through this method, we have obtained the outperform precision and average F value result compared with the previous approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Clough, P., et al.: MEasuring TExt Reuse. In: Proceedings of the 40th Anniversary Meeting for the Association for Computational Linguistics, Pennsylvania, PA, pp. 152–159 (2002)
Google Scholar
Barzilay, R., Lee, L.: Learn to paraphrase, An Unsupervised Approach Using Multiple-Sequence Alignment. In: Proceedings of HLT-NAACL, pp. 16–23 (2003)
Google Scholar
Malakasiotis, P., Androutsopoulos, I.: Learning Textual Entailment using SVMs and String Similarity Measures. In: Proceedings of the ACL-PASCAL Workshop on Textual Entailment and Paraphrasing, 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, pp. 42–47 (2007)
Google Scholar
Fernando, S., Stevenson, M.: A Semantic Similarity Approach to Paraphrase Detection. Computational Linguistics (2008)
Google Scholar
Erk, K., Pado, S.: Paraphrase assessment in structured vector space Exploring parameters and datasets. In: Proceeding of the 2nd European Conference on Computational Learning Theory, Athens, Greece, pp. 57–65 (2009)
Google Scholar
Wan, S., et al.: Using dependency-based features to take the para-farce out of paraphrase. In: Proceedings of the 2006 Australasian Language Technology Workshop, pp. 131–138 (2006)
Google Scholar
Qiu, L., Kan, M., Chua, T.: Paraphrase recognition via dissimilarity significance classification. In: EMNLP 2006 Association for Computational Linguistics, Sydney, pp. 18–26 (2006)
Google Scholar
Socher, R., et al.: Dynamic Pooling and Unfolding Recursive Auto encoders for Paraphrase Detection. In: Conference of Neural Information Processing Systems Foundation (2011)
Google Scholar
Lintean, M., Rus, V.: Paraphrase Identification Using Weighted Dependencies and Word Semantics. In: Proceedings of the Twenty-Second International FLAIRS Conference, Sanibel Island, Florida, USA, pp. 260–265. Association for the Advancement of Artificial Intelligence, Sundial Beach (2009)
Google Scholar
Pang, B., Knight, K., Marcu, D.: Syntax-based alignment of multiple translations, Extracting Paraphrases and Generating New Sentences. In: Proceedings of HLT-NAACL, pp. 102–109 (2003)
Google Scholar
Dolan, W.B., Brockett, C.: Automatically Constructing a Corpus of Sentential Paraphrases. In: Proceeding of the 3rd International Workshop on Paraphrase, Jeju island, Korea, pp. 9–16 (2005)
Google Scholar
Zhang, Y., Patrick, J.: Paraphrase identification by text canonicalization. In: Proceedings of the Australasian Language Technology Workshop, Sydney, Australia, pp. 160–166 (2005)
Google Scholar
Recasens, M., Vila, M.: On Paraphrase and Coreference. Computational Linguistics 36(4), 639–647 (2010)
Article Google Scholar
Resnik, P.: Using Information Content to Evaluate Semantic Similarity in a Taxonomy. In: International Joint Conference on AI, pp. 448–453 (1995)
Google Scholar
Dolan, B., Quirk, C., Brockett, C.: Unsupervised Construction of Large Paraphrase Corpora, Exploiting Massively Parallel News Sources. In: Proceeding of the 20th International Conference on Computational Linguistics, Geneva, Switzerland, pp. 350–356 (2004)
Google Scholar
Toutanova, K., Manning, C.D.: Enriching the Knowledge Sources Used in a Maximum Entropy Part-of-Speech Tagger. In: Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, pp. 63–70 (2000)
Google Scholar
Klein, D., Manning, C.D.: Accurate Unlexicalized Parsing. In: Proceedings of the 41st Meeting of the Association for Computational Linguistics, pp. 423–430 (2003)
Google Scholar
Malakasiotis, P.: Paraphrase Recognition Using Machine Learning to Combine Similarity Measures. In: Proceedings of the ACL-IJCNLP 2009 Student Research Workshop, Suntec, Singapore, pp. 27–35 (2009)
Google Scholar
Callison-Burch, C.: Syntactic Constraints on Paraphrases Extracted from Parallel Corpora. In: Proceeding EMNLP 2008 Proceedings of the Conference on Empirical Methods in Natural Language Processing, Stroudsburg, PA, USA, pp. 196–205 (2008)
Google Scholar
Fellbaum, C.: WordNet: An Electronic Lexical Database. MIT Press (1998)
Google Scholar

Download references

Author information

Authors and Affiliations

Computer Science Department, Central China Normal University, Luoyu Road. 152, Wuhan, China
Maoyuan Zhang, Hong Zhang, Deyu Wu & Xiaohang Pan

Authors

Maoyuan Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Hong Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Deyu Wu
View author publications
You can also search for this author in PubMed Google Scholar
Xiaohang Pan
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science and Technology, Tsinghua University, Haidian District, 100084, Beijing, China
Maosong Sun & Yang Liu &
Chinese Academy of Sciences, Institute of Automation, 100190, Beijing, China
Jun Zhao

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhang, M., Zhang, H., Wu, D., Pan, X. (2014). Sentence Level Paraphrase Recognition Based on Different Characteristics Combination. In: Sun, M., Liu, Y., Zhao, J. (eds) Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data. NLP-NABD CCL 2014 2014. Lecture Notes in Computer Science(), vol 8801. Springer, Cham. https://doi.org/10.1007/978-3-319-12277-9_25

Download citation

DOI: https://doi.org/10.1007/978-3-319-12277-9_25
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-12276-2
Online ISBN: 978-3-319-12277-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics