Abstract
This paper has proposed a novel method based on different characteristics combination to do paraphrase recognition. We employ different measurements to weigh the lexical part and syntactic part due to that the different part of sentence makes distinguishing contribution to the sentence semantic during the task of paraphrase recognition. Our experiment is conducted by parsing the pair sentences of MSRPC first, then followed by adopting differentiated weights to calculate the power of different parts of the sentence.Through this method, we have obtained the outperform precision and average F value result compared with the previous approaches.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Clough, P., et al.: MEasuring TExt Reuse. In: Proceedings of the 40th Anniversary Meeting for the Association for Computational Linguistics, Pennsylvania, PA, pp. 152–159 (2002)
Barzilay, R., Lee, L.: Learn to paraphrase, An Unsupervised Approach Using Multiple-Sequence Alignment. In: Proceedings of HLT-NAACL, pp. 16–23 (2003)
Malakasiotis, P., Androutsopoulos, I.: Learning Textual Entailment using SVMs and String Similarity Measures. In: Proceedings of the ACL-PASCAL Workshop on Textual Entailment and Paraphrasing, 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, pp. 42–47 (2007)
Fernando, S., Stevenson, M.: A Semantic Similarity Approach to Paraphrase Detection. Computational Linguistics (2008)
Erk, K., Pado, S.: Paraphrase assessment in structured vector space Exploring parameters and datasets. In: Proceeding of the 2nd European Conference on Computational Learning Theory, Athens, Greece, pp. 57–65 (2009)
Wan, S., et al.: Using dependency-based features to take the para-farce out of paraphrase. In: Proceedings of the 2006 Australasian Language Technology Workshop, pp. 131–138 (2006)
Qiu, L., Kan, M., Chua, T.: Paraphrase recognition via dissimilarity significance classification. In: EMNLP 2006 Association for Computational Linguistics, Sydney, pp. 18–26 (2006)
Socher, R., et al.: Dynamic Pooling and Unfolding Recursive Auto encoders for Paraphrase Detection. In: Conference of Neural Information Processing Systems Foundation (2011)
Lintean, M., Rus, V.: Paraphrase Identification Using Weighted Dependencies and Word Semantics. In: Proceedings of the Twenty-Second International FLAIRS Conference, Sanibel Island, Florida, USA, pp. 260–265. Association for the Advancement of Artificial Intelligence, Sundial Beach (2009)
Pang, B., Knight, K., Marcu, D.: Syntax-based alignment of multiple translations, Extracting Paraphrases and Generating New Sentences. In: Proceedings of HLT-NAACL, pp. 102–109 (2003)
Dolan, W.B., Brockett, C.: Automatically Constructing a Corpus of Sentential Paraphrases. In: Proceeding of the 3rd International Workshop on Paraphrase, Jeju island, Korea, pp. 9–16 (2005)
Zhang, Y., Patrick, J.: Paraphrase identification by text canonicalization. In: Proceedings of the Australasian Language Technology Workshop, Sydney, Australia, pp. 160–166 (2005)
Recasens, M., Vila, M.: On Paraphrase and Coreference. Computational Linguistics 36(4), 639–647 (2010)
Resnik, P.: Using Information Content to Evaluate Semantic Similarity in a Taxonomy. In: International Joint Conference on AI, pp. 448–453 (1995)
Dolan, B., Quirk, C., Brockett, C.: Unsupervised Construction of Large Paraphrase Corpora, Exploiting Massively Parallel News Sources. In: Proceeding of the 20th International Conference on Computational Linguistics, Geneva, Switzerland, pp. 350–356 (2004)
Toutanova, K., Manning, C.D.: Enriching the Knowledge Sources Used in a Maximum Entropy Part-of-Speech Tagger. In: Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, pp. 63–70 (2000)
Klein, D., Manning, C.D.: Accurate Unlexicalized Parsing. In: Proceedings of the 41st Meeting of the Association for Computational Linguistics, pp. 423–430 (2003)
Malakasiotis, P.: Paraphrase Recognition Using Machine Learning to Combine Similarity Measures. In: Proceedings of the ACL-IJCNLP 2009 Student Research Workshop, Suntec, Singapore, pp. 27–35 (2009)
Callison-Burch, C.: Syntactic Constraints on Paraphrases Extracted from Parallel Corpora. In: Proceeding EMNLP 2008 Proceedings of the Conference on Empirical Methods in Natural Language Processing, Stroudsburg, PA, USA, pp. 196–205 (2008)
Fellbaum, C.: WordNet: An Electronic Lexical Database. MIT Press (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Zhang, M., Zhang, H., Wu, D., Pan, X. (2014). Sentence Level Paraphrase Recognition Based on Different Characteristics Combination. In: Sun, M., Liu, Y., Zhao, J. (eds) Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data. NLP-NABD CCL 2014 2014. Lecture Notes in Computer Science(), vol 8801. Springer, Cham. https://doi.org/10.1007/978-3-319-12277-9_25
Download citation
DOI: https://doi.org/10.1007/978-3-319-12277-9_25
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-12276-2
Online ISBN: 978-3-319-12277-9
eBook Packages: Computer ScienceComputer Science (R0)