Skip to main content

Sentence Level Paraphrase Recognition Based on Different Characteristics Combination

  • Conference paper
Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data (NLP-NABD 2014, CCL 2014)

Abstract

This paper has proposed a novel method based on different characteristics combination to do paraphrase recognition. We employ different measurements to weigh the lexical part and syntactic part due to that the different part of sentence makes distinguishing contribution to the sentence semantic during the task of paraphrase recognition. Our experiment is conducted by parsing the pair sentences of MSRPC first, then followed by adopting differentiated weights to calculate the power of different parts of the sentence.Through this method, we have obtained the outperform precision and average F value result compared with the previous approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Clough, P., et al.: MEasuring TExt Reuse. In: Proceedings of the 40th Anniversary Meeting for the Association for Computational Linguistics, Pennsylvania, PA, pp. 152–159 (2002)

    Google Scholar 

  2. Barzilay, R., Lee, L.: Learn to paraphrase, An Unsupervised Approach Using Multiple-Sequence Alignment. In: Proceedings of HLT-NAACL, pp. 16–23 (2003)

    Google Scholar 

  3. Malakasiotis, P., Androutsopoulos, I.: Learning Textual Entailment using SVMs and String Similarity Measures. In: Proceedings of the ACL-PASCAL Workshop on Textual Entailment and Paraphrasing, 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, pp. 42–47 (2007)

    Google Scholar 

  4. Fernando, S., Stevenson, M.: A Semantic Similarity Approach to Paraphrase Detection. Computational Linguistics (2008)

    Google Scholar 

  5. Erk, K., Pado, S.: Paraphrase assessment in structured vector space Exploring parameters and datasets. In: Proceeding of the 2nd European Conference on Computational Learning Theory, Athens, Greece, pp. 57–65 (2009)

    Google Scholar 

  6. Wan, S., et al.: Using dependency-based features to take the para-farce out of paraphrase. In: Proceedings of the 2006 Australasian Language Technology Workshop, pp. 131–138 (2006)

    Google Scholar 

  7. Qiu, L., Kan, M., Chua, T.: Paraphrase recognition via dissimilarity significance classification. In: EMNLP 2006 Association for Computational Linguistics, Sydney, pp. 18–26 (2006)

    Google Scholar 

  8. Socher, R., et al.: Dynamic Pooling and Unfolding Recursive Auto encoders for Paraphrase Detection. In: Conference of Neural Information Processing Systems Foundation (2011)

    Google Scholar 

  9. Lintean, M., Rus, V.: Paraphrase Identification Using Weighted Dependencies and Word Semantics. In: Proceedings of the Twenty-Second International FLAIRS Conference, Sanibel Island, Florida, USA, pp. 260–265. Association for the Advancement of Artificial Intelligence, Sundial Beach (2009)

    Google Scholar 

  10. Pang, B., Knight, K., Marcu, D.: Syntax-based alignment of multiple translations, Extracting Paraphrases and Generating New Sentences. In: Proceedings of HLT-NAACL, pp. 102–109 (2003)

    Google Scholar 

  11. Dolan, W.B., Brockett, C.: Automatically Constructing a Corpus of Sentential Paraphrases. In: Proceeding of the 3rd International Workshop on Paraphrase, Jeju island, Korea, pp. 9–16 (2005)

    Google Scholar 

  12. Zhang, Y., Patrick, J.: Paraphrase identification by text canonicalization. In: Proceedings of the Australasian Language Technology Workshop, Sydney, Australia, pp. 160–166 (2005)

    Google Scholar 

  13. Recasens, M., Vila, M.: On Paraphrase and Coreference. Computational Linguistics 36(4), 639–647 (2010)

    Article  Google Scholar 

  14. Resnik, P.: Using Information Content to Evaluate Semantic Similarity in a Taxonomy. In: International Joint Conference on AI, pp. 448–453 (1995)

    Google Scholar 

  15. Dolan, B., Quirk, C., Brockett, C.: Unsupervised Construction of Large Paraphrase Corpora, Exploiting Massively Parallel News Sources. In: Proceeding of the 20th International Conference on Computational Linguistics, Geneva, Switzerland, pp. 350–356 (2004)

    Google Scholar 

  16. Toutanova, K., Manning, C.D.: Enriching the Knowledge Sources Used in a Maximum Entropy Part-of-Speech Tagger. In: Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, pp. 63–70 (2000)

    Google Scholar 

  17. Klein, D., Manning, C.D.: Accurate Unlexicalized Parsing. In: Proceedings of the 41st Meeting of the Association for Computational Linguistics, pp. 423–430 (2003)

    Google Scholar 

  18. Malakasiotis, P.: Paraphrase Recognition Using Machine Learning to Combine Similarity Measures. In: Proceedings of the ACL-IJCNLP 2009 Student Research Workshop, Suntec, Singapore, pp. 27–35 (2009)

    Google Scholar 

  19. Callison-Burch, C.: Syntactic Constraints on Paraphrases Extracted from Parallel Corpora. In: Proceeding EMNLP 2008 Proceedings of the Conference on Empirical Methods in Natural Language Processing, Stroudsburg, PA, USA, pp. 196–205 (2008)

    Google Scholar 

  20. Fellbaum, C.: WordNet: An Electronic Lexical Database. MIT Press (1998)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Zhang, M., Zhang, H., Wu, D., Pan, X. (2014). Sentence Level Paraphrase Recognition Based on Different Characteristics Combination. In: Sun, M., Liu, Y., Zhao, J. (eds) Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data. NLP-NABD CCL 2014 2014. Lecture Notes in Computer Science(), vol 8801. Springer, Cham. https://doi.org/10.1007/978-3-319-12277-9_25

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-12277-9_25

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-12276-2

  • Online ISBN: 978-3-319-12277-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics