Skip to main content

Building a Lexically and Semantically-Rich Resource for Paraphrase Processing

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 7614))

Abstract

In this paper, we present a methodology for building a lexically and semantically-rich resource for paraphrase processing on French. The paraphrase extraction model is rule-based and is guided by means of predicates. The extraction process comprises 4 main processing modules: 1. derived words extraction; 2. sentences extraction; 3. chunking & head word identification, and 4. predicate-argument structure mapping. We use the corpus provided by an agro-food industry enterprise to test the 4 modules of the paraphrase structures extractor. We explain how each processing module functions.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Kampeera, W., Cardey, S.: Paraphrases in Natural Language Processing. In: Proceedings of the 12th International Symposium on Social Communication - Comunicación Social en el Siglo XXI, vol. II, pp. 963–967. Santiago de Cuba, Cuba (2011)

    Google Scholar 

  2. Androutsopoulos, I., Malakasiotis, P.: A Survey of Paraphrasing and Textual Entailment Methods. Journal of Natural Language Processing 11, 151–198 (2009)

    Google Scholar 

  3. Hathout, N., Namer, F., Dal, G.: An Experimental Constructional Database: The MorTAL Project. In: Boucher, P. (ed.) Many Morphologies. Cascadilla, Somerville (2002)

    Google Scholar 

  4. Sajous, F., Navarro, E., Gaume, B., Prévot, L., Chudy, Y.: Semi-automatic Endogenous Enrichment of Collaboratively Constructed Lexical Resources: Piggybacking onto Wiktionary. In: Loftsson, H., Rögnvaldsson, E., Helgadóttir, S. (eds.) IceTAL 2010. LNCS, vol. 6233, pp. 332–344. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  5. Romary, L., Salmon-Alt, S., Francopoulo, G.: Standards going concrete: from LMF to Morphalou. In: Workshop on Electronic Dictionaries, Coling 2004, Geneva, Switzerland (2004)

    Google Scholar 

  6. Cardey, S., Greenfield, P.: Disambiguating and Tagging Using Systemic Grammar. In: Proceedings of the 8th International Symposium on Social Communication, pp. 559–564 (2009)

    Google Scholar 

  7. Fellbaum, C.: WordNet: An Electronic Lexical Database. MIT Press, Cambridge (1998)

    MATH  Google Scholar 

  8. Fišer, D., Sagot, B.: Combining Multiple Resources to Build Reliable Wordnets. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds.) TSD 2008. LNCS (LNAI), vol. 5246, pp. 61–68. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  9. Cardey, S., Greenfield, P., Bioud, M., Dziadkiewicz, A., Kuroda, K., Marcelino, I., Melian, C., Morgadinho, H., Robardet, G., Vienney, S.: The Classificatim Sense-Mining System. In: Salakoski, T., Ginter, F., Pyysalo, S., Pahikkala, T. (eds.) FinTAL 2006. LNCS (LNAI), vol. 4139, pp. 674–683. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  10. Lin, D., Pantel, P.: Discovery of Inference Rules for Question Answering. Natural Language Engineering 7(4), 343–360 (2001)

    Article  Google Scholar 

  11. Harris, Z.: Distributional Structure. In: Katz, J.J. (ed.) The Philosophy of Linguistics, pp. 26–47. Oxford University Press, New York (1985)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kampeera, W., Cardey-Greenfield, S. (2012). Building a Lexically and Semantically-Rich Resource for Paraphrase Processing. In: Isahara, H., Kanzaki, K. (eds) Advances in Natural Language Processing. JapTAL 2012. Lecture Notes in Computer Science(), vol 7614. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33983-7_14

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-33983-7_14

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-33982-0

  • Online ISBN: 978-3-642-33983-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics