Abstract
This paper presents SPIDER, a system for paraphrasing in document editing and revision, with applicability to machine translation pre-editing. SPIDER applies linguistic knowledge (dictionaries and grammars) to create paraphrases of distinct linguistic phenomena. The first version of the tool (ReEscreve v01) was developed for Portuguese, but it is extensible to other languages and can also operate across languages. SPIDER has a completely new interface, new resources that cover a wider range of linguistic phenomena, and applicability to legal terminology, which is described here.
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Barreiro, A. (2011). SPIDER: A System for Paraphrasing in Document Editing and Revision — Applicability in Machine Translation Pre-editing. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2011. Lecture Notes in Computer Science, vol 6609. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19437-5_30
DOI: https://doi.org/10.1007/978-3-642-19437-5_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19436-8
Online ISBN: 978-3-642-19437-5
eBook Packages: Computer Science, Computer Science (R0)