Skip to main content

Machine Translation and the Challenge of Patents

  • Chapter
  • First Online:

Part of the book series: The Information Retrieval Series ((INRE,volume 37))

Abstract

In this chapter, machine translation (MT) is first introduced in the context of patent information, and we touch upon what role it can play at various points in the intellectual property (IP) life cycle. We then step back to take a high-level look at what exactly defines MT, how it works, what makes it such a difficult task, as well as some of the more recent advances to overcome these hurdles and how we can go about ensuring that MT systems we develop are actually fit for purpose.

We then explore patent information as an application area for MT and describe how it presents a unique challenge not only for MT but for language technology in general. Finally, we take a closer look at some use cases involving MT and patents to show how they are already bringing significant value to consumers, but that there remains plenty of room for improvement.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   159.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. World Intellectual Property Organization (2013) PCT yearly review: the international patent system. WIPO, Geneva

    Google Scholar 

  2. Federmann C (2012) Hybrid machine translation using joint, binarised feature vectors. In: Proceedings of the 20th conference of the association for machine translation in the Americas. Association for Machine Translation in the Americas, San Diego

    Google Scholar 

  3. Machery W, Och FJ (2007) An empirical study on computing consensus translations from multiple machine translation systems. In: Proceedings of the 2007 joint conference on empirical methods in natural language processing and computational natural language learning (EMNLP-CoNLL). Association for Computational Linguistics, Prague, pp 986–995

    Google Scholar 

  4. Koehn P (2005) Europarl: a parallel corpus for statistical machine translation. In: Proceedings of the 10th machine translation summit, Phuket, pp 79–86

    Google Scholar 

  5. Lu B, Ka Pow C, Tsou BK (2011) The cultivation of a trilingual Chinese-English-Japanese parallel corpus from comparable patents. In: Proceedings of machine translation summit XIII, Xiamen, pp 472–479

    Google Scholar 

  6. Lu B, Tsou BK, Tao J, Oi Yee K, Zhu J (2010) Mining large-scale parallel corpora from multilingual patents: an English-Chinese example and its application to SMT. In: Proceedings of the 1st CIPS-SIGHAN joint conference on Chinese language processing (CLP-2010), Beijing, pp 79–86

    Google Scholar 

  7. Brown PF, Cocke J, Della-Pietra SA, Della-Pietra VJ, Jelinek F, Mercer RL et al (1988) A statistical approach to language translation. In: Proceedings of the 12th international conference on computational linguistics (CoLing). John von Neumann Society for Computing Sciences, Budapest, pp 71–76

    Google Scholar 

  8. Gale WA, Church KW (1991) A program for aligning sentences in bilingual corpora. In: Proceedings of the 29th annual meeting of the association for computational linguistics, Berkeley, pp 177–184

    Google Scholar 

  9. Och FJ, Ney H (2003) A systematic comparison of various statistical alignment models. Comput Linguist 29(1):19–51

    Article  MATH  Google Scholar 

  10. Tillmann C (2004) A unigram orientation model for statistical machine translation. In: Proceedings of human language technology conference and North American chapter of the association for computational linguistics annual meeting (HLT-NAACL), Boston, pp 101–104

    Google Scholar 

  11. Koehn P, Hoang H (2007) Factored translation models. In: Proceedings of the 2007 joint conference on empirical methods in natural language processing and computational natural language learning, Prague, pp 868–876

    Google Scholar 

  12. Quirk C, Menezes A, Cherry C (2005) Dependency treelet translation: syntactically informed phrasal SMT. In: 43rd Annual meeting of the association for computational linguistics, Ann Arbour, pp 271–279

    Google Scholar 

  13. Chiang D (2005) A hierarchical phrase-based model for statistical machine translation. 43rd Annual meeting of the association for computational linguistics, Ann Arbour, pp 263–270

    Google Scholar 

  14. Koehn P (2010) Statistical machine translation. Cambridge University Press, Cambridge

    MATH  Google Scholar 

  15. Banerjee P, Rubino R, Roturier J, van Genabith J (2013) Quality estimation-guided data selection for domain adaptation of SMT. In: Proceedings of the 14th machine translation summit, Nice, pp 101–108

    Google Scholar 

  16. Haddow B, Koehn P (2012) Analysing the effect of out-of-domain data on SMT systems. In: 7th workshop on statistical machine translation, Montreal, pp 422–432

    Google Scholar 

  17. Specia L, Raj D, Turchi M (2010) Machine translation evaluation versus quality estimation. Machine Transl 24(1):39–50

    Article  Google Scholar 

  18. Ueffing N, Ney H (2005) Word-class confidence estimation for machine translation using phrase-based translation models. In: Proceedings of human technology conference and conference on empirical methods in natural language processing, Vancouver, pp 763–770

    Google Scholar 

  19. He Y, Ma Y, van Genabith J, Way A (2010) Bridging SMT and TM with translation recommendation. In: The 48th annual meeting of the association for computational linguistics (ACL), Uppsala, pp 622–630

    Google Scholar 

  20. Mathur P, Cettolo M, Federico M (2013) Online learning approaches in computer assisted translation. In: 8th workshop on statistical machine translation, Sofia, pp 301–308

    Google Scholar 

  21. Potet M, Esperança-Rodier E, Blanchon H, Besacier L (2011) Preliminary experiments on using users’ post-editions to enhance a SMT system. In: Proceedings of the 15th international conference of the European association for machine translation (EAMT), Leuven, pp 161–168

    Google Scholar 

  22. Papineni K, Roukos S, Ward T, Zhu W-J (2002) BLEU: a method for automatic evaluation of machine translation. In: Proceedings of 40th annual meeting of the association for computational linguistics (ACL), Philadelphia, pp 311–318

    Google Scholar 

  23. Banerjee S, Lavie A (2005) METEOR: an automatic metric for MT evaluation with improved correlation with human judgments. In: Proceedings of the ACLT workshop on intrinsic and extrinsic evaluation measures for machine translation and/or summarization, Ann Arbor, pp 65–72

    Google Scholar 

  24. Turian JP, Shen L, Melamed ID (2003) Evaluation of machine translation and its evaluation. In: Proceedings of the 10th machine translation summit, New Orleans, pp 386–393

    Google Scholar 

  25. Callison-Burch C (2009) Fast, cheap, and creative: evaluating translation quality using Amazon’s Mechanical Turk. In: Proceedings of the 2009 conference on empirical methods in natural language processing, Singapore, pp 286–295

    Google Scholar 

  26. Rossi L, Wiggins D (2013) Applicability and application of machine translation quality metrics in the patent field. World Pat Inf 35(2):115–125

    Article  Google Scholar 

  27. O’Brien S (2003) Controlling controlled English: an analysis of several controlled language rule sets. In: The joint conference of the 8th international workshop of the European association for machine translation and the 4th controlled language applications workshop, Dublin, pp 105–114

    Google Scholar 

  28. Mügge U (2006) Fully automatic high quality machine translation of restricted text: a case study. In: Proceedings of the 28th international conference on translating and the computer, London

    Google Scholar 

  29. Roturier J (2006) An investigation into the impact of controlled English rules on the comprehensibility, usefulness, and acceptability of machine translated technical documentation for French and German users. Unpublished PhD Thesis, Dublin City University

    Google Scholar 

  30. Xiong H, Song L, Meng F, Lü Y, Liu Q (2011) The ICT’s patent MT system description for NTCIR-9. In: Proceedings of NTCIR-9 workshop meeting, Tokyo

    Google Scholar 

  31. Wu X, Matsuzaki T, Tsujii J (2011) SMT systems in the University of Tokyo for NTCIR-9 PatentMT. In: Proceedings of NTCIR-9 workshop meeting, Tokyo

    Google Scholar 

  32. Na H, Li J-J, Kim S-J, Lee J-H (2011) POSTECH’s statistical machine translation systems for NTCIR-9 PatentMT task (English-to-Japanese). In: Proceedings of NTCIR-9 workshop meeting, Tokyo

    Google Scholar 

  33. Wu P, Xu J, Yin Y, Zhang Y (2013) System description of BJTU-NLP MT for NTCIR-10 PatentMT. In: Proceedings of the 10th NTCIR conference on evaluation of information access technologies, Tokyo

    Google Scholar 

  34. Pouliquen B, Mazenc C (2011) COPPA, CLIR and TAPTA: three tools to assist in overcoming the patent barrier at WIPO. In: MT summit XIII: the thirteenth machine translation summit, Xiamen, pp 5–12

    Google Scholar 

  35. Tinsley J, Ceausu A, Zhang J, Depraetere H, Van de Walle J (2012) IPTranslator: facilitating patent search with machine translation. In: The tenth biennial conference of the association for machine translation in the Americas, San Diego

    Google Scholar 

  36. DePalma DA, Pielmeier H (2013) Great expectations for post-edited MT. Common Sense Advisory

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to John Tinsley .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer-Verlag GmbH Germany

About this chapter

Cite this chapter

Tinsley, J. (2017). Machine Translation and the Challenge of Patents. In: Lupu, M., Mayer, K., Kando, N., Trippe, A. (eds) Current Challenges in Patent Information Retrieval. The Information Retrieval Series, vol 37. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-53817-3_16

Download citation

  • DOI: https://doi.org/10.1007/978-3-662-53817-3_16

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-662-53816-6

  • Online ISBN: 978-3-662-53817-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics