Machine Translation and the Challenge of Patents

Tinsley, John

doi:10.1007/978-3-662-53817-3_16

Chapter
First Online: 26 March 2017

Part of the book series: The Information Retrieval Series (INRE, volume 37)

Abstract

In this chapter, machine translation (MT) is first introduced in the context of patent information, and we touch upon the role it can play at various points in the intellectual property (IP) life cycle. We then step back to take a high-level look at what defines MT, how it works, and what makes it such a difficult task. We also cover some of the more recent advances made to overcome these hurdles, and how we can ensure that the MT systems we develop are actually fit for purpose.

We then explore patent information as an application area for MT and describe how it presents a unique challenge not only for MT but for language technology in general. Finally, we take a closer look at some use cases involving MT and patents to show that they already bring significant value to consumers, but that plenty of room for improvement remains.

Author information

Authors and Affiliations

Iconic Translation Machines Ltd., Invent DCU, Glasnevin, Dublin 9, Ireland
John Tinsley

Corresponding author

Correspondence to John Tinsley.

Editor information

Editors and Affiliations

Institute for Software Engineering & Interactive Systems, Vienna University of Technology, Vienna, Austria
Mihai Lupu
Research Platform Responsible Research and Innovation in Academic Practice, University of Vienna, Vienna, Austria
Katja Mayer
Information & Society Research Division, National Institute of Informatics, Tokyo, Japan
Noriko Kando
Patinformatics, LLC, Dublin, Ohio, USA
Anthony J. Trippe

About this chapter

Cite this chapter

Tinsley, J. (2017). Machine Translation and the Challenge of Patents. In: Lupu, M., Mayer, K., Kando, N., Trippe, A. (eds) Current Challenges in Patent Information Retrieval. The Information Retrieval Series, vol 37. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-53817-3_16

DOI: https://doi.org/10.1007/978-3-662-53817-3_16
Published: 26 March 2017
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-53816-6
Online ISBN: 978-3-662-53817-3
eBook Packages: Computer Science; Computer Science (R0)
