Abstract
In an attempt to improve retrieval systems for the patent domain, significant efforts are invested to assist researchers in formulating better queries, preferably via automated query generation. Current work on query generation in patent retrieval mostly selects query terms using statistical measures, without considering whether these terms are the best choice. To learn from actual queries posed by experts, we analyze query logs from USPTO patent examiners. Results show that US examiners pick the majority of query terms from the claim section, and that a large fraction of these terms coincide with the subject feature terms, which determine the extent of protection conferred by the patent right. Applying the lessons learned from evaluating existing search logs will help improve (semi-)automated query generation.
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tannebaum, W., Rauber, A. (2012). Analyzing Query Logs of USPTO Examiners to Identify Useful Query Terms in Patent Documents for Query Expansion in Patent Searching: A Preliminary Study. In: Salampasis, M., Larsen, B. (eds) Multidisciplinary Information Retrieval. IRFC 2012. Lecture Notes in Computer Science, vol 7356. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31274-8_11
DOI: https://doi.org/10.1007/978-3-642-31274-8_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31273-1
Online ISBN: 978-3-642-31274-8
eBook Packages: Computer Science, Computer Science (R0)