Skip to main content

Abstract

This chapter presents information technology ( ) based patent retrieval models. It first compares and contrasts information retrieval ( ) with patent retrieval, and highlights their key differences. For instance, IR can be considered as a precision-oriented retrieval, whereas patent retrieval can be considered as a recall-oriented retrieval. The chapter then describes the boolean retrieval model, which was designed for IR but can be used for patent retrieval. To facilitate effective patent retrieval, a basic patent retrieval model is presented. With this model, representative keyword terms are extracted from the user query and are ranked according to their importance so that top-\(k\) relevant patents can be retrieved with irrelevant patents eliminated. Moreover, the chapter also presents some enhancements and extensions to the basic patent retrieval model, which include incorporation of relevance feedback, estimation of the importance of keyword terms, text preprocessing of patent documents, and handling of patent category frequency. In addition, two dynamic patent retrieval models are also described. These two models perform interactive patent retrieval via dispersion or accumulation to dynamically rank the patents. Experimental results with real-life datasets show that the models presented in this chapter outperformed many conventional search systems with respect to time and cost. While this chapter focuses on the theoretical aspects of IT based patent retrieval models which are of interest to IT specialists, practical illustrative examples in the chapter demonstrate the empirical aspects of patent retrieval models which are helpful to IT practitioners.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 299.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 379.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • U.S. Patent and Trademark Office: U.S. patent activity calendar years 1790 to the present, https://www.uspto.gov/web/offices/ac/ido/ oeip/taf/h_counts.htm

  • World Intellectual Property Organisation: International Patent Classification (IPC), http://www.wipo.int/classifications/ipc/en/ (2018)

  • A. Cuzzocrea, W. Lee, C.K. Leung: High-recall information retrieval from linked big data. In: Proc. IEEE COMPSAC, Vol. 2, ed. by S.I. Ahamed, C.K. Chang, W. Chu, I. Crnkovic, P. Hsiung, G. Huang, J. Yang (IEEE Computer Society, Los Alamitos 2015) pp. 712–717

    Google Scholar 

  • S. Adams: A practitioner's view on PaIR. In: Proc. PaIR 2011, ed. by M. Lupu, A. Rauber, A. Hanbury (ACM, New York 2011) pp. 37–38

    Google Scholar 

  • C.J.V. Rijsbergen: Information Retrieval, 2nd edn. (Butterworth-Heinemann, Newton 1979)

    Google Scholar 

  • E. Zhang, Y. Zhang: Average precision. In: Encyclopedia of Database Systems, ed. by L. Liu, M.T. Özsu (Springer, New York 2009) pp. 192–193

    Google Scholar 

  • W.B. Frakes, R. Baeza-Yates (Eds.): Information Retrieval: Data Structures and Algorithms (Prentice-Hall, Upper Saddle River 1992)

    Google Scholar 

  • M. Kobayashi, K. Takeda: Information retrieval on the web, ACM Comput. Surv. 32(2), 144–173 (2000)

    Article  Google Scholar 

  • A. Lashkari, F. Mahdavi, V. Ghomi: A boolean model in information retrieval for search engines. In: Proc. ICIME 2009 (IEEE Computer Society, Los Alamitos 2009) pp. 85–389

    Google Scholar 

  • W. Lee, C.K. Leung, J.J. Song: Reducing noises for recall-oriented patent retrieval. In: Proc. BD Cloud 2014, ed. by J. Chen, L.T. Yang (IEEE Computer Society, Los Alamitos 2014) pp. 579–586

    Google Scholar 

  • R. Fagin, A. Lotem, M. Naor: Optimal aggregation algorithms for middleware. In: Proc. PODS 2001 (ACM, New York 2001) pp. 102–113

    Google Scholar 

  • J.J. Song, W. Lee: High recall-low cost model for patent retrieval. In: Proc. Big DAS 2015, ed. by C.K. Leung, A. Nasridinov (ACM, New York 2015) pp. 213–216

    Google Scholar 

  • C.D. Manning, P. Raghavan, H. Schütze: Introduction to Information Retrieval (Cambridge Univ. Press, New York 2008)

    Book  Google Scholar 

  • J.J. Rocchio: Relevance Feedback in Information Retrieval (Prentice, Englewood Cliffs 1971)

    Google Scholar 

  • S. Kullback, R.A. Leibler: On information and sufficiency, Ann. Math. Statistics 22(1), 79–86 (1951)

    Article  Google Scholar 

  • K. Toutanova, D. Klein, C.D. Manning, Y. Singer: Feature-rich part-of-speech tagging with a cyclic dependency network. In: Proc. NAACL 2003 (Association for Computational Linguistics, Stroudsburg 2003) pp. 173–180

    Google Scholar 

  • S. Chakrabarti: Mining the Web: Discovering Knowledge from Hypertext Data (Morgan-Kaufmann, San Francisco 2002)

    Google Scholar 

  • J.W. Reed, Y. Jiao, T.E. Potok, B.A. Klump, M.T. Elmore, A.R. Hurson: TF-ICF: A new term weighting scheme for clustering dynamic data streams. In: Proc. ICMLA 2006, ed. by A. Wani, T. Li, L. Kurgan, J. Ye, Y. Liu (IEEE Computer Society, Los Alamitos 2006) pp. 258–263

    Google Scholar 

  • J.J. Song, W. Lee, J. Afshar: Retrieving patents with inverse patent category frequency. In: Proc. Big Comp 2016 (IEEE, Piscataway 2016) pp. 109–114

    Google Scholar 

  • J. Wu, Y. Chen, D. Dai, S. Chen, X. Wang: Clustering-based geometrical structure retrieval of man-made target in SAR images, IEEE Geosci. Remote Sens. Lett. 14(3), 279–283 (2017)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Carson Leung .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer International Publishing AG, part of Springer Nature

About this chapter

Cite this chapter

Leung, C., Lee, W., Song, J.J. (2019). Information Technology-Based Patent Retrieval Models. In: Glänzel, W., Moed, H.F., Schmoch, U., Thelwall, M. (eds) Springer Handbook of Science and Technology Indicators. Springer Handbooks. Springer, Cham. https://doi.org/10.1007/978-3-030-02511-3_34

Download citation

Publish with us

Policies and ethics