Abstract
This chapter presents information technology ( ) based patent retrieval models. It first compares and contrasts information retrieval ( ) with patent retrieval, and highlights their key differences. For instance, IR can be considered as a precision-oriented retrieval, whereas patent retrieval can be considered as a recall-oriented retrieval. The chapter then describes the boolean retrieval model, which was designed for IR but can be used for patent retrieval. To facilitate effective patent retrieval, a basic patent retrieval model is presented. With this model, representative keyword terms are extracted from the user query and are ranked according to their importance so that top-\(k\) relevant patents can be retrieved with irrelevant patents eliminated. Moreover, the chapter also presents some enhancements and extensions to the basic patent retrieval model, which include incorporation of relevance feedback, estimation of the importance of keyword terms, text preprocessing of patent documents, and handling of patent category frequency. In addition, two dynamic patent retrieval models are also described. These two models perform interactive patent retrieval via dispersion or accumulation to dynamically rank the patents. Experimental results with real-life datasets show that the models presented in this chapter outperformed many conventional search systems with respect to time and cost. While this chapter focuses on the theoretical aspects of IT based patent retrieval models which are of interest to IT specialists, practical illustrative examples in the chapter demonstrate the empirical aspects of patent retrieval models which are helpful to IT practitioners.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
U.S. Patent and Trademark Office: U.S. patent activity calendar years 1790 to the present, https://www.uspto.gov/web/offices/ac/ido/ oeip/taf/h_counts.htm
World Intellectual Property Organisation: International Patent Classification (IPC), http://www.wipo.int/classifications/ipc/en/ (2018)
A. Cuzzocrea, W. Lee, C.K. Leung: High-recall information retrieval from linked big data. In: Proc. IEEE COMPSAC, Vol. 2, ed. by S.I. Ahamed, C.K. Chang, W. Chu, I. Crnkovic, P. Hsiung, G. Huang, J. Yang (IEEE Computer Society, Los Alamitos 2015) pp. 712–717
S. Adams: A practitioner's view on PaIR. In: Proc. PaIR 2011, ed. by M. Lupu, A. Rauber, A. Hanbury (ACM, New York 2011) pp. 37–38
C.J.V. Rijsbergen: Information Retrieval, 2nd edn. (Butterworth-Heinemann, Newton 1979)
E. Zhang, Y. Zhang: Average precision. In: Encyclopedia of Database Systems, ed. by L. Liu, M.T. Özsu (Springer, New York 2009) pp. 192–193
W.B. Frakes, R. Baeza-Yates (Eds.): Information Retrieval: Data Structures and Algorithms (Prentice-Hall, Upper Saddle River 1992)
M. Kobayashi, K. Takeda: Information retrieval on the web, ACM Comput. Surv. 32(2), 144–173 (2000)
A. Lashkari, F. Mahdavi, V. Ghomi: A boolean model in information retrieval for search engines. In: Proc. ICIME 2009 (IEEE Computer Society, Los Alamitos 2009) pp. 85–389
W. Lee, C.K. Leung, J.J. Song: Reducing noises for recall-oriented patent retrieval. In: Proc. BD Cloud 2014, ed. by J. Chen, L.T. Yang (IEEE Computer Society, Los Alamitos 2014) pp. 579–586
R. Fagin, A. Lotem, M. Naor: Optimal aggregation algorithms for middleware. In: Proc. PODS 2001 (ACM, New York 2001) pp. 102–113
J.J. Song, W. Lee: High recall-low cost model for patent retrieval. In: Proc. Big DAS 2015, ed. by C.K. Leung, A. Nasridinov (ACM, New York 2015) pp. 213–216
C.D. Manning, P. Raghavan, H. Schütze: Introduction to Information Retrieval (Cambridge Univ. Press, New York 2008)
J.J. Rocchio: Relevance Feedback in Information Retrieval (Prentice, Englewood Cliffs 1971)
S. Kullback, R.A. Leibler: On information and sufficiency, Ann. Math. Statistics 22(1), 79–86 (1951)
K. Toutanova, D. Klein, C.D. Manning, Y. Singer: Feature-rich part-of-speech tagging with a cyclic dependency network. In: Proc. NAACL 2003 (Association for Computational Linguistics, Stroudsburg 2003) pp. 173–180
S. Chakrabarti: Mining the Web: Discovering Knowledge from Hypertext Data (Morgan-Kaufmann, San Francisco 2002)
J.W. Reed, Y. Jiao, T.E. Potok, B.A. Klump, M.T. Elmore, A.R. Hurson: TF-ICF: A new term weighting scheme for clustering dynamic data streams. In: Proc. ICMLA 2006, ed. by A. Wani, T. Li, L. Kurgan, J. Ye, Y. Liu (IEEE Computer Society, Los Alamitos 2006) pp. 258–263
J.J. Song, W. Lee, J. Afshar: Retrieving patents with inverse patent category frequency. In: Proc. Big Comp 2016 (IEEE, Piscataway 2016) pp. 109–114
J. Wu, Y. Chen, D. Dai, S. Chen, X. Wang: Clustering-based geometrical structure retrieval of man-made target in SAR images, IEEE Geosci. Remote Sens. Lett. 14(3), 279–283 (2017)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer International Publishing AG, part of Springer Nature
About this chapter
Cite this chapter
Leung, C., Lee, W., Song, J.J. (2019). Information Technology-Based Patent Retrieval Models. In: Glänzel, W., Moed, H.F., Schmoch, U., Thelwall, M. (eds) Springer Handbook of Science and Technology Indicators. Springer Handbooks. Springer, Cham. https://doi.org/10.1007/978-3-030-02511-3_34
Download citation
DOI: https://doi.org/10.1007/978-3-030-02511-3_34
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-02510-6
Online ISBN: 978-3-030-02511-3
eBook Packages: Economics and FinanceEconomics and Finance (R0)